PREFACE TO THE SECOND EDITION


FROM its inception, the aim of this book has been to bring the reader
to an intermediate level of attainment in the main branches of 
theoretical physics from which he may be able to proceed, with the 
help of special literature, to the field of research. Experience shows that 
it is often the first contact with a complex problem that presents the 
major difficulty, and it is here that help is most urgently needed. 
Nevertheless, it is necessary to make a certain selection from the many 
topics that seem to offer promise of development. For example, there 
is the question of whether an extensive treatment of the Hamilton-Jacobi
mechanics, which at the time of the first edition (1932) formed
the basis of the old atom theory, is still advisable. The decision to 
retain it is founded on the conviction that only in this way is the com- 
pelling and logical evolution of atomic physics in the form of wave 
mechanics clearly seen. 

Ensuing revisions of the book have made possible the recasting of 
the presentation of many topics. The subject of nuclear physics re- 
quired, of course, the most extensive changes. In this connexion it is 
felt that a concise survey of the main lines of cosmic ray research is 
now in order, since knowledge in this field has progressed far beyond 
the stage of mere speculation. In this brief presentation much of the 
great mass of experimental material has been omitted, and it is hoped
that specialists in this field will not be too critical of what is apparently
a somewhat oversimplified picture. Geometric optics has been given 
more space and the theory of the top has been modernized on the 
basis of a treatment suggested by Professor Bauersfeld.

The range of subject matter has been extended by the inclusion of
selected topics in what might be called “applied theoretical physics”.
To have included these items in the respective chapters dealing with
[…] would have interrupted the continuity of development, and they
have been grouped as a separate Part of the book. In response to
numerous suggestions, a Mathematical Addendum on the […] of Bessel
Functions and Spherical Harmonics has been prepared. This material has
been placed intentionally at the end of the work rather than in the
Mathematical Introduction, inasmuch as it demands a somewhat higher
order of computational skill on the part of the reader.

Other changes include: (a) introduction of the M.K.S. system,
chiefly in the formulas of macroscopic electromagnetism (there is no 
reason to change the familiar numerical relations in atomic physics); 
(b) addition of a considerable number of new exercises; (c) revision of 
the numerical values of physical constants; and (d) extension of the 
list of references for further study. It is hoped that these alterations 
will increase the usefulness of the work in its dual role of text and 
reference book. 

G. J. 

I. M. F. 

Boston, Mass., and 
New Brunswick, N.J., 

U.S.A. 

April, 1950.
In this revision there are two additional chapters: (1) Phenomenological
Theory of Superconductivity, (2) Theory of Elastomers; and
the chapter on Nuclear Physics is considerably modified. A section on 
Fundamentals of the Matrix Calculus has been added to the first 
chapter and one on The Role of Lattice Defects in Dielectric Crystals 
to Chapter XLI. The Table of Physical Constants has also been revised. 
THEORETICAL PHYSICS


INTRODUCTION 

The Purpose and Methods of Theoretical Physics 

Purely experimental physics confines its activities to the exhibition 
of natural phenomena and the careful description of what occurs. 
One might, for example, observe the luminous effects produced by the 
passage of an electric current through a rarefied gas, record what is 
observed, and stop at that point; but to-day a procedure of this kind 
is justly regarded as being unsatisfactory. As soon as we inquire into 
the reasons for the phenomena, we enter the domain of theory, which, 
on the basis of hypotheses admitting of more or less direct test, connects
the observed phenomena and traces them back to single “pure”
phenomena, thus bringing about a logical arrangement of an enormous 
amount of observational material. 

In the given example of electrical discharge in gases, an “ explana- 
tion” — i.e. a reference back to simple elements — requires theoretical 
concepts concerning the construction of the atom from electric charges 
and the mechanism of the production of charge-bearing particles. We 
thus find that the explanation of the phenomenon is far from simple, 
and that many effects operate simultaneously. Consequently, from 
the theoretical standpoint, we cannot look upon the phenomenon as 
“ simple ”, however easy it is to produce experimentally. Theory 
alone can decide whether the experimental conditions are pure, i.e. 
simple from a theoretical point of view. In the present example, an 
experiment in which electrons of measurable uniform velocity collide 
with gas atoms may be regarded as simple, in spite of the considerable 
experimental difficulties, since on the basis of such experiments one 
can attempt to give an account of the more complex phenomena of 
the glow discharge. 

While it is true that theory often sets difficult, if not impossible 
tasks for experiment, it does, on the other hand, often lighten the 
work of the experimenter by disclosing cogent relationships which 
make possible the indirect determination of inaccessible quantities, 
and thus render difficult measurements unnecessary. If it is required, 
for example, to measure the reflecting power of a certain metal, the
direct determination would require troublesome and withal not very 
accurate photometric measurements. However, if we are familiar with 
the theory of metallic reflection (p. 346), we will measure, instead, two 
angles — the principal angle of incidence and the principal azimuth — 
from which the value of the reflecting power is completely determined. 

Whence does theory obtain its hypotheses concerning the connexions 
between individual physical quantities and events? In the last analysis, 
only from experiment. It is the art of the theorist to deduce from the 
data at hand, connexions of the most far-reaching significance, and 
to draw therefrom conclusions which suggest new experiments. 
There does not exist a theory which draws only upon itself, and which 
has no contact with experimental results. The number of possibilities 
is too large for even the greatest genius to be able to construct, by 
unaided intuition, a world-picture which agrees with experience. 
Because of their neglect of experiment, the scientists (natural philo- 
sophers) of ancient times exhausted themselves in speculations which 
only proved of lasting value where they were connected with obser- 
vation, as in Astronomy and Mechanics. A train of thought of the 
nature of the well-known Hegelian dialogue * seems quite absurd to 
us to-day, despite the fact that when contradictions between experi- 
ment and the consequences of a well-founded theory arise, we look for 
a mistake in the experimental procedure as often as we suspect an 
error in the calculations. In such a case, however, the theory has been 
set up to begin with so as to agree with observational facts. But while 
it is true that the first task of theory is to disclose relationships, an 
equally important part of its work is to formulate these relationships 
mathematically. Mathematics is the outstanding tool of the theorist. 
Its use represents a rationalization of thought, in that the process of 
deriving important conclusions from the initial hypotheses runs in 
the trustworthy channels of rules of calculation learned once for all. 
At the same time, one must not lose sight of the meaning of the com- 
putations. The beginner, especially, is inclined to ignore the physical 
meaning of his work, as contrasted with the purely formal calcu- 
lations. The position occupied by mathematics in the field of theoretical 
physics implies that it is not the problem of the theoretical physicist 
to devise mathematical proofs. He must rely upon the soundness of 
the tool delivered to him by the mathematician. Even in a case where, 
occasionally, he must construct his own tool, he need not trouble 
about mathematical existence theorems, provided the result is suffi- 
ciently evident on physical grounds. Indeed, the strict requirements 
of pure mathematics often contradict the physical facts. For example, 
if density is defined as the limiting value of the ratio of mass to volume
as smaller and smaller volumes are taken, the existence of atoms leads
to meaningless density fluctuations, the values depending upon whether
the volume element encloses an atomic nucleus or not. The “small
quantities” prescribed by strict mathematics are indispensable in
physics. Although we reckon with them as if they were “infinitely
small” quantities, physical differentials have a not too small value.
For such reasons, the ability to use the calculus as a tool is far more
important to a physicist than a knowledge of strict concepts such as
the limit mentioned above.

* “… seven planets.” “… this is contrary to fact.” “So much …”

Theoretical and experimental physics cannot be separated, and 
a flawless, logically ordered picture of nature is obtained only by the 
application of both methods jointly. A rationally applied theory may 
have direct and immediate practical significance. If it is of impor- 
tance to obtain the highest possible efficiency in a large-scale technical 
process, there is only one way to proceed: we must ascertain the most 
favourable conditions by exact calculations based on the theory; 
intuition is often deceptive. One need only recall, for example, the 
original pointed form of the airship, which seems to the great majority 
of people, if they are unfamiliar with hydrodynamics, to be the most 
effective form for cutting through the air. 

Thus an exact knowledge of natural laws is sooner or later trans- 
muted into economic advantages, which, as a sort of “ premium on 
intelligence ” *, are gained by any society which organizes its industrial 
processes on rational principles. 

* This expression was coined by Schottky in the same connexion.
In this textbook the fundamental principles of the differential and integral
calculus are assumed to be known. Any mathematics required beyond this point
is collected in four preliminary chapters, in order to avoid later interruption of
the physical train of thought by auxiliary mathematical discussions. A branch
of mathematics of especial value to theoretical physics is Vector Analysis, which
has been developed largely by physicists. Its concepts have immediate perceptual
meaning, so that many physical laws show their full significance only when
stated in the language of vectors. It is therefore quite a mistaken view to regard 
vector analysis merely as a computational shorthand. 



CHAPTER I 
Vector Analysis 


1. The Concept of a Vector.

Besides such magnitudes as temperature, mass, and so on, which 
are characterized by the assignment of a single number, and which 
are called scalars, there also occur in physics quantities which are not
completely defined by the specification of a single number. The most 
important group of these is characterized by the fact that, besides the 
magnitude, the direction of the quantity must also be given.

Fig. 1    Fig. 2

If, for example, we prescribe that an object, located by means of its
centre of gravity P, be given a displacement of 2 cm., then the point
P' occupied by the object after displacement may be anywhere on the
surface of a sphere of radius 2 cm. with centre at P. The new position
of the centre of gravity will not be determined uniquely unless we
specify also the direction of the displacement in some manner. Two
displacements of a point may be combined into a single equivalent
displacement by a simple method (fig. 1). Instead of moving the point
P to P' in a certain direction, and then displacing it in a new
direction to P'', it might have been moved directly along the third
side of the triangle PP'P'' to P''. Since this process may be repeated,
we can use it to characterize a displacement in still another way. We
place P at the origin of a rectangular co-ordinate system, and move the
point first along the x-axis to $P_1$, a distance a, then parallel to
the y-axis a distance b, to $P_2$, and finally from here parallel to
the z-axis to P', a distance c (fig. 2). Instead of this, we might have
moved P at once in the direction PP' a distance

$$d = \sqrt{a^2 + b^2 + c^2}.$$

We see, then, that the displacement of P is uniquely determined by 
specifying the three numbers a, b and c, which are called components 
of the displacement. 
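As a small numerical illustration of the composition just described, the following Python sketch computes the length of the single displacement equivalent to three successive displacements along the axes. The function name `displacement_length` and the sample values are assumptions made only for this illustration.

```python
import math

def displacement_length(a, b, c):
    """Length of the single displacement equivalent to successive
    displacements a, b, c along the x-, y- and z-axes."""
    return math.sqrt(a * a + b * b + c * c)

# Hypothetical components in cm, chosen only for illustration.
a, b, c = 1.0, 2.0, 2.0
d = displacement_length(a, b, c)
print(d)  # 3.0
```

The three numbers a, b, c fix the displacement completely; the length d alone does not, since many different triples give the same d.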

A large number of physical quantities follow the same law of com- 
bination as the displacement of a point, and may, therefore, also be 
defined by three numbers. Such quantities are called vectors. We 
may represent a vector diagrammatically by means of a directed line 
segment (arrow), whose length, on any convenient scale, is numeri- 
cally equal to the magnitude of the physical quantity, and whose 
direction is that of the physical quantity. This geometric represen- 
tation gives vector quantities an important advantage over other 
non-scalar quantities arising in physics, e.g. tensors, for which no 
such simple model exists. 

Two vectors are said to be equal if they are identical with respect 
to both direction and magnitude. Since a pure displacement, i.e. a 
displacement without rotation, does not alter the direction, two vec- 
tors may still be equal, even when they lie in different (but parallel) 
lines. Physical quantities which are altered by pure displacement are 
therefore not immediately representable by vectors. To this class 
belongs, for example, the force acting on a rigid body capable of ro- 
tating about an axis: parallel displacement alters the lever arm of the 
force, and thus changes its effect. It is to be noted, further, that not 
every physical quantity which can be represented uniquely by a 
directed line segment may be treated as a vector; it is also necessary
to investigate whether this quantity follows the Law of Composition 
obeyed by the displacement of a point. For instance, the rotation of 
a rigid body about an axis may be represented by an arrow whose 
direction is that of the axis of rotation, and whose length in centi- 
metres is numerically equal to the rotation in degrees. Moreover, the 
direction of the arrow may be specified so that the rotation is clock- 
wise when sighting in this direction. But we do not obtain a third 
rotation equivalent to the joint effect of two rotations by combining, 
as above, their representative arrows; the vectorial composition may 
be shown to be valid only for infinitesimal rotations. 
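The failure of finite rotations to combine like displacements may be checked numerically. The sketch below is an illustration under stated assumptions, not a construction from the text: it uses ordinary rotation matrices about the x- and z-axes (the helper names `rot_x`, `rot_z`, `apply_matrix` and `order_gap` are hypothetical) and compares the end points reached when the two rotations are carried out in opposite orders.

```python
import math

def rot_x(theta):
    """Matrix of a rotation through theta (radians) about the x-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def rot_z(theta):
    """Matrix of a rotation through theta (radians) about the z-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def apply_matrix(m, v):
    """Product of the 3x3 matrix m with the column vector v."""
    return [sum(m[i][k] * v[k] for k in range(3)) for i in range(3)]

def order_gap(v, theta):
    """Distance between the points reached by rotating v about x then z,
    and about z then x, each time through the same angle theta."""
    x_then_z = apply_matrix(rot_z(theta), apply_matrix(rot_x(theta), v))
    z_then_x = apply_matrix(rot_x(theta), apply_matrix(rot_z(theta), v))
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(x_then_z, z_then_x)))

v = [1.0, 0.0, 0.0]
large = order_gap(v, math.pi / 2)  # quarter turns: the order matters
small = order_gap(v, 1e-3)         # very small turns: the order hardly matters
print(large, small)
```

For quarter turns the two end points lie far apart, whereas for an angle of 0.001 radian the discrepancy is of the order of the square of the angle, in keeping with the statement that the vectorial composition holds only for infinitesimal rotations.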

In this book, vectors will be designated by heavy letters (bold-face 
or Clarendon type). 

2. Addition and Subtraction of Vectors; Multiplication of a Vector by a Scalar.

The composition of two displacements of a point, described in
§ 1 (p. 7), has, as will be shown at once, all the characteristics of a
summation. In general, we define, therefore, as the sum of two vectors
A and B, a vector C obtained by laying off the vector B from the terminal
point of A and then drawing a vector from the initial point of
A to the end point of B (fig. 3). This combination is written as a vector
equation

$$\mathbf{C} = \mathbf{A} + \mathbf{B} \tag{1}$$

Fig. 3

A vector equation is equivalent to three scalar equations, since, as was 
shown in § 1 (p. 8), a vector is determined by specifying its three 
components, and two vectors are equal 
only if the three components are respectively
equal. For if any vector is constructed
as the combination of three
vectors respectively equal to its components,
two equal vectors can be
obtained only if their corresponding
components are identical.

This geometric sum is commutative, like an ordinary sum; i.e. the 
value is independent of the order in which the summation is 
performed. As is seen from fig. 3, we 
obtain the same vector, apart from the 
immaterial parallel displacement, by laying 
off A from the end point of B and drawing 
a line from the initial point of B to the 
terminus of A. Further, a vector sum is 
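associative; i.e. in the case of a sum of
several vectors, the individual terms may
be grouped in any arbitrary manner:

$$\mathbf{S} = (\mathbf{A} + \mathbf{B}) + \mathbf{C} = \mathbf{A} + (\mathbf{B} + \mathbf{C}) = \mathbf{B} + (\mathbf{A} + \mathbf{C}) \tag{2}$$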

The proof of this is evident from fig. 4. 

The sum of two vectors having the same direction and sense is obtained
by addition of their lengths, the direction remaining the same. From
this follows at once the definition of the product of a vector and a
pure number. According to the meaning of multiplication, mA means the
sum of m terms, each one being A, i.e. a vector having the same
direction, but m times as long. This leads to an important
representation for a vector: if the direction is specified by a vector
of length 1, any vector having this direction may be represented by
multiplying this unit vector e by the magnitude of A, which may be
designated by the corresponding Roman letter, or by |A|:

$$\mathbf{A} = A\mathbf{e} = |\mathbf{A}|\,\mathbf{e} \tag{3}$$

If we think of a vector as being compounded from three vectors having 
the directions of the axes of a rectangular co-ordinate system, and if
we denote the lengths of these vectors (which we shall call the rectangular
components of A), by $A_x$, $A_y$, $A_z$, and if i, j, k are the unit
vectors in the directions of the axes, then the original vector is given,
in terms of its components, as

$$\mathbf{A} = A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k}. \tag{4}$$

The difference of the vectors C and A is a vector B which when 
added to A gives the vector C. As is seen from fig. 3 (p. 9), we obtain 
B by laying off A, with its direction reversed, from the terminus of C, 
and then connecting the initial point of C with the end point of A. 
From this it follows at once that the meaning of the multiplication of
a vector by (−1) is merely reversal of direction, for, retaining the
formal laws of ordinary arithmetic, we may look upon the subtraction
of the vector A as the addition of a vector −A.
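The rules of this section may be tried out on vectors given by their rectangular components. The following Python sketch assumes that the sum of two vectors is formed by adding corresponding components; the helper names `add`, `scale` and `sub` and the sample vectors are hypothetical.

```python
def add(a, b):
    """Vector sum, formed component by component."""
    return tuple(x + y for x, y in zip(a, b))

def scale(m, a):
    """Product of the pure number m with the vector a."""
    return tuple(m * x for x in a)

def sub(c, a):
    """Difference c - a, formed as the sum of c and (-1)a."""
    return add(c, scale(-1, a))

# Hypothetical vectors given by their rectangular components.
A = (1, 2, 3)
B = (-4, 0, 5)
C = (2, -1, 1)

print(add(A, B) == add(B, A))                  # commutative: True
print(add(add(A, B), C) == add(A, add(B, C)))  # associative: True
print(sub(add(A, B), A) == B)                  # (A + B) - A gives B: True
```

Integer components are used so that the comparisons are exact.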

Ex. 1. Express, by means of an equation, the fact that three vectors A, B,
and C form a closed triangle, all three vectors describing the perimeter in the 
same direction. 

Ex. 2. Express each of the following by an equation: 

(a) Two vectors A and B are parallel.

(b) Three vectors A, B and C are coplanar.

3. The Scalar Product of two Vectors. 

In physics there arise certain combinations of vectors which possess 
the more important properties of products. We define as the scalar
product of two vectors, a number (scalar) equal to the product of the
magnitudes of the two vectors multiplied by the cosine of the angle included
between them. The scalar product is positive if the included angle is
acute, and negative if the angle is obtuse. Then, if we agree to indicate
the scalar product by writing the two vectors alongside each other,*
the definition of the scalar product reads

$$\mathbf{A}\mathbf{B} = AB\cos(\mathbf{A}\mathbf{B}), \tag{5}$$

where (AB) is the angle included between the two vectors. Since 
A cos(AB) is equal to the projection of the vector A upon the direction 
of B, the scalar product may also be defined as the product of the 
magnitude of one vector by the projection of the other upon it. A 
simple expression follows for this projection of a vector A upon a 
direction given by the unit vector e: it is merely the scalar product Ae. 

The scalar product has the property of commutativity:

$$\mathbf{A}\mathbf{B} = \mathbf{B}\mathbf{A} \tag{6}$$

for, in the defining equation (5) (p. 10), the order of the factors is
immaterial.

* It is sometimes convenient to write the scalar product within brackets, in which
case the brackets used are round; e.g. the product of the two scalar products AB and
CD may be written (AB)(CD). The scalar product itself is written (AB) or (A . B) by
some authors.

The scalar product possesses, further, the most important property 
of an ordinary number product — that of being distributive with re- 
spect to addition. This is expressed by the equation 

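$$\mathbf{A}(\mathbf{B} + \mathbf{C} + \mathbf{D} + \dots) = \mathbf{A}\mathbf{B} + \mathbf{A}\mathbf{C} + \mathbf{A}\mathbf{D} + \dots \tag{7}$$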

It may be seen from fig. 5, that the projection, on the direction 
of A, of the sum of the three vectors 
is the same as the sum of the projections 
of the vectors, whence, on account of the 
way the scalar product was defined, equa- 
tion (7) is verified. 

However, the scalar product of two 
vectors differs from the product of two 
numbers in one respect. The latter 
vanishes only if one or both of the 
factors are zero; but the scalar product vanishes also when cos(AB) 
is zero, i.e. if the two vectors are perpendicular. Thus, for the 
three unit vectors of a rectangular co-ordinate system, denoted as 
before by i, j, k, we have 

$$\mathbf{i}\mathbf{j} = \mathbf{j}\mathbf{k} = \mathbf{k}\mathbf{i} = 0. \tag{8}$$

The scalar product of a given vector by itself is termed the square
of the vector. Since cos(AB) = 1 in this case, the result is simply the
square of the magnitude of the vector:

$$\mathbf{A}^2 = \mathbf{A}\mathbf{A} = A^2 \tag{9}$$

and, reciprocally, the magnitude may be written

$$A = |\sqrt{\mathbf{A}^2}| \tag{10}$$

Thus, for the unit vectors,

$$\mathbf{i}\mathbf{i} = \mathbf{j}\mathbf{j} = \mathbf{k}\mathbf{k} = 1. \tag{11}$$

If the rectangular components of two vectors are given,

$$\mathbf{A} = A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k} \quad \text{and} \quad \mathbf{B} = B_x\mathbf{i} + B_y\mathbf{j} + B_z\mathbf{k},$$

we obtain, on account of the distributive property, by multiplying
out in the usual manner,

$$\mathbf{A}\mathbf{B} = A_xB_x\,\mathbf{i}\mathbf{i} + A_xB_y\,\mathbf{i}\mathbf{j} + \dots,$$

which, by equations (8) and (11), reduces to

$$\mathbf{A}\mathbf{B} = A_xB_x + A_yB_y + A_zB_z. \tag{12}$$

Thus the scalar product of two vectors is equal to the sum of the products
of the corresponding components.
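Equation (12), together with the definition (5), gives a convenient way of finding the angle between two vectors from their components. The following Python sketch illustrates this; the function names `dot`, `magnitude` and `angle` and the sample vectors are assumptions made for the illustration.

```python
import math

def dot(a, b):
    """Scalar product as the sum of products of corresponding components."""
    return sum(x * y for x, y in zip(a, b))

def magnitude(a):
    """Magnitude of a vector, the square root of its square."""
    return math.sqrt(dot(a, a))

def angle(a, b):
    """Angle included between a and b, from AB = |A||B| cos(AB)."""
    return math.acos(dot(a, b) / (magnitude(a) * magnitude(b)))

# Hypothetical vectors; A is a unit vector along the x-axis.
A = (1, 0, 0)
B = (1, 1, 0)

print(dot(A, B))                  # 1: also the projection of B upon the direction of A
print(math.degrees(angle(A, B)))  # approximately 45
print(dot((1, 0, 0), (0, 1, 0)))  # 0: perpendicular vectors
```

Since A is here a unit vector, the scalar product AB is at the same time the projection of B upon the direction of A.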
In rectangular co-ordinates, the components are identical with the
projections upon the axes, so that

$$A_x = \mathbf{A}\mathbf{i}, \quad A_y = \mathbf{A}\mathbf{j}, \quad A_z = \mathbf{A}\mathbf{k}. \tag{13}$$

It is immediately evident from this result that if C is the vector sum
of A and B, then each component of C is equal to the sum of the corresponding
components; for if equation (1) be multiplied by i there
results immediately

$$C_x = A_x + B_x \tag{14}$$


Equation (13) is no longer true for oblique co-ordinates, as may 
be seen from fig. 6, for the two-dimensional case. If we regard as com- 
ponents of A the partial vectors in the 
directions of the axes (from which A may 
be obtained by vector addition), then it 
is evident that these partial vectors are 
not equal to the projections upon the 
axes. But the terminus of A may be 
determined uniquely by specifying the 
projections upon the axes, as well as by 
giving the partial vectors. The former 
are, therefore, in a sense, also components 
of A. Because of their different behaviour 
with respect to linear transformations, 
the components of the first kind, which 
yield A by vector addition, are called the contravariant components, 
and the projections are called the covariant components of the vector. 
This distinction is important in cases where the use of oblique co- 
ordinates is essential, as in the Generalized Theory of Relativity. 
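The distinction between the two kinds of component may be made concrete in the two-dimensional case. The following Python sketch is an illustration under stated assumptions: the first axis lies along x, the second makes an angle phi with it, both axis vectors have unit length, and the function name `oblique_components` is hypothetical.

```python
import math

def oblique_components(A, phi):
    """Components of the plane vector A = (Ax, Ay) relative to two unit
    axis vectors: e1 along x, and e2 at the angle phi to e1.

    Returns (contravariant, covariant):
      contravariant (a1, a2) satisfy A = a1*e1 + a2*e2 (partial vectors);
      covariant (p1, p2) are the perpendicular projections A.e1 and A.e2.
    """
    e1 = (1.0, 0.0)
    e2 = (math.cos(phi), math.sin(phi))
    a2 = A[1] / math.sin(phi)
    a1 = A[0] - a2 * math.cos(phi)
    p1 = A[0] * e1[0] + A[1] * e1[1]
    p2 = A[0] * e2[0] + A[1] * e2[1]
    return (a1, a2), (p1, p2)

A = (2.0, 1.0)  # hypothetical vector, given in rectangular co-ordinates
contra_60, co_60 = oblique_components(A, math.radians(60))
contra_90, co_90 = oblique_components(A, math.radians(90))
print(contra_60, co_60)  # the two kinds of component differ
print(contra_90, co_90)  # they agree, up to rounding, for rectangular axes
```

Either pair of numbers fixes the terminus of A, but only the contravariant pair reproduces A by vector addition along the axes.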

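Ex. 3. Give the geometric significance of

$$(\mathbf{A} + \mathbf{B})^2 = \mathbf{A}^2 + 2\mathbf{A}\mathbf{B} + \mathbf{B}^2.$$

Ex. 4. What is the meaning of (A + B)(A − B) for the case where $A^2 = B^2$?

Ex. 5. Calculate the angle between the two vectors

$$\mathbf{S}_n = \cos\alpha_n\,\mathbf{i} + \cos\beta_n\,\mathbf{j} + \cos\gamma_n\,\mathbf{k} \quad (n = 1, 2).$$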



Fig. 6 


4. The Vector Product of two Vectors. The Directed Plane Area as a Vector.

Besides the scalar product, there is another equally important 
product-like combination of two vectors, which is itself a vector. The 
vector product of two vectors A and B is defined as a vector P which is 
perpendicular to the plane determined by A and B, and whose magnitude
is equal to the area of the parallelogram formed by A and B,
i.e. equal to AB sin(AB). The sign (sense) of P is so determined that,
sighting along P, the shortest rotation from A (the first factor), toward
B (the second factor), is clockwise (fig. 7). This stipulation
denies the vector product the property of commutativity possessed by 
the scalar product; for if we reverse the order 
of the factors, then the turning of B toward A 
must be clockwise when looking in the direc- 
tion of P, which means that P is now reversed. 

The vector product is indicated by writing 
the two vectors alongside each other, and 
enclosing them in square brackets.* Thus 

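$$\mathbf{P} = [\mathbf{A}\mathbf{B}] = -[\mathbf{B}\mathbf{A}]. \tag{15}$$

The magnitude of the product vector is

$$P = AB\sin(\mathbf{A}\mathbf{B}). \tag{15'}$$

The most important property of the vector product, without
which the use of the term “product” would not be justified, is that of
distributivity with respect to addition. That is,

$$[\mathbf{A}(\mathbf{B} + \mathbf{C} + \mathbf{D} + \dots)] = [\mathbf{A}\mathbf{B}] + [\mathbf{A}\mathbf{C}] + [\mathbf{A}\mathbf{D}] + \dots \tag{16}$$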



The proof of this formula is somewhat cumbersome, but not difficult, 
and will not be given here. 

Just as the scalar product of two non-vanishing vectors becomes 
zero when the two vectors are perpendicular, so the vector product 
vanishes if the factors are parallel. The equation [AB] = 0 expresses 
the fact that A and B are parallel, provided that A and B differ from 
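zero. In the same way, [AA] = 0. For the unit vectors along the axes,

$$[\mathbf{i}\mathbf{i}] = [\mathbf{j}\mathbf{j}] = [\mathbf{k}\mathbf{k}] = 0 \tag{17}$$

$$[\mathbf{i}\mathbf{j}] = \mathbf{k}; \quad [\mathbf{j}\mathbf{k}] = \mathbf{i}; \quad [\mathbf{k}\mathbf{i}] = \mathbf{j}. \tag{18}$$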


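Using these relationships, the application of the distributive property
gives the following representation of the vector product in terms of
the components:

$$[\mathbf{A}\mathbf{B}] = [(A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k})(B_x\mathbf{i} + B_y\mathbf{j} + B_z\mathbf{k})]$$

$$= (A_yB_z - A_zB_y)\mathbf{i} + (A_zB_x - A_xB_z)\mathbf{j} + (A_xB_y - A_yB_x)\mathbf{k}.$$

This may be written as a determinant:

$$[\mathbf{A}\mathbf{B}] = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ A_x & A_y & A_z \\ B_x & B_y & B_z \end{vmatrix} \tag{19}$$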

The change in sign of the vector product with reversal of the order 
of the factors manifests itself here as the well-known law that the 
interchange of two rows changes the sign of the determinant. 
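The component form of the vector product is easily put to a numerical test. The following Python sketch is only an illustration; the function name `cross` and the sample vectors are hypothetical. It checks the relations (17) and (18) for the unit vectors, the change of sign on reversing the order of the factors, and the vanishing of the product for parallel vectors.

```python
def cross(a, b):
    """Vector product [ab] in rectangular components."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
print(cross(i, j), cross(j, k), cross(k, i))  # k, i, j in turn
print(cross(i, i))                            # (0, 0, 0)

# Hypothetical vectors with integer components, so that comparisons are exact.
A = (1, 2, 3)
B = (4, 5, 6)
print(cross(A, B))  # (-3, 6, -3)
print(cross(B, A))  # (3, -6, 3): reversing the factors reverses the sign
print(cross(A, (2, 4, 6)))  # (0, 0, 0): the second factor is parallel to A
```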

* Besides the square brackets, the notation A × B is often used.
The specification of the product vector P gives the following information
concerning the parallelogram formed by the two vectors
A and B: (1) its position in space (the plane perpendicular to P); (2) the
relative position of the sides A and B with respect to one another;
(3) the area, numerically equal to the length of P. The form of the
parallelogram is not given by P. In looking upon the form as being
unimportant, we are recognizing the fact that every portion of a plane
may be represented by a vector whose direction and magnitude are 
uniquely determined, according to the above rules, by the orientation, 
direction of description, and area of the figure. 

For the vectors $\mathbf{S}_i$ representing the faces of a closed polyhedron,
the following theorem holds: If the representative vector $\mathbf{S}_i$ is assigned
to each face in such manner that it is directed towards the
outside of the solid in every case, then the sum of these vectors
vanishes:

$$\sum_i \mathbf{S}_i = 0 \quad \text{for closed polyhedra.} \tag{20}$$


Fig. 8

This theorem is easily verified for a tetrahedron. If we denote the
three vectors emanating from the vertex (fig. 8) by A, B and C, then the
three faces passing through this vertex are [AB], [BC] and [CA]. The base
is represented by [(C − A)(B − A)]. If this is expanded, and the
resulting vector products added to the above three, the result is zero.
Now any polyhedron may be subdivided into a number of tetrahedra. The
above theorem holds for each of these component tetrahedra, and every
surface inserted by the subdivision in the interior of the solid enters
into the calculation twice, with oppositely directed normals, so that in
forming the sum, the extra surfaces formed by cutting the original solid
into tetrahedra cancel out, leaving only the sum over the external surfaces.
We can go a step further: if we have an arbitrary closed curved
surface, we can approximate to it by a polyhedron whose faces are
portions of the tangent planes. In the limit, when the faces dS become
vanishingly small, the polyhedron and the actual surface coincide,
and the sum passes into an integral

$$\oint d\mathbf{S} = 0 \quad \text{for closed surfaces.} \tag{21}$$

The sign $\oint$ indicates integration over a closed region. A closed surface
of this kind is often termed a “shell”; this type of integral, which
may be termed a “shell integral”, will be met with often in vector
analysis.
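The verification of theorem (20) for a tetrahedron may be repeated with numbers. The following Python sketch forms the four face vectors exactly as written in the text, [AB], [BC], [CA] and [(C − A)(B − A)], and adds them. The helper names and the particular edge vectors are assumptions chosen for illustration.

```python
def cross(a, b):
    """Vector product [ab] in rectangular components."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)

def sub(a, b):
    """Difference a - b, component by component."""
    return tuple(x - y for x, y in zip(a, b))

def vector_sum(vectors):
    """Sum of a list of vectors, component by component."""
    return tuple(sum(components) for components in zip(*vectors))

# Hypothetical edge vectors emanating from one vertex of the tetrahedron.
A = (1, 0, 0)
B = (0, 2, 0)
C = (0, 0, 3)

# The four face vectors. For this choice of A, B, C all four point into
# the solid; reversing every one of them gives the outward set, and the
# sum vanishes in either case.
faces = [cross(A, B), cross(B, C), cross(C, A), cross(sub(C, A), sub(B, A))]
print(faces)              # [(0, 0, 2), (6, 0, 0), (0, 3, 0), (-6, -3, -2)]
print(vector_sum(faces))  # (0, 0, 0)
```

Each of these vectors has the magnitude of a parallelogram, i.e. twice the area of the triangular face, which does not affect the vanishing of the sum.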
In spite of the fact that we can assign the product vector P uniquely
to the parallelogram formed by two vectors, there is a certain dif- 
ference between the parallelogram and its representative vector. If, 
namely, the parallelogram be reflected in its own plane, the direction 
of one side with respect to the other remains unaltered, but the sign 
of the product vector changes. This fundamental difference between 
vectors like displacements (polar vectors) and those which represent 
a direction of rotation (axial vectors) comes out clearly when we 
change from a right-handed to a left-handed system of co-ordinates. 
But since such a change is generally unnecessary and superfluous, 
this difference is of no consequence in actual calculation. On the 
other hand, in the four-dimensional vector analysis of the Theory of
Relativity, the distinction is of essential importance; in that case a 
polar vector has four components, while an axial vector has six. 
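The different behaviour of polar and axial vectors may be seen numerically if we pass from a right-handed to a left-handed system by reversing all three axes. The following Python sketch is an illustration under that assumption; the names `cross` and `invert` and the sample vectors are hypothetical.

```python
def cross(a, b):
    """Vector product [ab], formed by the same component rule in either system."""
    ax, ay, az = a
    bx, by, bz = b
    return (ay * bz - az * by, az * bx - ax * bz, ax * by - ay * bx)

def invert(v):
    """Components of a polar vector after all three axes are reversed."""
    return tuple(-x for x in v)

# Hypothetical polar vectors (e.g. displacements).
A = (1, 2, 3)
B = (4, 5, 6)

P_before = cross(A, B)
P_after = cross(invert(A), invert(B))

print(invert(A))          # (-1, -2, -3): the polar components change sign
print(P_before, P_after)  # (-3, 6, -3) (-3, 6, -3): those of [AB] do not
```

Had [AB] been a polar vector, its components would have changed sign together with those of A and B.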

Ex. 6. Verify (16), using an expression containing only two terms. 

Ex. 7. What is the value of $[\mathbf{A}\mathbf{B}]^2 + (\mathbf{A}\mathbf{B})^2$?

Ex. 8. In solid analytical geometry, all theorems concerning lines and planes 
may be derived with little calculation if we operate with the radius vector r and 
specify directions by unit vectors, instead of using Cartesian co-ordinates. In 
this manner give: 

(a) the equation of a plane whose normal direction is that of n and whose
distance from the origin is p,

(b) the distance of a point $\mathbf{r}_0$ from this plane,

(c) the equation of the plane passing through three points $\mathbf{r}_1$, $\mathbf{r}_2$, $\mathbf{r}_3$.

5. Multiple Products. 

(a) Product of a vector with the scalar product of two other vectors

Since a scalar product is an ordinary number, the product of a 
vector A with the scalar product BC means the multiplication of A 
with this number, according to the rule given in § 2 (p. 9), i.e. a vector 
having the direction of A. This is written A(BC) or (BC) A. It is to be 
noticed that (AB)C has an entirely different meaning from A(BC), 
viz. the former is a vector in the direction of C. 

(b) Scalar product of a vector with the vector product of two other vectors

If P = [AB], then the product CP = C[AB] is the volume of the 
parallelepiped having the three vectors A, B, and C as contiguous 
edges, for the magnitude of P is equal to the area of the parallelogram 
formed by A and B (fig. 9). Thus 

$$\mathbf{C}\mathbf{P} = CP\cos(\mathbf{C}\mathbf{P})$$

is the volume of the parallelepiped, since C cos (CP) is equal to its 
altitude. One might equally well consider the base of the parallelepiped 
to be formed by the vectors C and A, and then form the scalar product
of this quantity with B. This means that the volume can also be
represented by the product [CA]B. Because of 
this property, the square brackets are unneces- 
sary, and from now on such a triple product 
will be denoted simply by writing the three 
factors alongside each other. However, the 
sign of the vector product changes each time the
cyclic order of the factors is changed. The 
effect of interchanging two factors is seen from 
fig. 9. [BA] is a vector directed downward, 
and hence cos ( CP) becomes negative. We may 
set up the following rule for the sign of the triple scalar product: 
The product ABC is positive if the three vectors are relatively 
arranged like the axes of a right-handed co-ordinate system. Thus 

ABC = BCA = CAB = — ACB = — BAC = — CBA. (22) 

If the components of the three vectors are given, then 

$$\mathbf{A}\mathbf{B}\mathbf{C} = (A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k})\,\{(B_yC_z - B_zC_y)\mathbf{i} + (B_zC_x - B_xC_z)\mathbf{j} + (B_xC_y - B_yC_x)\mathbf{k}\}.$$

This may be written as a determinant: 


$$\mathbf{A}\mathbf{B}\mathbf{C} = \begin{vmatrix} A_x & A_y & A_z \\ B_x & B_y & B_z \\ C_x & C_y & C_z \end{vmatrix} \tag{23}$$


Equations (22) thus express the well-known property of determinants
according to which the interchange of two rows causes a change in
algebraic sign.
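The sign rule (22) and the determinant form (23) can be checked numerically. The following is a minimal sketch in Python; the helper names `dot`, `cross` and `triple` and the sample vectors are illustrative assumptions, not part of the text.

```python
def dot(a, b):
    """Scalar product AB of two 3-component tuples."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product [AB] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def triple(a, b, c):
    """Triple scalar product ABC = A.[BC], equal to the determinant (23)."""
    return dot(a, cross(b, c))

# Arbitrary sample vectors (assumed values, chosen only for illustration).
A, B, C = (1.0, 2.0, 3.0), (-2.0, 0.5, 4.0), (3.0, -1.0, 2.0)

cyclic = [triple(A, B, C), triple(B, C, A), triple(C, A, B)]   # same sign
swapped = [triple(A, C, B), triple(B, A, C), triple(C, B, A)]  # opposite sign
print(cyclic, swapped)
```

The three cyclic arrangements give one value and the three arrangements with two factors interchanged give its negative, as (22) states.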

(c) Vector product of a vector with the 
vector product of two other vectors 

If P = [BC], then the vector product

$$\mathbf{R} = [\mathbf{A}\mathbf{P}] = [\mathbf{A}[\mathbf{B}\mathbf{C}]]$$

signifies a vector lying in the plane of B and C, for P = [BC] is
perpendicular to this plane, but R = [AP] is in turn perpendicular to P,
and so falls in the plane of B and C (fig. 10).

Since any vector V lying in the plane of B and C may be written 
in the form 

$$\mathbf{V} = \alpha\mathbf{B} + \beta\mathbf{C}$$

by suitably choosing the scalar multipliers α and β, R must have this
form, and it remains to determine the numbers α and β. To this end,
we introduce a co-ordinate system in such manner that the x-axis is
in the direction of C and the y-axis is in the plane of B and C. This
may be done without restricting the generality of the discussion. We 

then have 

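$$\mathbf{A} = A_x\mathbf{i} + A_y\mathbf{j} + A_z\mathbf{k},$$

$$\mathbf{B} = B_x\mathbf{i} + B_y\mathbf{j},$$

$$\mathbf{C} = C_x\mathbf{i}.$$

The vector product [BC] then assumes the simple form

$$[\mathbf{B}\mathbf{C}] = -B_yC_x\mathbf{k},$$

and

$$[\mathbf{A}[\mathbf{B}\mathbf{C}]] = -A_yB_yC_x\mathbf{i} + A_xB_yC_x\mathbf{j}.$$

If the vector $A_xB_xC_x\mathbf{i}$ be added and subtracted, there results

$$[\mathbf{A}[\mathbf{B}\mathbf{C}]] = A_xC_x(B_x\mathbf{i} + B_y\mathbf{j}) - (A_xB_x + A_yB_y)C_x\mathbf{i}.$$

This is equivalent to the important formula

$$[\mathbf{A}[\mathbf{B}\mathbf{C}]] = \mathbf{B}(\mathbf{A}\mathbf{C}) - \mathbf{C}(\mathbf{A}\mathbf{B}). \tag{24}$$

The coefficients α and β are thus the scalar products AC and −AB.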

With the aid of this formula, complex products may be converted 
to simpler forms. Take, for example, the scalar product of two vector 
products: 

P = [AB] [CD]. 

We put [AB] = E and obtain, by (22) (p. 16) 

P = E[CD] = — C[ED] = — C[ [AB] D], 
and then, using (24), 

P= C {A(DB) — B(AD) }. 

Hence [AB] [CD] = (AC) (BD) — (BC) (AD). . . (25) 
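Formulae (24) and (25) lend themselves to a quick numerical check with arbitrary vectors. The sketch below is only an illustration; the helper names and the seeded random vectors are assumptions introduced here.

```python
import random

def dot(a, b):
    """Scalar product AB."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector product [AB]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def combine(s, a, t, b):
    """Return the vector s*a + t*b."""
    return tuple(s * x + t * y for x, y in zip(a, b))

random.seed(0)  # fixed seed so the run is reproducible
A, B, C, D = (tuple(random.uniform(-1, 1) for _ in range(3)) for _ in range(4))

# (24): [A[BC]] = B(AC) - C(AB)
lhs24 = cross(A, cross(B, C))
rhs24 = combine(dot(A, C), B, -dot(A, B), C)
err24 = max(abs(l - r) for l, r in zip(lhs24, rhs24))

# (25): [AB][CD] = (AC)(BD) - (BC)(AD)
lhs25 = dot(cross(A, B), cross(C, D))
rhs25 = dot(A, C) * dot(B, D) - dot(B, C) * dot(A, D)
err25 = abs(lhs25 - rhs25)

print(err24, err25)  # both should be at the level of rounding error
```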

Ex. 9. Transform [ [AB] [CD] ]. 


6. Differentiation of a Vector with respect to a Scalar; Application to 
the Theory of Space Curves, 


Let a vector v be a continuous function of a continuous scalar 
variable u : 

v = v(u). 


If the variable u is increased by Δu, the vector will change by an
amount

$$\Delta\mathbf{v} = \mathbf{v}(u + \Delta u) - \mathbf{v}(u),$$

and, in complete analogy with scalar functions, we define the derivative
dv/du as the limit

$$\mathbf{v}' = \frac{d\mathbf{v}}{du} = \lim_{\Delta u \to 0} \frac{\mathbf{v}(u + \Delta u) - \mathbf{v}(u)}{\Delta u}. \tag{26}$$

Derivatives of a higher order are defined similarly, e.g.:

$$\mathbf{v}'' = \frac{d}{du}\left(\frac{d\mathbf{v}}{du}\right) = \frac{d^2\mathbf{v}}{du^2} = \lim_{\Delta u \to 0} \frac{\mathbf{v}'(u + \Delta u) - \mathbf{v}'(u)}{\Delta u}. \tag{27}$$


If the position vector r of the points on a space curve be given as
a function of the length of arc s, measured from a given initial point,
then, in the limit, the magnitude of Δr is identical with Δs, and the
limit of Δr/Δs is a vector of length 1, having the direction of the
tangent to the curve (fig. 11). This unit tangent vector is denoted by
t, and we have

$$\mathbf{t} = \frac{d\mathbf{r}}{ds}. \tag{28}$$


The rules for the differentiation of products 
also correspond to those for ordinary scalar functions, as may be 
seen readily by writing the derivatives as limits. In the case of the 
vector product, however, the order of the factors is important. We 
thus have 


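$$\frac{d(\mathbf{A}\mathbf{B})}{du} = \mathbf{A}\frac{d\mathbf{B}}{du} + \frac{d\mathbf{A}}{du}\mathbf{B} = \mathbf{A}\frac{d\mathbf{B}}{du} + \mathbf{B}\frac{d\mathbf{A}}{du}, \tag{29}$$

but

$$\frac{d[\mathbf{A}\mathbf{B}]}{du} = \left[\mathbf{A}\frac{d\mathbf{B}}{du}\right] + \left[\frac{d\mathbf{A}}{du}\mathbf{B}\right] = \left[\mathbf{A}\frac{d\mathbf{B}}{du}\right] - \left[\mathbf{B}\frac{d\mathbf{A}}{du}\right]. \tag{30}$$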


If a vector does not change in length, then, since v 2 = const., 

$$\frac{d\mathbf{v}^2}{du} = 2\mathbf{v}\,\frac{d\mathbf{v}}{du} = 0.$$


Since neither dv/du nor v is to vanish, this means that the derivative 
of a vector of constant length is perpendicular to the vector. This is 
also evident geometrically, for if the length is constant, the end point 
of the vector is restricted to move on a sphere. If the increment is 
infinitesimal, it is tangent to the sphere, and hence is perpendicular
to the vector itself. 
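The perpendicularity of a constant-length vector and its derivative can be illustrated numerically. In the sketch below the particular function `v(u)` and the step size are assumptions chosen for illustration; the derivative is approximated by a central difference in the spirit of the limit (26).

```python
import math

def v(u):
    """A vector of unit length whose direction varies with u (arbitrary choice)."""
    a, b = 0.7 + 0.5 * u, u * u  # polar and azimuthal angles
    return (math.sin(a) * math.cos(b), math.sin(a) * math.sin(b), math.cos(a))

def derivative(f, u, h=1e-5):
    """Central-difference approximation to dv/du."""
    fp, fm = f(u + h), f(u - h)
    return tuple((p - m) / (2 * h) for p, m in zip(fp, fm))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

u0 = 1.3
dv = derivative(v, u0)
print(dot(v(u0), dv))  # close to zero: dv/du is perpendicular to v
```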

This result may be applied to the unit tangent t. Since this 
vector is always of unit length, its derivative must always be 
perpendicular to t, and so must be a vector in the normal plane
to the curve. But this derivative, being the vector difference of
two consecutive tangent vectors, must lie in the osculating plane
formed by the latter, and so its direction is that of the principal
normal, which direction we designate by the unit vector n.

In order to calculate the magnitude of the vector dt/ds, we note
that the curve, in the neighbourhood of two consecutive tangents,
corresponding to three neighbouring points, may be replaced by
the circle of curvature, whose centre M is determined by the
intersection of the perpendiculars to the two consecutive
tangents (fig. 12). The angle dφ between these tangents is the
same as that between the two radii of the circle of curvature.
If ρ is the radius of this circle then, in the limit,

$$ds = \rho\,d\varphi. \tag{31}$$

On the other hand, |dt| = dφ, whence

$$\left|\frac{d\mathbf{t}}{ds}\right| = \frac{1}{\rho}.$$

Thus is obtained the important equation

$$\frac{d\mathbf{t}}{ds} = \frac{d^2\mathbf{r}}{ds^2} = \frac{\mathbf{n}}{\rho}. \tag{32}$$


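The value of the curvature is obtained from this result by squaring
(scalar product) and extracting the root:

$$\frac{1}{\rho} = \sqrt{\left(\frac{d^2x}{ds^2}\right)^2 + \left(\frac{d^2y}{ds^2}\right)^2 + \left(\frac{d^2z}{ds^2}\right)^2}. \tag{33}$$

The unit vector b, perpendicular to the osculating plane, gives the
direction of the binormal. If we stipulate that the three vectors t, n
and b, in this order, form a right-handed triad, then

$$\mathbf{b} = [\mathbf{t}\mathbf{n}]. \tag{34}$$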

For a plane curve, the osculating plane (and therefore the binormal) has
constant direction, so that db/ds vanishes. For a non-plane (skew) curve, this
derivative may be looked upon as a measure of the second curvature, and its
magnitude is known as the torsion τ. The direction of db/ds is found in the
following manner: From the properties of the unit vector, db/ds must be
perpendicular to b, i.e. parallel to the osculating plane. Further, since b is
perpendicular to t, bt = 0, and so

$$\frac{d\mathbf{b}}{ds}\,\mathbf{t} + \mathbf{b}\,\frac{d\mathbf{t}}{ds} = 0.$$

Since, also, b is perpendicular to n = ρ (dt/ds), there results

$$\frac{d\mathbf{b}}{ds}\,\mathbf{t} = 0,$$

i.e. the vector db/ds is perpendicular to t. Since the former is parallel to the
osculating plane, it has the direction of the principal normal n. To fix the sign,
we specify that the torsion is to be reckoned positive if the rotation of b is
clockwise when looking in the direction of t. This gives the vector equation

$$\frac{d\mathbf{b}}{ds} = -\tau\,\mathbf{n}. \tag{35}$$


These examples show how much the use of vector analysis simplifies and 
clarifies the material of differential geometry. In what follows, we shall require 
only the principles presented above, and the reader is referred to modern text- 
books on differential geometry for further applications. 
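As an illustration of (32), (34) and (35), the curvature and torsion of a circular helix may be computed numerically. The helix, its parameters and all function names below are assumptions introduced for this sketch; for a helix of radius a and rise b per radian, with c² = a² + b², the known values are 1/ρ = a/c² and τ = b/c².

```python
import math

A_RADIUS, B_RISE = 2.0, 1.0          # assumed helix parameters a and b
C = math.hypot(A_RADIUS, B_RISE)     # c = sqrt(a^2 + b^2)

def tangent(s):
    """Unit tangent t = dr/ds of r(s) = (a cos(s/c), a sin(s/c), b s/c)."""
    th = s / C
    return (-A_RADIUS * math.sin(th) / C, A_RADIUS * math.cos(th) / C, B_RISE / C)

def deriv(f, s, h=1e-4):
    """Central-difference derivative of a vector function of s."""
    return tuple((p - m) / (2 * h) for p, m in zip(f(s + h), f(s - h)))

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    return math.sqrt(dot(a, a))

def normal(s):
    """Principal normal n: direction of dt/ds, see (32)."""
    dt = deriv(tangent, s)
    k = norm(dt)
    return tuple(x / k for x in dt)

def binormal(s):
    """b = [t n], see (34)."""
    return cross(tangent(s), normal(s))

def curvature(s):
    """1/rho = |dt/ds|."""
    return norm(deriv(tangent, s))

def torsion(s):
    """tau from db/ds = -tau n, see (35)."""
    return -dot(deriv(binormal, s), normal(s))

print(curvature(0.8), torsion(0.8))  # compare with a/c^2 and b/c^2
```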


Ex. 10. When is [a^J =0? 

Ex. 11. An arbitrary surface A has a plane boundary curve, 
and show that [6S] = 0. 


Form 



7. Space Derivatives of a Scalar Quantity. 

If a scalar quantity — the temperature, for example — is given as 
a function of a point in a region, we speak of a scalar field. Let the 
scalar point function have the value u at a certain point. The change 
in the value of this function, corresponding to a displacement of amount 
ds , will depend on the direction of the displacement, i.e. the derivative 
du/ds may have any one of an infinity of values, depending on the 
direction of the displacement ds. It is easily shown, however, that we 
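can specify the change in u in any direction, provided we know a certain
vector obtainable by means of differential operations. In rectangular
co-ordinates we have

$$du = \frac{\partial u}{\partial x}\,dx + \frac{\partial u}{\partial y}\,dy + \frac{\partial u}{\partial z}\,dz.$$

Again, we have

$$d\mathbf{s} = dx\,\mathbf{i} + dy\,\mathbf{j} + dz\,\mathbf{k}.$$

Evidently, then, du may be regarded as the scalar product of ds with
a vector

$$\mathbf{i}\,\frac{\partial u}{\partial x} + \mathbf{j}\,\frac{\partial u}{\partial y} + \mathbf{k}\,\frac{\partial u}{\partial z},$$

which is called the gradient of u, and is written grad u:

$$\operatorname{grad} u = \mathbf{i}\,\frac{\partial u}{\partial x} + \mathbf{j}\,\frac{\partial u}{\partial y} + \mathbf{k}\,\frac{\partial u}{\partial z}, \tag{36}$$

that is,

$$du = d\mathbf{s}\,(\operatorname{grad} u). \tag{37}$$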

The meaning of the gradient may be seen in this way: Leaving
discontinuous functions out of consideration, and dealing only with
continuous point-functions, we can connect all points in the region
having the same value of u by means of surfaces, the so-called level
surfaces (fig. 13). Each of these surfaces is characterized by the fact
that displacements wholly within the surface do not alter the value of
u. If, then, we let $d\mathbf{s}_0$ lie in one of the level surfaces, we have

$$d\mathbf{s}_0\,(\operatorname{grad} u) = 0.$$

But since neither $d\mathbf{s}_0$ nor grad u is to vanish, this means that the
vector grad u is perpendicular to the level surface.

Further, from the definition of the scalar product, it follows that
the increment du has its greatest value, the magnitude of ds remaining
constant, when ds and grad u are in the same direction, or, in other
words, the vector grad u gives the direction of greatest change
(“slope”) of the point-function u; hence the name “gradient”. According
to (37), its magnitude is equal to du/ds if ds is perpendicular to the
level surface.

The change of u in any other direction is obtained, according to (37),
by multiplying by the cosine of the angle φ between the gradient and
the direction of displacement. This suggests a simple graphical
construction for the several values of du, for a ds of fixed magnitude
(fig. 14). Two lines of length equal to the magnitude of grad u are
drawn through $P_0$ and normal to the level surface, one above and one
below, and the spheres having these two lines as diameters are
described. Then a secant drawn in any direction gives the magnitude of
the change in that direction, in relation to the change du in the
direction of the gradient. It must be remembered that ds is held
constant here.

The derivative of u in the direction of s is denoted by ∂u/∂s, and
is thus obtained by projection of grad u on the direction s.
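The statements about the gradient can be tried out on a concrete scalar field. The sketch below is an illustration only: the field `u`, the point `P`, the directions and the finite-difference step are all assumed for the example.

```python
import math

def u(p):
    """An arbitrary smooth scalar point function (assumed for illustration)."""
    x, y, z = p
    return x * x * y + math.sin(z)

def grad(f, p, h=1e-5):
    """Central-difference estimate of grad f at p, component by component as in (36)."""
    g = []
    for i in range(3):
        hi, lo = list(p), list(p)
        hi[i] += h
        lo[i] -= h
        g.append((f(hi) - f(lo)) / (2 * h))
    return tuple(g)

def directional_derivative(f, p, s, h=1e-5):
    """Rate of change of f per unit length along the unit vector s."""
    hi = [a + h * b for a, b in zip(p, s)]
    lo = [a - h * b for a, b in zip(p, s)]
    return (f(hi) - f(lo)) / (2 * h)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

P = (1.0, 2.0, 0.5)
g = grad(u, P)
g_len = math.sqrt(dot(g, g))

s_any = (2 / 3, -1 / 3, 2 / 3)                           # arbitrary unit vector
s_max = tuple(c / g_len for c in g)                      # along the gradient
s_level = (1 / math.sqrt(17), -4 / math.sqrt(17), 0.0)   # perpendicular to grad u at P

print(directional_derivative(u, P, s_any), dot(g, s_any))   # projection of grad u on s
print(directional_derivative(u, P, s_max), g_len)           # greatest rate of change
print(directional_derivative(u, P, s_level))                # close to zero
```

The third direction was chosen by hand to be perpendicular to grad u = (4, 1, cos 0.5) at the point P, so that it lies in the level surface there.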
Numerous examples of the physical occurrence of the gradient concept will
be met with in many sections of this work. One example may be given now. 
A weather map has marked upon it the lines of equal air pressure (isobars). The 
direction of the wind is then given, apart from the effect of the earth’s rotation, 
by the direction of greatest pressure drop, which is perpendicular to the iso- 
bars; and the strength of the wind is determined by the magnitude of the pres- 
sure drop. 


By introducing the gradient we have obtained a vector field, that
of grad u, from a scalar field, for grad u is a vector magnitude which
alters in value from place to place in the region. However, not every
vector field may be obtained from a scalar field in this manner. If a
vector field has this property, a result of great importance follows:
Imagine a curve C drawn in the vector field (fig. 15). Let this curve
consist of a succession of line-elements ds. The line integral of a
vector v along the curve C is defined as the limit, for ds infinitely
small, of the sum of the scalar products v ds, the summation to begin
at $P_0$ and end at $P_1$. If now v is the gradient of a scalar u, this
line integral between $P_0$ and $P_1$ is independent of the form of the
curve C, for the individual magnitudes

$$\mathbf{v}\,d\mathbf{s} = d\mathbf{s}\,(\operatorname{grad} u) = du$$

represent, according to the definition of the gradient, the change in
the function u in moving along a distance ds. The sum of these changes
is the difference between the values of u at $P_1$ and at $P_0$, and this
difference is independent of the path by which one passes from $P_0$ to $P_1$.

If there are two paths, C and C', then reversing the direction of
motion along C' gives with C a closed curve which is described in one
direction. But since the reversal of C' causes all its line-elements ds
to reverse their sign, and since the value of the line integral is the
same on both portions of the curve, the integral along the closed path
vanishes. We thus have the important theorem: If a vector field is
representable as a field of the gradient of a scalar point function, the
value of the line integral of the vector, taken between two points in
the region, is independent of the path; and the line integral over a
closed path vanishes. Thus

$$\int_{P_0}^{P_1} (\operatorname{grad} u)\,d\mathbf{s} = u_1 - u_0, \tag{38}$$

and

$$\oint (\operatorname{grad} u)\,d\mathbf{s} = 0.$$
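Theorem (38) can be illustrated by summing the products v ds along two different paths between the same end points. In the sketch below the scalar field, the two paths and the number of steps are assumptions made for the example; the integral is approximated by evaluating the field at the midpoint of each chord.

```python
import math

def u(p):
    """Assumed scalar point function."""
    x, y, z = p
    return x * x * y + z

def grad_u(p):
    """Its gradient, written out from (36)."""
    x, y, z = p
    return (2 * x * y, x * x, 1.0)

def line_integral(field, path, n=4000):
    """Sum of field.ds over n chords of path(t), 0 <= t <= 1."""
    total = 0.0
    prev = path(0.0)
    for k in range(1, n + 1):
        cur = path(k / n)
        mid = tuple((a + b) / 2 for a, b in zip(prev, cur))
        ds = tuple(b - a for a, b in zip(prev, cur))
        total += sum(f * d for f, d in zip(field(mid), ds))
        prev = cur
    return total

P0, P1 = (0.0, 0.0, 0.0), (1.0, 2.0, 3.0)

def straight(t):
    """Straight line from P0 to P1."""
    return tuple(a + t * (b - a) for a, b in zip(P0, P1))

def curved(t):
    """A different path with the same end points."""
    x, y, z = straight(t)
    bump = math.sin(math.pi * t)
    return (x + bump, y - 2 * bump, z + bump * bump)

I_straight = line_integral(grad_u, straight)
I_curved = line_integral(grad_u, curved)
print(I_straight, I_curved, u(P1) - u(P0))
```

Going out along one path and back along the other gives the closed-path integral, which is the difference of the two values and is therefore close to zero.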



From the definition (36) of the gradient, a useful rule follows:
If u is a function of a simple point function v, then, since

$$\frac{\partial u}{\partial x} = \frac{du}{dv}\,\frac{\partial v}{\partial x},$$

$$\operatorname{grad} u = \frac{du}{dv}\,\operatorname{grad} v. \tag{39}$$

Ex. 12. What is the meaning of grad r? (r = distance from origin.)

Ex. 13. Let a scalar point function depend only on the distance r from the 
origin. Compute grad f(r). 

8. The Concept of Divergence and Gauss’s Theorem. 

If an arbitrary vector field is given, we may, as in the analogous case of § 6 
(p. 17), wish to determine the change in the field vector v accompanying a dis- 
placement ds . As may readily be seen, the calculation of this change necessitates 
a knowledge of three gradients, viz. those of the rectangular components, which
give the amounts of the changes of the components. We thus arrive at a set of 
nine scalars (three vectors). Quantities of this nature will not be met with until 
later, and so we put this question aside, and take up another physically significant 
differential operation, which leads from a vector field to a scalar field. 

In order to understand this operation more clearly, we give the
field vector a special meaning. Let the vector v represent, in direction
and magnitude, the volume of fluid passing, per second, through an area
of 1 sq. cm. perpendicular to v (fig. 16). The amount of fluid passing
through the element of area ΔS, each second, is then given by the scalar
product v ΔS; for the quantity passing through is given by $v_n$, the
component of the flow perpendicular to the surface element.
Let us now enclose a space τ in the vector field by means of an
arbitrary closed surface. Call the elements of this surface dS and let
them be so oriented that the normals are directed outward. The amount of
fluid passing outward through an element dS, in unit time, is therefore
v dS; inward flow has the negative sign. Summation of these elementary
contributions over the entire surface gives the excess amount of
fluid passing to the outside each second. This yields the surface
integral $\oint \mathbf{v}\,d\mathbf{S}$. If there are no sources within the surface, the
integral must vanish. If sources exist within, the integral furnishes a
measure of the combined strength of all sources within the bounded
volume. The total strength depends also on the volume, and
the important quantity is the ratio of the surface integral to
the volume. In the case of sources continuously distributed in
space, by means of this ratio we can express the source-strength
for each element of volume. For the limiting value of the
ratio, the expression div v (read “divergence of v”), has been
introduced. Divergence is, then, equivalent to source-strength. Thus

$$\operatorname{div}\mathbf{v} = \lim_{\Delta\tau \to 0} \frac{1}{\Delta\tau} \oint \mathbf{v}\,d\mathbf{S}. \tag{40}$$


If the components of v are continuous point functions having
continuous derivatives, this limit is finite, and independent of the
form of the volume element. This latter fact is somewhat troublesome to
prove, but it is easy to show that the limit is finite. Thus, let the
volume element selected be a small sphere of radius ρ at whose centre
the field vector has the value $\mathbf{v}_0$. The departures from this value
are then, in any case, of the order of magnitude of the radius ρ. Now
the surface element on the sphere is $\rho^2\,d\Omega$, where $d\Omega$ is an
elementary solid angle. The quantities to be summed in forming the
integral are thus of the form $(\mathbf{v}_0 + \mathbf{w}\rho)\,d\mathbf{S}$, where w is a variable
finite vector, whose further properties need not be given. Since by (21)
(p. 14) $\oint d\mathbf{S} = 0$, the part of the integral contributed by the first
part, viz. $\oint \mathbf{v}_0\,d\mathbf{S} = \mathbf{v}_0 \oint d\mathbf{S}$, vanishes, and there remains a
contribution from the second part, of the order of $\rho^3$. But the volume
of the sphere is $4\pi\rho^3/3$, so that the result of division by this
quantity must be finite.

Let us now consider a finite volume which is bounded by a closed surface
(fig. 17). Let this region be subdivided into volume elements. The
definition of the divergence holds for each of these:

$$\operatorname{div}\mathbf{v}\,d\tau = \oint \mathbf{v}\,d\mathbf{S}.$$

Let both sides be summed over all the elements of volume. The left
side becomes the volume integral $\int \operatorname{div}\mathbf{v}\,d\tau$. In the right member, all
the interior surface elements occur twice (as surfaces of adjacent
volume elements) with oppositely directed normals. Their contributions
cancel out, since the vector v, on account of its continuity,
has the same magnitude both times. There remain only the contributions
of elements of the outer surface. This yields Gauss's Theorem:

$$\oint \mathbf{v}\,d\mathbf{S} = \int \operatorname{div}\mathbf{v}\,d\tau. \tag{41}$$



That is, the surface integral of the vector v, taken over a closed surface , 
is equal to the volume integral of the divergence of v taken throughout the 
enclosed volume. This is sometimes called the Divergence Theorem. 

If the hydrodynamical case is taken as a basis for demonstrating 
this theorem, the result appears almost self-evident. The theorem,
however, is true for any vector field satisfying the conditions given
above.

To obtain an expression for the divergence of a vector in terms of
its rectangular components, we choose as volume element a rectangular
parallelepiped with edges dx, dy and dz. At x, an amount of fluid
$dy\,dz\,(v_x)_x$ passes through the face dy dz (fig. 18). Only the normal
component of v contributes to the fluid passing through the surface.
This fluid enters the volume, since the normal has the direction of the
negative x-axis.

At x + dx, an amount $dy\,dz\,(v_x)_{x+dx}$ streams out through the opposite
face. The normal is here directed along the positive x-axis. Expanding
$(v_x)_{x+dx}$ by Taylor's Theorem, we have for the contribution of these
two surface elements

$$dy\,dz\left\{v_x + \frac{\partial v_x}{\partial x}\,dx - v_x\right\} = \frac{\partial v_x}{\partial x}\,dx\,dy\,dz.$$

Analogous contributions are furnished by the two other pairs of
opposite faces. Thus we have

$$\oint \mathbf{v}\,d\mathbf{S} = \left(\frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z}\right) dx\,dy\,dz,$$

and, after division by the volume dx dy dz,

$$\operatorname{div}\mathbf{v} = \frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z}. \tag{42}$$
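The definition (40) and the component formula (42) can be compared on a small cube. The following sketch is an illustration under assumed choices: the field `v`, the point `P` and the cube half-edge `h` are not from the text, and the flux through each face is estimated from the field value at the face centre.

```python
def v(p):
    """Assumed vector field with simple polynomial components."""
    x, y, z = p
    return (x * x, x * y, y * z)

def flux_per_volume(field, p, h=1e-3):
    """Outward flux through a cube of edge 2h centred at p, divided by its volume."""
    area = (2 * h) ** 2
    total = 0.0
    for axis in range(3):
        for sign in (+1, -1):
            centre = list(p)
            centre[axis] += sign * h
            # Only the component normal to the face contributes; the outward
            # normal points along +axis or -axis according to `sign`.
            total += sign * field(centre)[axis] * area
    return total / (2 * h) ** 3

def div_formula(p):
    """div v from (42) for the field above: 2x + x + y."""
    x, y, z = p
    return 2 * x + x + y

P = (1.0, 2.0, 3.0)
print(flux_per_volume(v, P), div_formula(P))
```

For this polynomial field the one-point face estimate happens to be exact apart from rounding; for a general field the two numbers agree only in the limit of a small cube.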

Ex. 14. Calculate div r from first principles, and also by the formula. 

Ex. 15. How may the “ gold ” balance of a country be calculated from the 
amount of imports and exports ? To what extent does this represent an applica- 
tion of Gauss’s Theorem? (“ Gold ” = equivalent of merchandise.) 


9. The Curl of a Vector, and Stokes’s Theorem. 

Another equally important differential operation leads from a 
given vector field not to a scalar field, as in forming the divergence, 
but to another vector field. 

We refer back to the relation (38) (p. 22), according to which the
line integral of a vector v, taken over a closed curve, vanishes if the
vector is the gradient of a scalar field. In general, this integral does
not vanish; and its magnitude measures an important property of
the field, especially when the integral is taken for infinitesimal
surface elements, and referred to unit area. Here, too, it may be shown
that the limiting value is independent of the particular form of the
boundary, and again an estimation of the order of magnitude shows
that the limit has a meaning when v is a continuous differentiable
point function. The value of the integral, however, depends on the
orientation of the surface element, as may be shown by a simple
example. Let the vector v have the same direction at all points in the
region, but let its magnitude, represented by the length of the
arrows in fig. 19, vary in a direction at right angles to the direction
of v. If the surface element is placed so that its normal is in the
direction of v, then v is perpendicular to ds along the entire circuit,
and the integral vanishes. On the other hand, if the normal is
perpendicular to v, then the net contribution of the edges perpendicular
to v is zero, but that of the edges parallel to v is not, since the
magnitude of v was taken to be different at the latter edges. The value
of the limit

$$L_n = \lim_{\Delta S \to 0} \frac{1}{\Delta S} \oint \mathbf{v}\,d\mathbf{s} \tag{43}$$


is thus a function of the direction of the normal to the surface
element. We now show that we can give the value of this limit for any
direction, if we know the value for any three non-coplanar directions.
For convenience we take these three directions to be the co-ordinate
axes. As surface element we choose a small triangle ΔS = ABC, whose
vertices lie in the axes (fig. 20). Instead of passing around the
triangle ABC, we could describe, in turn, the triangles OBC = $\Delta S_x$,
OCA = $\Delta S_y$ and OAB = $\Delta S_z$. We can calculate the limit

$$L_x = \lim_{\Delta S_x \to 0} \frac{1}{\Delta S_x} \oint_{OBC} \mathbf{v}\,d\mathbf{s}, \ \text{\&c.} \tag{43'}$$

for each. In adding the line integrals along the partial triangles, each
axis will be covered twice, in opposite directions. There remains only
the part contributed by the sides AB, BC and CA, so that the
relationship

$$L_n\,\Delta S = L_x\,\Delta S_x + L_y\,\Delta S_y + L_z\,\Delta S_z \tag{44}$$

is satisfied. On the other hand, from § 4 (p. 14),

$$\Delta\mathbf{S} = \mathbf{n}\,\Delta S = \mathbf{i}\,\Delta S_x + \mathbf{j}\,\Delta S_y + \mathbf{k}\,\Delta S_z. \tag{45}$$
The unit normal vector to ΔS is denoted here by n, and is directed
toward the outside of the tetrahedron, while i, j, k point inward. If
cos α, cos β, cos γ are the direction cosines of the normal n, this
equation is, in terms of components,

$$\Delta S_x = \Delta S\cos\alpha, \quad \Delta S_y = \Delta S\cos\beta, \quad \Delta S_z = \Delta S\cos\gamma.$$

Putting these values in (44) gives

$$L_n = L_x\cos\alpha + L_y\cos\beta + L_z\cos\gamma. \tag{46}$$

The sum on the right may be interpreted as the scalar product of the
unit normal vector n with a vector whose components are $L_x$, $L_y$, $L_z$.
This vector is called “the curl of v” (written: curl v).*
We have therefore

$$\operatorname{curl}\mathbf{v} = \mathbf{i}L_x + \mathbf{j}L_y + \mathbf{k}L_z. \tag{47}$$

It follows that

$$L_n\,\Delta S = \oint \mathbf{v}\,d\mathbf{s} = \mathbf{n}\,(\operatorname{curl}\mathbf{v})\,\Delta S = (\operatorname{curl}\mathbf{v})\,\Delta\mathbf{S}. \tag{48}$$

Just as we obtained Gauss's Theorem, when considering the divergence,
by passing from infinitesimal volume elements to finite volumes, so we
obtain here a new and important relationship by considering a portion
of a surface bounded by a curve. This curve need not be plane, but is
to be traversed in a given sense (fig. 21). Let the surface be
subdivided into infinitely small elements. The boundary of each element
is to be traversed in the same sense as the outer curve. Then, for each
element,

$$\oint \mathbf{v}\,d\mathbf{s} = (\operatorname{curl}\mathbf{v})\,\Delta\mathbf{S}_i.$$

If we add these equations for the separate elements, the contributions
from the inner dividing lines cancel out on the left side of the
resulting equation, since each segment of these lines is traversed
twice, in opposite directions. The left member is then merely the
integral along the bounding curve, and we obtain the important formula
called Stokes's Theorem:

$$\oint \mathbf{v}\,d\mathbf{s} = \int \operatorname{curl}\mathbf{v}\,d\mathbf{S}. \tag{49}$$

The line integral of the vector v, taken over a closed curve C, is equal
to the surface integral of the curl of v, taken over any surface having
C as a boundary.

* The vector is sometimes called “the rotation of v”, and written: rot v.
There follows at once an important property of the vector curl v:
its divergence is zero. If we put curl v = w, then Gauss's Theorem
may be applied to a closed surface, and

$$\oint \mathbf{w}\,d\mathbf{S} = \int \operatorname{div}\mathbf{w}\,d\tau = \int \operatorname{div}\operatorname{curl}\mathbf{v}\,d\tau.$$

Let this closed surface be divided into two parts by a closed curve
(fig. 22). Then

$$\int \operatorname{div}\operatorname{curl}\mathbf{v}\,d\tau = \int_{S_1} \mathbf{w}\,d\mathbf{S} + \int_{S_2} \mathbf{w}\,d\mathbf{S} = \int_{S_1} \operatorname{curl}\mathbf{v}\,d\mathbf{S} + \int_{S_2} \operatorname{curl}\mathbf{v}\,d\mathbf{S}.$$

If, now, we apply Stokes's Theorem to each of the two parts, we must
remember that in forming the integral over the entire surface, all the
normals are directed outward; but in the application of this theorem,
the direction of the normals is determined by the direction in which
the curve C is traversed. If we agree to take dS always toward the
outside, then the sign must be reversed in transforming the second
integral. We thus have

$$\int \operatorname{div}\operatorname{curl}\mathbf{v}\,d\tau = \oint \mathbf{v}\,d\mathbf{s} - \oint \mathbf{v}\,d\mathbf{s} = 0.$$

Since this equation holds for any volume, it follows that

$$\operatorname{div}\operatorname{curl}\mathbf{v} = 0. \tag{50}$$

Further, the vanishing of the line integral of the gradient of a
scalar point function, taken over a closed curve [(38) (p. 22)], may be
expressed in the following way by applying Stokes's Theorem:

$$\operatorname{curl}\operatorname{grad} u = 0. \tag{51}$$

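It remains to calculate the components of the vector curl v from the
components of the vector v. From (47) (p. 27) and (43') (p. 26), we
obtain the x-component of curl v by traversing a surface element which
lies in the yz-plane. Let this element have the form of a square of
side 2h (cf. fig. 23) with sides parallel to the axes. Only the
y-component of v contributes to the integral along the horizontal sides;
only the z-component contributes on the vertical sides. If we let the
subscript 0 denote values at the centre of the square, and measure y
from the centre, then along AB:

$$v_y = (v_y)_0 + \left(\frac{\partial v_y}{\partial y}\right)_0 y - \left(\frac{\partial v_y}{\partial z}\right)_0 h.$$

Along CD:

$$v_y = (v_y)_0 + \left(\frac{\partial v_y}{\partial y}\right)_0 y + \left(\frac{\partial v_y}{\partial z}\right)_0 h.$$

The contribution of the side AB thus is

$$\int_A^B \mathbf{v}\,d\mathbf{s} = \int_{-h}^{+h} v_y\,dy = 2h\,(v_y)_0 - 2h^2\left(\frac{\partial v_y}{\partial z}\right)_0.$$

In the same way, the side CD yields

$$\int_C^D \mathbf{v}\,d\mathbf{s} = -\int_{-h}^{+h} v_y\,dy = -2h\,(v_y)_0 - 2h^2\left(\frac{\partial v_y}{\partial z}\right)_0.$$

Together, these two sides give *

$$\int_A^B \mathbf{v}\,d\mathbf{s} + \int_C^D \mathbf{v}\,d\mathbf{s} = -4h^2\,\frac{\partial v_y}{\partial z}.$$

Similarly, the two vertical sides give

$$\int_B^C \mathbf{v}\,d\mathbf{s} + \int_D^A \mathbf{v}\,d\mathbf{s} = 4h^2\,\frac{\partial v_z}{\partial y}.$$

Since the area of the square is $4h^2$, division by this quantity yields

$$(\operatorname{curl}\mathbf{v})_x = \frac{\partial v_z}{\partial y} - \frac{\partial v_y}{\partial z},$$

and, in like manner,

$$(\operatorname{curl}\mathbf{v})_y = \frac{\partial v_x}{\partial z} - \frac{\partial v_z}{\partial x},$$

$$(\operatorname{curl}\mathbf{v})_z = \frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}.$$

Hence

$$\operatorname{curl}\mathbf{v} = \left(\frac{\partial v_z}{\partial y} - \frac{\partial v_y}{\partial z}\right)\mathbf{i} + \left(\frac{\partial v_x}{\partial z} - \frac{\partial v_z}{\partial x}\right)\mathbf{j} + \left(\frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}\right)\mathbf{k}.$$

On the understanding that the “product” of the differentiating
operator ∂/∂x with a quantity u means ∂u/∂x, then the last equation
may be written as a determinant:

$$\operatorname{curl}\mathbf{v} = \begin{vmatrix} \mathbf{i} & \mathbf{j} & \mathbf{k} \\ \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial z} \\ v_x & v_y & v_z \end{vmatrix} \tag{52}$$

Ex. 16. Calculate curl r.

* The subscripts are omitted; the derivatives refer, as before, to the centre of the
element.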
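The construction with the small square in the yz-plane can be imitated numerically: the line integral around the square, divided by its area, should approach the x-component of curl v. The field, the point and the step sizes in the sketch below are assumptions chosen for illustration.

```python
def v(p):
    """Assumed vector field."""
    x, y, z = p
    return (z * y, x * z * z, x * y + y * y)

def circulation_yz(field, p, h=1e-3, n=50):
    """Line integral of field around a square of side 2h in the plane x = const,
    centred at p and traversed from +y towards +z (right-handed about +x)."""
    x0, y0, z0 = p
    corners = [(y0 - h, z0 - h), (y0 + h, z0 - h), (y0 + h, z0 + h), (y0 - h, z0 + h)]
    total = 0.0
    for k in range(4):
        (ya, za), (yb, zb) = corners[k], corners[(k + 1) % 4]
        dy, dz = (yb - ya) / n, (zb - za) / n
        for j in range(n):
            # Field evaluated at the midpoint of each small step along the side.
            f = field((x0, ya + (j + 0.5) * dy, za + (j + 0.5) * dz))
            total += f[1] * dy + f[2] * dz
    return total

def curl_x_estimate(field, p, h=1e-3):
    """Circulation per unit area, cf. (43')."""
    return circulation_yz(field, p, h) / (2 * h) ** 2

def curl_x_formula(p):
    """d(v_z)/dy - d(v_y)/dz for the field above."""
    x, y, z = p
    return (x + 2 * y) - 2 * x * z

P = (1.0, 2.0, 3.0)
print(curl_x_estimate(v, P), curl_x_formula(P))
```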
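10. The Operator ∇.

If we compare the three formulas

$$\text{(36) (p. 21):}\quad \operatorname{grad} u = \mathbf{i}\,\frac{\partial u}{\partial x} + \mathbf{j}\,\frac{\partial u}{\partial y} + \mathbf{k}\,\frac{\partial u}{\partial z},$$

$$\text{(42) (p. 25):}\quad \operatorname{div}\mathbf{v} = \frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z},$$

$$\text{(p. 29):}\quad \operatorname{curl}\mathbf{v} = \mathbf{i}\left(\frac{\partial v_z}{\partial y} - \frac{\partial v_y}{\partial z}\right) + \mathbf{j}\left(\frac{\partial v_x}{\partial z} - \frac{\partial v_z}{\partial x}\right) + \mathbf{k}\left(\frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}\right),$$

we recognize that we can express all of them symbolically in terms of
a vector whose components are ∂/∂x, ∂/∂y, ∂/∂z, on the principle just
explained (cf. the last sentence of the preceding section). This
symbolical vector is denoted by ∇. It was introduced by Sir William
Hamilton, and is called “nabla”, after an ancient Assyrian harp
whose form ∇ resembles: *

$$\nabla = \mathbf{i}\,\frac{\partial}{\partial x} + \mathbf{j}\,\frac{\partial}{\partial y} + \mathbf{k}\,\frac{\partial}{\partial z}. \tag{53}$$

If we multiply a scalar quantity u with this vector operator we
obtain

$$\nabla u = \mathbf{i}\,\frac{\partial u}{\partial x} + \mathbf{j}\,\frac{\partial u}{\partial y} + \mathbf{k}\,\frac{\partial u}{\partial z} = \operatorname{grad} u. \tag{54}$$

If we form the scalar product of the nabla operator with a vector v
we obtain, according to the definition of scalar product, the sum of
the products of corresponding components:

$$\nabla\mathbf{v} = \frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z} = \operatorname{div}\mathbf{v}. \tag{55}$$

Finally, equation (52) (p. 29) represents the vector product of the
nabla operator with v:

$$[\nabla\mathbf{v}] = \operatorname{curl}\mathbf{v}. \tag{56}$$

From the formal correspondence of the operations grad, div, and curl
with multiplication, it follows at once that these operations are
distributive with respect to addition. That is

$$\nabla(\mathbf{v}_1 + \mathbf{v}_2) = \nabla\mathbf{v}_1 + \nabla\mathbf{v}_2, \ \text{\&c.} \tag{57}$$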

By starting with the definition of divergence, and generalizing, we 

* h " “ bl * 0| " r ““ M of .h. 



that is, 
We shall illustrate the method of proving these formulae by taking
the case of the gradient. Since the form of the volume element is
entirely at our disposal, we take a small cylinder, the generating
lines of which are in the direction of grad u and are therefore normal
to the level surface (fig. 24). Apart from terms of higher order,
the value of u over the base of the cylinder is $u_0 - h\,|\operatorname{grad} u|$;
at the top of the cylinder, the value is $u_0 + h\,|\operatorname{grad} u|$. It is
to be remembered that, in forming the surface integral, the elements dS
must always point outward from the elementary volume. The lateral
surface of the cylinder does not contribute to the integral, since u has
a constant value in each transverse section; and to each surface element
there corresponds another which is equal and opposite. Also, we obtain
the contributions of the ends of the cylinder by multiplying the area ΔS
by the difference in the values of u, and by expressing the direction of
dS by means of grad u:

$$\oint d\mathbf{S}\,u = (\Delta S)\,(2h\operatorname{grad} u) = \Delta\tau\operatorname{grad} u.$$

Dividing by the volume, and passing to the limit, we obtain the third
equation of (58).

These formulae yield important generalizations of Gauss's Theorem,
the proof of which is analogous to that given above: *

$$\oint d\mathbf{S}\,u = \int \operatorname{grad} u\,d\tau, \tag{59}$$

$$\oint [d\mathbf{S}\,\mathbf{v}] = \int \operatorname{curl}\mathbf{v}\,d\tau. \tag{60}$$



Theorems (59) and (60) are also easily proved by writing them out in their
scalar forms (which are often useful) and using (41). For (41) can be written

$$\int (lA + mB + nC)\,dS = \int \left(\frac{\partial A}{\partial x} + \frac{\partial B}{\partial y} + \frac{\partial C}{\partial z}\right) d\tau,$$

where A, B, C are scalar functions, and l, m, n are the direction cosines of the
outward normal to S. Since A, B, C are independent, it follows that




These formulae are closely connected with Green’s Theorem (p. 270). They are 

equivalent to (59), and also lead at once to (60), which involves three equations 
of the type 




dx. 


11. Calculation of the Gradient in a Vector Field; Fundamental Prin- 
ciples of Tensor Analysis. 

In § 7 (p. 20) the question “How does a scalar point function
change with a displacement ds?” was treated. This question may be
extended to vector fields, and we may inquire “How does the vector
v change with a displacement ds?” If we start with the components
of the vector, this question may be referred back to the case of a
scalar point function. If a vector field be given, then at each point in
space we have a field of three scalars, viz. the three components. But,
according to § 7 (p. 21), the change in the components may be written
down at once:



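$$\begin{aligned}
dv_x &= d\mathbf{s}\,\operatorname{grad} v_x = \frac{\partial v_x}{\partial x}\,dx + \frac{\partial v_x}{\partial y}\,dy + \frac{\partial v_x}{\partial z}\,dz,\\
dv_y &= d\mathbf{s}\,\operatorname{grad} v_y = \frac{\partial v_y}{\partial x}\,dx + \frac{\partial v_y}{\partial y}\,dy + \frac{\partial v_y}{\partial z}\,dz,\\
dv_z &= d\mathbf{s}\,\operatorname{grad} v_z = \frac{\partial v_z}{\partial x}\,dx + \frac{\partial v_z}{\partial y}\,dy + \frac{\partial v_z}{\partial z}\,dz,
\end{aligned} \tag{61}$$

and so

$$d\mathbf{v} = \mathbf{i}\,dv_x + \mathbf{j}\,dv_y + \mathbf{k}\,dv_z = (d\mathbf{s}\,\operatorname{grad} v_x)\,\mathbf{i} + (d\mathbf{s}\,\operatorname{grad} v_y)\,\mathbf{j} + (d\mathbf{s}\,\operatorname{grad} v_z)\,\mathbf{k}. \tag{61'}$$

Merely as an abbreviation we write

$$d\mathbf{v} = d\mathbf{s}\,\operatorname{grad}\cdot\mathbf{v}, \tag{62}$$

which is read: “ds-vector gradient of v.”

In order to be able to calculate dv, three vectors (or nine scalars)
must be known. We shall see that such forms, consisting of nine
numbers, have also a physical meaning. From (61) we see that the
components of the vector dv are linear functions of the components
of the vector ds, the coefficients of these functions being the partial
derivatives ∂v_x/∂x, ∂v_x/∂y, &c.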

Let us now investigate such linear vector functions in detail. As
in (61), let the components of a vector $\mathbf{w}$ be linear functions of the
components of a vector $\mathbf{v}$:

$$
\begin{aligned}
w_x &= a_{11} v_x + a_{12} v_y + a_{13} v_z,\\
w_y &= a_{21} v_x + a_{22} v_y + a_{23} v_z,\\
w_z &= a_{31} v_x + a_{32} v_y + a_{33} v_z.
\end{aligned}
\tag{63}
$$

In this manner the vector $\mathbf{w}$ is placed in one-to-one correspondence
with the vector $\mathbf{v}$. The scheme of coefficients $a_{ik}$ has thus an
independent meaning if this correspondence is such that the passage from
$\mathbf{v}$ to $\mathbf{w}$ is independent of the particular co-ordinate
system in which the vectors are resolved into components. In this case, the
coefficients $a_{ik}$ are said to be the components of a Tensor.*

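A vector is a physically determined directed quantity in space, but
the values of the components depend on the choice of axes. In the
$\mathbf{i}, \mathbf{j}, \mathbf{k}$ system, $v_x = \mathbf{v}\mathbf{i}$,
$v_y = \mathbf{v}\mathbf{j}$, $v_z = \mathbf{v}\mathbf{k}$.

For a system $\mathbf{i}', \mathbf{j}', \mathbf{k}'$, turned with respect to
this system,

$$
\begin{aligned}
\mathbf{i}' &= \beta_{11}\mathbf{i} + \beta_{12}\mathbf{j} + \beta_{13}\mathbf{k},\\
\mathbf{j}' &= \beta_{21}\mathbf{i} + \beta_{22}\mathbf{j} + \beta_{23}\mathbf{k},\\
\mathbf{k}' &= \beta_{31}\mathbf{i} + \beta_{32}\mathbf{j} + \beta_{33}\mathbf{k},
\end{aligned}
\tag{64}
$$

where, since $\mathbf{i}', \mathbf{j}', \mathbf{k}'$ again form a rectangular
system, the $\beta_{ik}$ represent the direction cosines of the new axes
referred to the old; e.g. $\beta_{1k}$ are the direction cosines of the new
axis $\mathbf{i}'$ with respect to the old axes. Then, as may be verified by
forming the scalar products $\mathbf{i}'\mathbf{j}' = 0$,
$\mathbf{i}'\mathbf{i}' = 1$, &c.,

$$
\begin{gathered}
\beta_{11}^2 + \beta_{12}^2 + \beta_{13}^2 = 1,\\
\beta_{11}\beta_{21} + \beta_{12}\beta_{22} + \beta_{13}\beta_{23} = 0,\\
\text{in general,}\quad \sum_k \beta_{ik}\beta_{jk} =
\begin{cases} 1 & \text{for } i = j,\\ 0 & \text{for } i \neq j. \end{cases}
\end{gathered}
\tag{65}
$$

Thus the old axes are expressed in terms of the new:

$$
\begin{aligned}
\mathbf{i} &= \beta_{11}\mathbf{i}' + \beta_{21}\mathbf{j}' + \beta_{31}\mathbf{k}',\\
\mathbf{j} &= \beta_{12}\mathbf{i}' + \beta_{22}\mathbf{j}' + \beta_{32}\mathbf{k}',\\
\mathbf{k} &= \beta_{13}\mathbf{i}' + \beta_{23}\mathbf{j}' + \beta_{33}\mathbf{k}'.
\end{aligned}
\tag{66}
$$

The components of $\mathbf{v}$ become, in the new co-ordinate system,

$$
\begin{aligned}
v_{x'} &= \mathbf{v}\mathbf{i}' = \beta_{11}\mathbf{v}\mathbf{i} + \beta_{12}\mathbf{v}\mathbf{j} + \beta_{13}\mathbf{v}\mathbf{k}
        = \beta_{11}v_x + \beta_{12}v_y + \beta_{13}v_z,\\
v_{y'} &= \mathbf{v}\mathbf{j}' = \beta_{21}\mathbf{v}\mathbf{i} + \beta_{22}\mathbf{v}\mathbf{j} + \beta_{23}\mathbf{v}\mathbf{k}
        = \beta_{21}v_x + \beta_{22}v_y + \beta_{23}v_z,\\
v_{z'} &= \mathbf{v}\mathbf{k}' = \beta_{31}\mathbf{v}\mathbf{i} + \beta_{32}\mathbf{v}\mathbf{j} + \beta_{33}\mathbf{v}\mathbf{k}
        = \beta_{31}v_x + \beta_{32}v_y + \beta_{33}v_z,
\end{aligned}
\tag{67}
$$

i.e. the vector components transform in the same way as the axes.
We thus have also

$$
\begin{aligned}
v_x &= \beta_{11}v_{x'} + \beta_{21}v_{y'} + \beta_{31}v_{z'},\\
v_y &= \beta_{12}v_{x'} + \beta_{22}v_{y'} + \beta_{32}v_{z'},\\
v_z &= \beta_{13}v_{x'} + \beta_{23}v_{y'} + \beta_{33}v_{z'}.
\end{aligned}
\tag{68}
$$

* Also called in German works an Affinor, the term originating in the fact that for
$\mathbf{v} = \mathbf{r}$ the $a_{ik}$ give an affine transformation of space (or homogeneous strain).

How must the tensor components $a_{ik}$ transform, if a change of co-
ordinate system is not to alter the relationship of $\mathbf{w}$ to
$\mathbf{v}$? The following equations must hold:

$$
\begin{aligned}
w_x &= a_{11} v_x + a_{12} v_y + a_{13} v_z,\\
w_y &= a_{21} v_x + a_{22} v_y + a_{23} v_z,\\
w_z &= a_{31} v_x + a_{32} v_y + a_{33} v_z,
\end{aligned}
\tag{69}
$$

and

$$
\begin{aligned}
w_{x'} &= a'_{11} v_{x'} + a'_{12} v_{y'} + a'_{13} v_{z'},\\
w_{y'} &= a'_{21} v_{x'} + a'_{22} v_{y'} + a'_{23} v_{z'},\\
w_{z'} &= a'_{31} v_{x'} + a'_{32} v_{y'} + a'_{33} v_{z'}.
\end{aligned}
\tag{70}
$$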

In addition, $w_x, w_y, w_z$ and $w_{x'}, w_{y'}, w_{z'}$ must be so related
that the same vector $\mathbf{w}$ is determined by both triads. If we multiply
the first of the unaccented equations by $\beta_{11}$, the second by
$\beta_{12}$, the third by $\beta_{13}$, and add, we obtain, by (67),

$$
\begin{aligned}
w_{x'} ={}& (\beta_{11}a_{11} + \beta_{12}a_{21} + \beta_{13}a_{31})\,v_x
          + (\beta_{11}a_{12} + \beta_{12}a_{22} + \beta_{13}a_{32})\,v_y\\
         &+ (\beta_{11}a_{13} + \beta_{12}a_{23} + \beta_{13}a_{33})\,v_z.
\end{aligned}
\tag{71}
$$

If we express the $v_x, v_y, v_z$ here in terms of the new components,
according to (68), we obtain $w_{x'}$ as a function of $v_{x'}, v_{y'}, v_{z'}$.
This value of $w_{x'}$ must be identical with that defined by (70). Comparing
the coefficients of $v_{x'}, v_{y'}, v_{z'}$, we have

$$
\begin{aligned}
a'_{11} ={}& \beta_{11}^2 a_{11} + \beta_{11}\beta_{12} a_{12} + \beta_{11}\beta_{13} a_{13}
           + \beta_{12}\beta_{11} a_{21} + \beta_{12}^2 a_{22}\\
          &+ \beta_{12}\beta_{13} a_{23} + \beta_{13}\beta_{11} a_{31} + \beta_{13}\beta_{12} a_{32} + \beta_{13}^2 a_{33}.
\end{aligned}
\tag{72}
$$

Thus the tensor components transform, under rotation of axes, in
such manner that the new components become linear functions of all
the old ones, the coefficients being the squares and products of the
$\beta_{ik}$.

To find the coefficients for the tensor components $a'_{ik}$, set
$w_{x'} = v_{x'}$, $w_x = v_x$, &c., and use (67) to form the product
$v'_i v'_k$ of the vector components, remembering that in expanding the
brackets the order of the factors $v_i$ and $v_k$ must not be changed.
Replacing $v_i v_k$ by $a_{ik}$, the coefficients of $v_i v_k$ yield those of
$a_{ik}$.
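The transformation rule can be tried out numerically. The Python sketch below writes the general form of (72) as $a'_{ik} = \sum_{l,m} \beta_{il}\beta_{km}a_{lm}$ and checks that the relation between the two vectors is then the same in both co-ordinate systems. The rotation about the z-axis, the sample tensor and the helper names are assumptions chosen for illustration.

```python
import math


def rotation_about_z(angle):
    """beta[i][k]: direction cosine of the new axis i relative to the old axis k."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]


def act(a, v):
    """w_i = sum_k a_ik v_k, as in (63); also used for (67) with a = beta."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]


def transform_tensor(beta, a):
    """a'_ik = sum over l, m of beta_il * beta_km * a_lm."""
    return [[sum(beta[i][l] * beta[k][m] * a[l][m]
                 for l in range(3) for m in range(3))
             for k in range(3)] for i in range(3)]


beta = rotation_about_z(0.4)
a = [[1.0, 2.0, 0.0], [0.5, -1.0, 3.0], [4.0, 0.0, 2.0]]  # arbitrary sample tensor
v = [1.0, -2.0, 0.5]

w_old = act(a, v)                  # w resolved along the old axes, by (69)
w_rotated = act(beta, w_old)       # the same w resolved along the new axes, by (67)
w_new = act(transform_tensor(beta, a), act(beta, v))  # (70) in the new axes
```

Agreement of `w_rotated` and `w_new` is exactly the requirement that both triads determine the same vector.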

It is possible to represent a tensor by means of vectors. For the
three components of $\mathbf{w}$ in equation (63) (p. 33) may be written as
the scalar products of the vector $\mathbf{v}$ with the three vectors
$\mathbf{a}_1, \mathbf{a}_2, \mathbf{a}_3$. This amounts to regarding the three
coefficients of each row as vector components:

$$
\begin{aligned}
\mathbf{a}_i &= a_{i1}\mathbf{i} + a_{i2}\mathbf{j} + a_{i3}\mathbf{k},\\
\mathbf{w} &= \mathbf{a}_1\mathbf{v}\cdot\mathbf{i} + \mathbf{a}_2\mathbf{v}\cdot\mathbf{j} + \mathbf{a}_3\mathbf{v}\cdot\mathbf{k}.
\end{aligned}
\tag{73}
$$

But, as may be seen in the example of forming a gradient, the three vec-
tors depend upon the choice of co-ordinate system, for the three vectors
$\operatorname{grad} v_x$, $\operatorname{grad} v_y$ and $\operatorname{grad} v_z$
are dependent upon the co-ordinate system.

Since the components of $\mathbf{w}$ are linear homogeneous functions of
the components of $\mathbf{v}$, according to (63) (p. 33), this functional
correspondence $\Phi$ satisfies the relationship

$$
\Phi(\mathbf{v}_1 + \mathbf{v}_2) = \Phi(\mathbf{v}_1) + \Phi(\mathbf{v}_2), \tag{74}
$$

i.e. $\Phi$ is distributive with respect to addition. But this is the essential
property of multiplication, since the other properties of products
have already been given up in the case of the vector product. We
may thus consider the vector $\mathbf{w}$ as resulting from the multiplication
of the vector $\mathbf{v}$ with the tensor $\Phi$. This tensor may be
represented, according to (73), by three pairs of vectors (dyads):

$$
\Phi = \mathbf{a}_1\cdot\mathbf{i} + \mathbf{a}_2\cdot\mathbf{j} + \mathbf{a}_3\cdot\mathbf{k}. \tag{75}
$$

The first factors in each term are called the antecedents, the second
factors, the consequents; the dot does not indicate multiplication, but
merely separates the factors. We understand $\Phi\mathbf{v}$ to mean a vector
which results from scalar multiplication of the consequents with $\mathbf{v}$,
and $\mathbf{v}\Phi$ means a vector obtained from $\mathbf{v}$ by scalar
multiplication of the antecedents with $\mathbf{v}$. Thus

$$
\Phi\mathbf{v} = \mathbf{a}_1\cdot\mathbf{i}\mathbf{v} + \mathbf{a}_2\cdot\mathbf{j}\mathbf{v} + \mathbf{a}_3\cdot\mathbf{k}\mathbf{v}, \tag{76}
$$

$$
\mathbf{v}\Phi = \mathbf{a}_1\mathbf{v}\cdot\mathbf{i} + \mathbf{a}_2\mathbf{v}\cdot\mathbf{j} + \mathbf{a}_3\mathbf{v}\cdot\mathbf{k}. \tag{77}
$$

The two vectors are, in general, different. The tensor resulting
from interchanging the antecedents and consequents is called the
conjugate tensor $\Phi_c$:

$$
\begin{aligned}
\Phi_c &= \mathbf{i}\cdot\mathbf{a}_1 + \mathbf{j}\cdot\mathbf{a}_2 + \mathbf{k}\cdot\mathbf{a}_3,\\
\Phi_c\mathbf{v} &= \mathbf{i}\cdot\mathbf{a}_1\mathbf{v} + \mathbf{j}\cdot\mathbf{a}_2\mathbf{v} + \mathbf{k}\cdot\mathbf{a}_3\mathbf{v}.
\end{aligned}
$$

Since it is immaterial whether the scalar $\mathbf{a}_1\mathbf{v}$ is written
before or after the vector $\mathbf{i}$, comparison with (77) shows that

$$
\mathbf{v}\Phi = \Phi_c\mathbf{v}.
$$

In the component representation of a tensor, changing to the conjugate
tensor means the interchange of rows and columns in the array of
coefficients.
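The two ways of multiplying a vector with a sum of dyads, and the effect of passing to the conjugate tensor, can be made concrete with a short Python sketch. A tensor is stored here as a list of (antecedent, consequent) pairs; this representation, the sample vectors and the function names are assumptions for illustration.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def scaled(s, a):
    return [s * x for x in a]


def added(a, b):
    return [x + y for x, y in zip(a, b)]


I, J, K = [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]


def phi_times_v(phi, v):
    """Phi v: each antecedent times the scalar product of its consequent with v."""
    result = [0.0, 0.0, 0.0]
    for antecedent, consequent in phi:
        result = added(result, scaled(dot(consequent, v), antecedent))
    return result


def v_times_phi(v, phi):
    """v Phi: scalar product of v with each antecedent, times the consequent."""
    result = [0.0, 0.0, 0.0]
    for antecedent, consequent in phi:
        result = added(result, scaled(dot(v, antecedent), consequent))
    return result


def conjugate(phi):
    """Interchange antecedents and consequents."""
    return [(consequent, antecedent) for antecedent, consequent in phi]


a1, a2, a3 = [1.0, 2.0, 3.0], [0.0, -1.0, 4.0], [2.0, 0.0, 5.0]
phi = [(a1, I), (a2, J), (a3, K)]
v = [1.0, 1.0, -1.0]

post = phi_times_v(phi, v)                   # [-1.0, 1.0, 2.0]
pre = v_times_phi(v, phi)                    # [0.0, -5.0, -3.0]
conj_post = phi_times_v(conjugate(phi), v)   # equals pre
```

The two products differ for this sample tensor, while the conjugate tensor used as a prefactor reproduces the result of the original tensor used as a postfactor.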


The dyad, indicated by writing two vectors side by side with a dot
separating them,* also has the leading property of multiplication, viz.
distributivity with respect to addition:

$$
(\mathbf{a}_1 + \mathbf{a}_2)\cdot\mathbf{i} = \mathbf{a}_1\cdot\mathbf{i} + \mathbf{a}_2\cdot\mathbf{i}. \tag{80}
$$

To show this it is sufficient to form the scalar product of the ante-
cedents with a vector $\mathbf{v}$, in which case

$$
\mathbf{v}(\mathbf{a}_1 + \mathbf{a}_2)\cdot\mathbf{i} = \mathbf{v}\mathbf{a}_1\cdot\mathbf{i} + \mathbf{v}\mathbf{a}_2\cdot\mathbf{i},
$$

since the scalar products are distributive; the result therefore follows.
We may therefore look upon the union of two vectors to form a dyad
as a new kind of non-commutative product, called the direct or dyadic
product. This leads to a new interpretation of the expression
$d\mathbf{s}\,\operatorname{grad}\cdot\mathbf{v}$ with which we started. If we
form the dyadic product of the operator nabla with $\mathbf{v}$, there results

* In some cases the dot is necessary for clarity, e.g. if the factors are
multinomials.


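$$
\begin{aligned}
\nabla\cdot\mathbf{v}
 &= \left(\mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z}\right)\cdot
    \left(\mathbf{i}v_x + \mathbf{j}v_y + \mathbf{k}v_z\right)\\
 &= \operatorname{grad} v_x\cdot\mathbf{i} + \operatorname{grad} v_y\cdot\mathbf{j} + \operatorname{grad} v_z\cdot\mathbf{k}.
\end{aligned}
\tag{81}
$$

If this tensor is now multiplied by the displacement $d\mathbf{s}$, we obtain
exactly (62), viz.

$$
d\mathbf{s}\,\nabla\cdot\mathbf{v} = d\mathbf{s}\,\operatorname{grad}\cdot\mathbf{v}. \tag{82}
$$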


By dyadic multiplication of the nabla operator with the field vector v we 
obtain a tensor called the vector gradient of v. Multiplication of the dis- 
placement ds with this tensor as post-factor gives the change of the vector 
v due to the displacement ds. 

Let us again consider tensors in general. A tensor is said to be
symmetric if

$$
\Phi_c = \Phi, \quad\text{or, in terms of components, if}\quad a_{ik} = a_{ki}, \tag{83}
$$

and is antisymmetric (skew-symmetric) if

$$
\Phi_c = -\Phi, \quad\text{or, in terms of components, if}\quad a_{ik} = -a_{ki}. \tag{83'}
$$

By (83), the symmetric tensor has but 6 independent components,
and, according to (83'), the antisymmetric tensor has only three com-
ponents, i.e. as many as a vector. As a matter of fact, in three dimen-
sions, the multiplication of a vector with an antisymmetric tensor
always may be replaced by forming the vector product with a vector.
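The last statement can be illustrated directly. In the Python sketch below, the three independent components of an antisymmetric array are taken from a vector `omega`, and the product with a vector `v`, formed with the convention $w_i = \sum_k a_{ik}v_k$ of (63), is compared with the vector product of `omega` and `v`. The particular placement of the components and the sample numbers are assumptions made for this illustration; the opposite convention merely changes the sign.

```python
def cross(a, b):
    """Ordinary vector product of two 3-vectors."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def antisymmetric_from_vector(omega):
    """Array with a_ik = -a_ki built from the three components of omega."""
    ox, oy, oz = omega
    return [[0.0, -oz, oy],
            [oz, 0.0, -ox],
            [-oy, ox, 0.0]]


def act(a, v):
    """w_i = sum_k a_ik v_k."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]


omega = [1.0, -2.0, 3.0]
v = [4.0, 0.5, -1.0]

by_tensor = act(antisymmetric_from_vector(omega), v)
by_cross = cross(omega, v)
```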
In physics, symmetric tensors are specially important. They may be
represented by a surface of the second degree. The vectors $\mathbf{v}$ are
drawn as radius vectors $\mathbf{r}$ from a fixed origin. By means of the
tensor $\Phi$, a vector $\mathbf{w} = \mathbf{r}'$ is allocated to each vector
$\mathbf{r}$. Let the scalar product $\mathbf{r}\mathbf{r}'$ be designated by
$p$. Then

$$
\mathbf{r}\mathbf{r}' = p = \mathbf{r}(\Phi\mathbf{r}) = a_{11}x^2 + a_{12}xy + a_{13}xz + a_{21}yx + a_{22}y^2 + \dots
$$

Because of the proviso that $\Phi$ is to be a symmetric tensor, we
have

$$
p = \mathbf{r}(\Phi\mathbf{r}) = a_{11}x^2 + a_{22}y^2 + a_{33}z^2 + 2a_{12}xy + 2a_{23}yz + 2a_{31}zx, \tag{84}
$$

from which there follows

$$
x' = a_{11}x + a_{12}y + a_{13}z, \;\text{\&c.}, \qquad \mathbf{r}' = \tfrac{1}{2}\operatorname{grad} p. \tag{85}
$$

This gives a graphical construction for $\mathbf{r}'$. By (85), $\mathbf{r}'$
has the direction of the normal to the surface $p$ = const., at the end point
P of $\mathbf{r}$. If we draw the surface $p = 1$, then, according to (84),
the magnitude of $\mathbf{r}'$ is the reciprocal of the projection of
$\mathbf{r}$ on the direction of the normal at P (cf. fig. 25).
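Both properties of the construction can be checked numerically. The Python sketch below takes a sample symmetric array, scales a radius vector so that its end point lies on the surface $p = 1$, and compares $\mathbf{r}'$ with half the gradient of $p$; it also compares the magnitude of $\mathbf{r}'$ with the reciprocal of the projection of $\mathbf{r}$ on the normal. The coefficients, the starting vector and the finite-difference step are assumptions for illustration.

```python
import math

# Sample symmetric, positive definite coefficients (an assumption).
A = [[2.0, 0.5, 0.0],
     [0.5, 1.0, 0.3],
     [0.0, 0.3, 1.5]]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def act(a, v):
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]


def p(r):
    """The quadratic form p = r (Phi r)."""
    return dot(r, act(A, r))


def grad(f, r, h=1e-6):
    """Gradient of a scalar function by central differences."""
    g = []
    for k in range(3):
        plus, minus = list(r), list(r)
        plus[k] += h
        minus[k] -= h
        g.append((f(plus) - f(minus)) / (2 * h))
    return g


r = [1.0, 2.0, -1.0]
r_on_surface = [c / math.sqrt(p(r)) for c in r]     # now p = 1
r_prime = act(A, r_on_surface)
half_grad = [0.5 * g for g in grad(p, r_on_surface)]

length = math.sqrt(dot(r_prime, r_prime))
normal = [c / length for c in r_prime]              # unit normal at P
projection = dot(r_on_surface, normal)              # projection of r on the normal
```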

The surface $p = 1$ is called the tensor ellipsoid, although without
further limitation on the values of the coefficients, it might be any
second degree surface. The representation by means of a second degree
surface is seen to be possible, since both a surface of this kind and a
symmetric tensor have six independent coefficients. Further, for every
surface of the second degree there is one specially appropriate
co-ordinate system, viz. that of the principal axes. In this system of
axes the surface has the equation

$$
p = a_{I}x'^2 + a_{II}y'^2 + a_{III}z'^2 \tag{84'}
$$

and the tensor, referred to these new $\mathbf{i}', \mathbf{j}', \mathbf{k}'$
axes, becomes simply

$$
\Phi = a_{I}\,\mathbf{i}'\cdot\mathbf{i}' + a_{II}\,\mathbf{j}'\cdot\mathbf{j}' + a_{III}\,\mathbf{k}'\cdot\mathbf{k}'. \tag{75'}
$$

The components of $\mathbf{w}$ are thus given by

$$
w_{x'} = a_{I}v_{x'}; \quad w_{y'} = a_{II}v_{y'}; \quad w_{z'} = a_{III}v_{z'}.
$$

Fig. 25

Ex. 17. Write the Frenet formulae [eqns. (32), p. 19, and (35), p. 20], using
the vector gradient.

Ex. 18. Determine the directions of the principal axes of a tensor ellipsoid
using the relation that $\mathbf{r}' = \lambda\mathbf{r}$ in these directions.


12. Calculation of more complicated Vector Differential Expressions 
with the help of the Nabla Operator. 

The operations grad, div, curl and grad · may be performed readily,
in the case of products of scalar or vector point functions, with the
help of the $\nabla$-operator. It is only necessary to remember that
$\nabla$ is a sign of differentiation as well as a vector. Everything standing
to the right of such a sign is subject to the process of differentiation. In
ordinary differentiation, the quantities to be held constant may always
be placed to the left of the differentiating symbol. But since the
vector product, and especially the dyadic product, of two vectors are
not commutative (though of course in a vector product we can always
change the order by changing the sign) we shall indicate by means
of the subscript c the constancy of a factor on the right of the sign
of differentiation. This will permit us to leave constants on the right
side of the sign $\nabla$. By following a limiting process entirely analogous
to that for the ordinary differentiation of products, it may be shown
that, in general, a space derivative (grad, div, curl, grad ·) of a pro-
duct is equal to the sum of the corresponding expressions for products
in which only one factor is variable. Thus we obtain the following
practical rule for the space differentiation of products: Using the
$\nabla$-symbol, write the differential expression as the sum of derivatives in
which only one factor is subject to differentiation. Transform these by
the rules of vector algebra in such a way that all quantities not to be dif-
ferentiated are on the left of the sign of differentiation, and interpret
the resulting expressions in terms of the special symbols grad,
div, &c.

This method is easily understood in the case of the more important 
and frequently occurring formulae. These expressions will be computed 
below. 

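(a) Products of scalar point functions

1. $$
\operatorname{grad} uvw = \nabla u v_c w_c + \nabla u_c v w_c + \nabla u_c v_c w
 = vw\operatorname{grad} u + wu\operatorname{grad} v + uv\operatorname{grad} w. \tag{87}
$$

(b) Product of a scalar point function with a vectorial point function

2. $$
\operatorname{div} u\mathbf{v} = \nabla u\mathbf{v}_c + \nabla u_c\mathbf{v}
 = \mathbf{v}\operatorname{grad} u + u\operatorname{div}\mathbf{v}. \tag{88}
$$

3. $$
\operatorname{curl} u\mathbf{v} = [\nabla u\mathbf{v}_c] + [\nabla u_c\mathbf{v}]
 = [\operatorname{grad} u, \mathbf{v}] + u\operatorname{curl}\mathbf{v}. \tag{89}
$$

4. $$
\operatorname{grad}\cdot u\mathbf{v} = \nabla\cdot u\mathbf{v}
 = \operatorname{grad} u\cdot\mathbf{v} + u\operatorname{grad}\cdot\mathbf{v}. \tag{90}
$$

(c) Products of vectorial point functions

5. $$
\operatorname{grad}\mathbf{v}\mathbf{w} = \nabla\cdot\mathbf{v}\mathbf{w}_c + \nabla\cdot\mathbf{v}_c\mathbf{w}.
$$

Now

$$
[\mathbf{v}\operatorname{curl}\mathbf{w}] = [\mathbf{v}[\nabla\mathbf{w}]]
 = \nabla\cdot\mathbf{v}_c\mathbf{w} - \mathbf{v}\nabla\cdot\mathbf{w}
 = \operatorname{grad}\mathbf{v}_c\mathbf{w} - \mathbf{v}\operatorname{grad}\cdot\mathbf{w};
$$

thus

$$
\operatorname{grad}\mathbf{v}\mathbf{w} = [\mathbf{v}\operatorname{curl}\mathbf{w}] + \mathbf{v}\operatorname{grad}\cdot\mathbf{w}
 + [\mathbf{w}\operatorname{curl}\mathbf{v}] + \mathbf{w}\operatorname{grad}\cdot\mathbf{v}. \tag{91}
$$

6. $$
\operatorname{div}[\mathbf{v}\mathbf{w}] = \nabla[\mathbf{v}\mathbf{w}_c] + \nabla[\mathbf{v}_c\mathbf{w}]
 = \mathbf{w}[\nabla\mathbf{v}] - \mathbf{v}[\nabla\mathbf{w}]
 = \mathbf{w}\operatorname{curl}\mathbf{v} - \mathbf{v}\operatorname{curl}\mathbf{w}. \tag{92}
$$

7. $$
\operatorname{curl}[\mathbf{v}\mathbf{w}] = [\nabla[\mathbf{v}\mathbf{w}_c]] + [\nabla[\mathbf{v}_c\mathbf{w}]]
 = \mathbf{w}\operatorname{grad}\cdot\mathbf{v} - \mathbf{w}\operatorname{div}\mathbf{v}
 - \mathbf{v}\operatorname{grad}\cdot\mathbf{w} + \mathbf{v}\operatorname{div}\mathbf{w}. \tag{93}
$$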
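Two of these product formulae, (88) and (92), can be verified numerically with central differences. The following Python sketch is an illustration only: the polynomial sample fields `u`, `v`, `w`, the evaluation point and the step size are assumptions, and the helpers are written out in full rather than taken from any library.

```python
def partial(f, p, k, h=1e-5):
    """Central-difference estimate of the derivative of f along axis k at p."""
    plus, minus = list(p), list(p)
    plus[k] += h
    minus[k] -= h
    return (f(plus) - f(minus)) / (2 * h)


def grad(f, p):
    return [partial(f, p, k) for k in range(3)]


def div(F, p):
    return sum(partial(lambda q, k=k: F(q)[k], p, k) for k in range(3))


def curl(F, p):
    def d(i, k):
        """Derivative of component i along axis k."""
        return partial(lambda q: F(q)[i], p, k)
    return [d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


# Hypothetical sample fields.
u = lambda p: p[0] * p[1] + p[2]
v = lambda p: [p[1] * p[2], p[0] ** 2, p[0] * p[1] * p[2]]
w = lambda p: [p[2], p[0] * p[1], p[1] ** 2]

P = [0.7, -1.2, 0.4]

# (88): div(u v) = v grad u + u div v
lhs_88 = div(lambda q: [u(q) * c for c in v(q)], P)
rhs_88 = dot(v(P), grad(u, P)) + u(P) * div(v, P)

# (92): div[v w] = w curl v - v curl w
lhs_92 = div(lambda q: cross(v(q), w(q)), P)
rhs_92 = dot(w(P), curl(v, P)) - dot(v(P), curl(w, P))
```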


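(d) Second derivatives

$$
\operatorname{div}\operatorname{grad} u = \nabla\nabla u
 = \frac{\partial}{\partial x}\frac{\partial u}{\partial x} + \frac{\partial}{\partial y}\frac{\partial u}{\partial y} + \frac{\partial}{\partial z}\frac{\partial u}{\partial z}
 = \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} + \frac{\partial^2 u}{\partial z^2} = \Delta u. \tag{94}
$$

The scalar product of the $\nabla$-operator with itself is therefore equal to
the Laplacian operator
$\dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2} + \dfrac{\partial^2}{\partial z^2}$,
denoted by $\Delta$. Note, however, that the rule for products stated in § 12,
p. 38, does not apply to $\Delta$, as the following example shows:

$$
\operatorname{div}\operatorname{grad} uv = \operatorname{div}(u\operatorname{grad} v + v\operatorname{grad} u)
 = u\,\Delta v + v\,\Delta u + 2\operatorname{grad} u\,\operatorname{grad} v.
$$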
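The extra term in the Laplacian of a product is easy to see numerically. The Python sketch below uses second central differences for $\Delta$ and first central differences for grad; the sample functions `u` and `v`, the point `P` and the step sizes are assumptions for illustration.

```python
def laplacian(f, p, h=1e-3):
    """Sum of second central differences along the three axes."""
    total = 0.0
    for k in range(3):
        plus, minus = list(p), list(p)
        plus[k] += h
        minus[k] -= h
        total += (f(plus) - 2.0 * f(p) + f(minus)) / (h * h)
    return total


def grad(f, p, h=1e-5):
    g = []
    for k in range(3):
        plus, minus = list(p), list(p)
        plus[k] += h
        minus[k] -= h
        g.append((f(plus) - f(minus)) / (2 * h))
    return g


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


# Hypothetical sample functions.
u = lambda p: p[0] ** 2 * p[1] + p[2]
v = lambda p: p[0] * p[2] + p[1] ** 2
uv = lambda p: u(p) * v(p)

P = [1.0, 2.0, 0.5]

full = laplacian(uv, P)                                   # Laplacian of the product
naive = u(P) * laplacian(v, P) + v(P) * laplacian(u, P)   # what a plain product rule would give
cross_term = 2.0 * dot(grad(u, P), grad(v, P))            # the additional term
```

For these sample functions the naive product rule gives 23, the cross term 14, and the full Laplacian 37, so the additional term cannot be dropped.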


Again, we have

$$
\operatorname{curl}\operatorname{grad} u = [\nabla\nabla u] = 0. \tag{49}
$$

This result, already obtained in § 9 (p. 28), follows purely formally
here from the rule of vector algebra which states that a product of the
form $[\mathbf{A}\,\mathbf{A}a]$ vanishes. Similar remarks hold for the
relationship

$$
\operatorname{div}\operatorname{curl}\mathbf{v} = \nabla[\nabla\mathbf{v}] = 0, \tag{48}
$$

also obtained in § 9 (p. 28).

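On the other hand, there is no simpler vector expression for
$\operatorname{grad}\operatorname{div}\mathbf{v}$. In rectangular co-ordinates,
this expression signifies the vector

$$
\begin{aligned}
\mathbf{i}\frac{\partial}{\partial x}\operatorname{div}\mathbf{v}
 + \mathbf{j}\frac{\partial}{\partial y}\operatorname{div}\mathbf{v}
 + \mathbf{k}\frac{\partial}{\partial z}\operatorname{div}\mathbf{v}
 ={}& \mathbf{i}\left(\frac{\partial^2 v_x}{\partial x^2} + \frac{\partial^2 v_y}{\partial x\,\partial y} + \frac{\partial^2 v_z}{\partial x\,\partial z}\right)\\
 &+ \mathbf{j}\left(\frac{\partial^2 v_x}{\partial x\,\partial y} + \frac{\partial^2 v_y}{\partial y^2} + \frac{\partial^2 v_z}{\partial y\,\partial z}\right)\\
 &+ \mathbf{k}\left(\frac{\partial^2 v_x}{\partial x\,\partial z} + \frac{\partial^2 v_y}{\partial y\,\partial z} + \frac{\partial^2 v_z}{\partial z^2}\right).
\end{aligned}
\tag{95}
$$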


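With the help of the expression for
$\operatorname{grad}\operatorname{div}\mathbf{v}$, the expression
$\operatorname{curl}\operatorname{curl}\mathbf{v}$ may be transformed in a
manner which finds important application in electrodynamics:

$$
\operatorname{curl}\operatorname{curl}\mathbf{v} = [\nabla[\nabla\mathbf{v}]]
 = \nabla\cdot\nabla\mathbf{v} - \nabla\nabla\cdot\mathbf{v}
 = \operatorname{grad}\operatorname{div}\mathbf{v} - \nabla\nabla\cdot\mathbf{v}. \tag{96}
$$

Here we have the scalar product of $\nabla$ with itself standing before a
vector. If this vector is written in terms of its components, the com-
plete expression $\operatorname{curl}\operatorname{curl}\mathbf{v}$ may be
calculated with little difficulty; the computation shows that the result is
$\operatorname{grad}\operatorname{div}\mathbf{v}$ minus a vector with
components as follows:

$$
\nabla\nabla\cdot\mathbf{v} = \mathbf{i}\,\Delta v_x + \mathbf{j}\,\Delta v_y + \mathbf{k}\,\Delta v_z = \Delta\mathbf{v}. \tag{97}
$$

Thus the rule $\nabla\nabla = \Delta$, which held for simple point functions,
is true here also. Hence we have the important relation

$$
\operatorname{curl}\operatorname{curl}\mathbf{v} = \operatorname{grad}\operatorname{div}\mathbf{v} - \Delta\mathbf{v}. \tag{98}
$$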


Ex. 19. Calculate the following surface integrals for constant vector r

(a) $\oint (\mathbf{i}\mathbf{r})\,d\mathbf{S}$; (b) $\oint \mathbf{r}\cdot\mathbf{i}\,d\mathbf{S}$.

Ex. 20. Calculate $\Delta\dfrac{1}{r}$.

Ex. 21. Convert the surface integral

$$
\oint (u\operatorname{grad} v - v\operatorname{grad} u)\,d\mathbf{S}
$$


into a volume integral. In particular apply the result to v = 1/r. It should be 
remembered that the point 0 is to be excluded from the region of integration, 
on account of the discontinuity of 1/r. Thus the surface integral must also ex- 
tend over the small sphere enclosing 0 and having its normal directed toward 
this point. 


13.* Differential Vector Operations in Curvilinear Orthogonal Co- 
ordinates. 

In many cases occurring in practice, it is more expedient to resolve 
a vector not in Cartesian co-ordinates, as done exclusively thus far, 
but to use other parameters which are often inherent in the problem 
itself. For example, in studying the propagation of electromagnetic 
waves over the earth’s surface, spherical surface co-ordinates (latitude 
and longitude) are indicated. In space, a point is located by means 
of three parameters u, v, w. The rectangular co-ordinates are then 
functions of these parameters: 

$$
x = \phi_1(u, v, w), \quad y = \phi_2(u, v, w), \quad z = \phi_3(u, v, w). \tag{99}
$$

If one co-ordinate, say u, is held constant, (99) will represent the
parametric equation of the surface u = const. In the same way we get
another surface for v = const., and through every point in space there
will be three surfaces corresponding to the three co-ordinate planes.
We restrict our considerations to those systems where the three co-
ordinate surfaces form an orthogonal set, i.e. are normal to each other in
pairs (fig. 26), as this type of curvilinear co-ordinate system is almost
exclusively used. In the case of spherical space co-ordinates it is
immediately evident that the three surfaces (sphere, radial cone and
meridian plane) are mutually perpendicular. Elliptical co-ordinates
are also orthogonal. Their surfaces consist of confocal ellipsoids of
rotation, hyperboloids of either one or two sheets, and planes con-
taining one or the other principal axis of the conics. If the co-ordinate
surfaces are orthogonal, so will be the curves of intersection, called
co-ordinate lines.

* Starred sections may be omitted without destroying the continuity.

Fig. 26

The tangents drawn at an intersection of three surfaces will be
characterized by the unit vectors $\mathbf{u}, \mathbf{v}, \mathbf{w}$,
corresponding to the Cartesian unit vectors $\mathbf{i}, \mathbf{j}, \mathbf{k}$.
There is an important difference between the two sets: the former change
their direction as we go from point to point. If a vector $\mathbf{A}$ is given
as a function of position, its respective components at the point P are
understood to be its projections on the directions
$\mathbf{u}, \mathbf{v}, \mathbf{w}$ drawn at the point in question. The scalar
and vector products retain their form in this scheme and it is only when we
pass to differential operations that the relations differ from their
Cartesian forms. If the co-ordinate u is allowed to increase by du, there is a
displacement in the u direction of amount

$$
d\mathbf{s}_u = U(u, v, w)\,du\,\mathbf{u}, \tag{100}
$$

where the factor $U(u, v, w)$ changes from one place to another. In
an individual case its value may be read off from a figure. In the same
way,

$$
d\mathbf{s}_v = V(u, v, w)\,dv\,\mathbf{v}, \qquad d\mathbf{s}_w = W(u, v, w)\,dw\,\mathbf{w}. \tag{100a}
$$

The components of the gradient of a scalar function $\psi$ may be found
at once as the projections in the u, v, w directions, and these pro-
jections represent the derivatives $d\psi/ds$ in these directions. From
(100),

$$
\operatorname{grad}_u\psi = \frac{1}{U}\frac{\partial\psi}{\partial u}, \qquad
\operatorname{grad}_v\psi = \frac{1}{V}\frac{\partial\psi}{\partial v}, \qquad
\operatorname{grad}_w\psi = \frac{1}{W}\frac{\partial\psi}{\partial w}. \tag{101}
$$


In order to compute the divergence of a vector $\mathbf{A}$ we start with the
definition (40), p. 24, selecting as volume element a parallelepiped
formed by co-ordinate surfaces. Consider the two sides normal to
$\mathbf{u}$. One of these is strictly perpendicular; the other is
perpendicular as far as quantities of the second order. Only the u-component
$A_u$ will contribute to the surface integral. At the point u the area of the
surface element is $|d\mathbf{s}_v|\,|d\mathbf{s}_w| = (VW)_u\,dv\,dw$, while
at $u + du$ it is $(VW)_{u+du}\,dv\,dw$, so that considering the opposite
directions of the normals, the net contribution of both surfaces will be

$$
-(A_u VW)_u\,dv\,dw + (A_u VW)_{u+du}\,dv\,dw = \frac{\partial (A_u VW)}{\partial u}\,du\,dv\,dw.
$$

Forming the analogous expression for the other pairs of surfaces and
dividing by the volume $UVW\,du\,dv\,dw$, we have

$$
\operatorname{div}\mathbf{A} = \frac{1}{UVW}\left[\frac{\partial (A_u VW)}{\partial u} + \frac{\partial (A_v WU)}{\partial v} + \frac{\partial (A_w UV)}{\partial w}\right]. \tag{102}
$$
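Formula (102) can be tried out on a concrete system. The Python sketch below evaluates it by central differences for space polar co-ordinates, for which the factors are taken as $U = 1$, $V = r$, $W = r\sin\theta$ with the ordering $(r, \theta, \phi)$. That ordering, the sample field with components $(r^2, r\sin\theta, 0)$ and all function names are assumptions for this illustration; for this field the divergence works out by hand to $4r + 2\cos\theta$.

```python
import math


def divergence(A, scale_factors, q, h=1e-5):
    """Evaluate (102) at q = (u, v, w) by central differences.

    A(q) returns the components (A_u, A_v, A_w);
    scale_factors(q) returns (U, V, W).
    """
    def flux(k, point):
        s = scale_factors(point)
        # Component k times the product of the other two scale factors.
        return A(point)[k] * s[(k + 1) % 3] * s[(k + 2) % 3]

    total = 0.0
    for k in range(3):
        plus, minus = list(q), list(q)
        plus[k] += h
        minus[k] -= h
        total += (flux(k, plus) - flux(k, minus)) / (2 * h)
    U, V, W = scale_factors(q)
    return total / (U * V * W)


def spherical_scale_factors(q):
    r, theta, _phi = q
    return (1.0, r, r * math.sin(theta))


def sample_field(q):
    """Hypothetical field with components (r^2, r sin(theta), 0)."""
    r, theta, _phi = q
    return (r * r, r * math.sin(theta), 0.0)


q = (2.0, 0.7, 0.3)
numeric = divergence(sample_field, spherical_scale_factors, q)
analytic = 4.0 * q[0] + 2.0 * math.cos(q[1])
```

Passing the scale factors as a function keeps the routine usable for any orthogonal system, e.g. cylindrical co-ordinates.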


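Substituting the special value $\operatorname{grad}\psi$ for the vector
$\mathbf{A}$, we have, by (101) and (102),

$$
\Delta\psi = \frac{1}{UVW}\left[\frac{\partial}{\partial u}\left(\frac{VW}{U}\frac{\partial\psi}{\partial u}\right)
 + \frac{\partial}{\partial v}\left(\frac{WU}{V}\frac{\partial\psi}{\partial v}\right)
 + \frac{\partial}{\partial w}\left(\frac{UV}{W}\frac{\partial\psi}{\partial w}\right)\right]. \tag{103}
$$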



In computing the curl we proceed again as in the case of rect-
angular Cartesian co-ordinates by traversing, one after the other, three
elementary rectangles lying, respectively, in the three co-ordinate
planes. It must be remembered that the lengths of two opposite sides
will differ somewhat. In the surface u = const. we get the contri-
butions $(A_v V)_w\,dv$ and $(-A_v V)_{w+dw}\,dv$ by going along the v curves
and the contributions $-(A_w W)_v\,dw$ and $(A_w W)_{v+dv}\,dw$ along the w
curves. Expanding these expressions, adding them and dividing the result by
the area, we get the u component of the curl. The other two follow
by permutation of the letters:

$$
\begin{aligned}
(\operatorname{curl}\mathbf{A})_u &= \frac{1}{VW}\left[\frac{\partial (A_w W)}{\partial v} - \frac{\partial (A_v V)}{\partial w}\right],\\
(\operatorname{curl}\mathbf{A})_v &= \frac{1}{WU}\left[\frac{\partial (A_u U)}{\partial w} - \frac{\partial (A_w W)}{\partial u}\right],\\
(\operatorname{curl}\mathbf{A})_w &= \frac{1}{UV}\left[\frac{\partial (A_v V)}{\partial u} - \frac{\partial (A_u U)}{\partial v}\right].
\end{aligned}
\tag{104}
$$


The expression for the vector gradient in curvilinear co-ordinates 
is troublesome to derive, and since it is seldom needed it will not be 
deduced here. 

Ex. 22. Compute $\operatorname{grad}\psi$, $\operatorname{div}\mathbf{A}$,
$\Delta\psi$ and $\operatorname{curl}\mathbf{A}$ in cylindrical co-ordinates
$(z, \rho, \phi)$ and in space polar co-ordinates $(r, \phi, \theta)$.


14. Degeneration of the Vector Differential Operations at Surfaces of 
Discontinuity in the Field. 

In proving the above relationships of vector analysis, we have
hitherto imposed the condition that the point function be continuous in
the region under consideration. However, the case often arises in physics
where a scalar or vector point function has different values on two sides
of a given surface. In this case, the limiting values defined by our
differential operations become infinite. If, for example, we consider the
divergence

$$
\operatorname{div}\mathbf{v} = \lim_{\Delta\tau\to 0}\frac{1}{\Delta\tau}\oint \mathbf{v}\,d\mathbf{S},
$$

this limit has a meaning only if the vector $\mathbf{v}$ is continuous within
the volume element $\Delta\tau$. Suppose, however, that a surface of
discontinuity passes through $\Delta\tau$. Then we may choose as volume
element a small cylinder whose bases $\Delta S$ are close to the surface of
discontinuity, and whose lateral surface is perpendicular to it (fig. 27).
With continuous point functions, the difference
$\mathbf{v}_2 - \mathbf{v}_1$ of the values on the two bases of the cylinder
vanishes as the altitude of the cylinder is reduced to zero; here, however,
$\mathbf{v}_2 - \mathbf{v}_1$ becomes the difference of the values on
the two sides of the surface of discontinuity. If we divide the integral
$\oint \mathbf{v}\,d\mathbf{S}$, taken over the surface of the cylinder, by the
volume $\Delta\tau$, the result is infinite, for $\Delta\tau$ vanishes to a
higher order as the cylinder is made smaller. If we wish to obtain a limit in
such cases, we must divide by the area of the base $\Delta S$ rather than by
the volume of the cylinder. The resulting limit is called the surface
divergence (Div $\mathbf{v}$):

$$
\operatorname{Div}\mathbf{v} = \lim_{\Delta S\to 0}\frac{1}{\Delta S}\oint \mathbf{v}\,d\mathbf{S}. \tag{105}
$$

Fig. 27


If we introduce as unit normal vector a vector $\mathbf{n}$ of length 1
pointing from side 1 toward side 2, then, with a sufficiently small surface
element, we have for the lower side

$$
\int \mathbf{v}\,d\mathbf{S} = -\mathbf{n}\mathbf{v}_1\,\Delta S,
$$

and for the upper side

$$
\int \mathbf{v}\,d\mathbf{S} = +\mathbf{n}\mathbf{v}_2\,\Delta S.
$$

The negative sign is to be used on the lower side, since the normals
are to point outward in forming the surface integral, but the vector
$\mathbf{n}$ was taken to point inward. The contribution of the lateral
cylindrical surface can be made an infinitesimal of higher order by reducing
the altitude of the cylinder. We thus have, as a limit, the contri-
butions of the bases of the cylinder, and so

$$
\operatorname{Div}\mathbf{v} = \lim \frac{1}{\Delta S}\,\mathbf{n}(\mathbf{v}_2 - \mathbf{v}_1)\,\Delta S = \mathbf{n}(\mathbf{v}_2 - \mathbf{v}_1). \tag{106}
$$


Since the definition by means of the limit of a surface integral divided
by the enclosed volume is not confined to the divergence, but repre-
sents the general co-ordinate-free definition of the $\nabla$-operator, we
obtain for surfaces of discontinuity, in analogy with the ordinary
$\nabla$-operator, a reduced operator, the "surface nabla", denoted by
the sign ||. Its particular meaning depends upon whether it is applied
to a scalar or a vector point function, and will be made clear by the
following:

Surface gradient:
$$
\|\,u = \operatorname{Grad} u = \mathbf{n}(u_2 - u_1). \tag{107}
$$

Surface divergence:
$$
\|\,\mathbf{v} = \operatorname{Div}\mathbf{v} = \mathbf{n}(\mathbf{v}_2 - \mathbf{v}_1). \tag{108}
$$

Surface curl:
$$
[\|\,\mathbf{v}] = \operatorname{Curl}\mathbf{v} = [\mathbf{n}(\mathbf{v}_2 - \mathbf{v}_1)]. \tag{109}
$$

Surface vector gradient:
$$
\|\cdot\mathbf{v} = \operatorname{Grad}\cdot\mathbf{v} = \mathbf{n}\cdot(\mathbf{v}_2 - \mathbf{v}_1). \tag{110}
$$

If a surface of discontinuity passes through the region of integration
V, the theorem of Gauss must be supplemented by a term corre-
sponding to the surface divergence at this surface of discontinuity.
This is done as follows: A closed surface is drawn surrounding the
surface of discontinuity closely (fig. 28). This separates the latter surface
from the region of integration. Gauss's Theorem is now applied to the
remaining volume, remembering that the new surface $F_2$, as well as
the outer surface $F_1$, contributes to the surface integral:

$$
\oint_{F_1}\mathbf{v}\,d\mathbf{S} + \oint_{F_2}\mathbf{v}\,d\mathbf{S} = \int \operatorname{div}\mathbf{v}\,d\tau.
$$

Fig. 28

If $\mathbf{n}$ is again used to indicate the unit normal vector of the surface
of discontinuity, pointing from side 1 to side 2, then

$$
\oint_{F_2}\mathbf{v}\,d\mathbf{S} = \int \mathbf{n}\mathbf{v}_1\,dS - \int \mathbf{n}\mathbf{v}_2\,dS = -\int \operatorname{Div}\mathbf{v}\,dS,
$$

whence

$$
\oint_{F_1}\mathbf{v}\,d\mathbf{S} = \int \operatorname{div}\mathbf{v}\,d\tau + \int \operatorname{Div}\mathbf{v}\,dS. \tag{111}
$$


How is the reduced nabla operation to be performed on a product
of point functions which are discontinuous on a given surface? If the
calculational rules of § 12 (p. 38) are formally applied here, what
meaning is to be assigned to a quantity with subscript c? A simple
manipulation makes this clear. We have

$$
\mathbf{n}(u_2 v_2 - u_1 v_1) = \mathbf{n}\,\frac{v_1 + v_2}{2}\,(u_2 - u_1) + \mathbf{n}\,\frac{u_1 + u_2}{2}\,(v_2 - v_1).
$$

If we designate the average values $(v_1 + v_2)/2$ in the customary way,
by a horizontal bar over the symbols, then

$$
\operatorname{Grad} uv = \|\,u v_c + \|\,u_c v = \mathbf{n}\,\bar{v}\,(u_2 - u_1) + \mathbf{n}\,\bar{u}\,(v_2 - v_1). \tag{112}
$$

The subscript c on a quantity standing after the surface nabla thus
signifies that this quantity is to be replaced by the mean of the values
on the two sides of the surface of discontinuity. In this way all the
calculational rules of § 12 (p. 38) may be applied formally to the
degenerate differential operations.
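The mean-value rule can be confirmed with a few numbers. In the Python sketch below, the surface gradient of a product is computed once directly from the jump of the product and once from the rule with mean values; the normal vector and the values on the two sides are arbitrary assumptions.

```python
def surface_grad(n, f1, f2):
    """Grad f = n (f2 - f1) for a scalar with values f1, f2 on sides 1 and 2."""
    return [ni * (f2 - f1) for ni in n]


def surface_grad_product(n, u1, u2, v1, v2):
    """Product rule with the held factor replaced by its mean value."""
    u_bar = 0.5 * (u1 + u2)
    v_bar = 0.5 * (v1 + v2)
    return [ni * (v_bar * (u2 - u1) + u_bar * (v2 - v1)) for ni in n]


n = [0.0, 0.0, 1.0]        # unit normal from side 1 to side 2
u1, u2 = 2.0, 5.0          # sample values of u on the two sides
v1, v2 = -1.0, 3.0         # sample values of v on the two sides

direct = surface_grad(n, u1 * v1, u2 * v2)           # [0.0, 0.0, 17.0]
by_rule = surface_grad_product(n, u1, u2, v1, v2)    # [0.0, 0.0, 17.0]
```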

15.* Fundamentals of the Matrix Calculus.* 

On p. 33 the coefficients $a_{ik}$ fulfilling certain transformation con-
ditions were designated as tensor components. It proves useful in many
branches of physics to introduce quite general arrays of such coefficients
as mathematical quantities. An array of this kind need not be square;
it may have m rows and n columns, and would then be called a matrix
of order m, n. The individual coefficients $a_{ik}$ are called elements of the
matrix. The entire matrix is designated by a bold-face letter: $\mathbf{A}$.

Two matrices are equal when and only when their corresponding
elements are identical. Thus a matrix equation replaces mn numerical
equations. A matrix having all its elements equal to zero is called a
null matrix, $\mathbf{0}$. A square matrix having only its diagonal elements
different from zero is called a diagonal matrix. If all these diagonal
elements are equal to unity the matrix is called the unit matrix, $\mathbf{E}$.

A square matrix is said to be symmetric if each element is equal to
the one which is in a symmetric position to it with respect to the
diagonal, that is, if $a_{ik} = a_{ki}$. A square matrix is said to be anti-
symmetric (or skew symmetric) if each element is opposite in sign to its
"reflection" in the diagonal, that is, if $a_{ik} = -a_{ki}$; the diagonal
elements are then zero ($a_{ii} = 0$). The matrix $\tilde{\mathbf{A}}$ formed
by interchanging rows and columns of $\mathbf{A}$ is called the transpose of
$\mathbf{A}$. A symmetric matrix is, then, its own transpose.

As in the case of tensors, which are merely a special kind of matrix,
the sum of two matrices is defined as the matrix in which each element
is the sum of the corresponding elements of the separate matrices.
Logically, the product of a matrix by a number $\mu$ is then taken to be
the matrix, each of whose elements is $\mu$ times that of the original
matrix.

Up to this point we have given merely an alternative way of writing
the results of p. 33. The matrix calculus becomes really fruitful when
we define the product of two matrices. This permits us to replace long-
winded summation formulas by a new symbolism. We start again with
a linear transformation which, through the matrix $\mathbf{A}$, gives us the
variables $z_k$ in terms of the variables $y_k$. If, in turn, the variables
$y_k$ are given in terms of $x_k$ through the medium of a matrix
$\mathbf{B}$, then the $z_k$ are themselves linear functions of the $x_k$. It
is readily computed that the coefficients $c_{ik}$ are given by

$$
c_{ik} = \sum_l a_{il} b_{lk}. \tag{113}
$$

This means that the matrix element $c_{ik}$ is obtained by multiplying each
element of the ith row of $\mathbf{A}$ by the corresponding element of the kth
column of $\mathbf{B}$ and adding the products. It is logical to designate this
combination as the product of the two matrices, since it can be shown
readily that it has the property of distributivity

$$
\mathbf{A}(\mathbf{B} + \mathbf{C}) = \mathbf{A}\mathbf{B} + \mathbf{A}\mathbf{C} \tag{114}
$$

as well as of associativity

$$
\mathbf{A}(\mathbf{B}\mathbf{C}) = (\mathbf{A}\mathbf{B})\mathbf{C}. \tag{115}
$$

* Although the matrix calculus will be used very little in the text, a brief account
of the basic principles is given here because of the increasing use of this branch of
mathematics.


On the other hand, like the vector product, it does not possess com- 
mutativity: The result is quite different from the previous if the z k 
and the y k are related by the matrix B, and the y k and the x k by the 
matrix A. 

It may happen that the product of two matrices is independent of
their order of multiplication; the two are then said to be commutative;
for example, the unit matrix $\mathbf{E}$ is commutative with every matrix.
The product of any matrix with the null matrix $\mathbf{0}$ is always the null
matrix itself. On the other hand, the product of two matrices, neither
of which is the null matrix, may happen to be the null matrix; it is
not proper to conclude that if the matrix product vanishes one of the
factors must be the null matrix. In view of the rule for forming the
elements of the product matrix, we have for the transpose of a product

$$
\widetilde{\mathbf{A}\mathbf{B}} = \tilde{\mathbf{B}}\tilde{\mathbf{A}}. \tag{116}
$$
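These properties of the matrix product are easily tried out on small examples. The Python sketch below implements (113) for matrices stored as lists of rows and exhibits a non-commuting pair, a pair of non-null matrices whose product is the null matrix, and the standard rule that the transpose of a product is the product of the transposes in reverse order, which is the reading of (116) assumed here. The sample matrices and function names are assumptions for illustration.

```python
def matmul(a, b):
    """c_ik = sum_l a_il b_lk; the columns of a must match the rows of b."""
    if any(len(row) != len(b) for row in a):
        raise ValueError("columns of the first factor must equal rows of the second")
    return [[sum(a[i][l] * b[l][k] for l in range(len(b)))
             for k in range(len(b[0]))]
            for i in range(len(a))]


def transpose(a):
    """Interchange rows and columns."""
    return [list(column) for column in zip(*a)]


A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
AB = matmul(A, B)            # [[2, 1], [4, 3]]
BA = matmul(B, A)            # [[3, 4], [1, 2]]

# Neither factor is the null matrix, yet the product is.
N = [[0, 1], [0, 0]]
M = [[1, 0], [0, 0]]
NM = matmul(N, M)            # [[0, 0], [0, 0]]

# Transpose of a product.
left = transpose(matmul(A, B))
right = matmul(transpose(B), transpose(A))

# A matrix of order 2,3 times a single-column matrix of order 3,1.
C = [[1, 0, 2], [0, 1, -1]]
r = [[1], [2], [3]]
Cr = matmul(C, r)            # [[7], [-1]]
```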

The definition of matrix product is not confined to square matrices.
The procedure designated by (113) can be carried out as long as the
number of columns of the first matrix is equal to the number of rows
of the second. Thus matrices of order m, n may always be multiplied
by those of order n, r; the product will be a matrix of order m, r. This
permits us to write vectors as single-column or single-row matrices, e.g.

$$
\mathbf{r} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}. \tag{117}
$$

The transformation prescribed by a matrix $\mathbf{A}$ can be written

$$
\mathbf{r}' = \mathbf{A}\mathbf{r}. \tag{118}
$$

The product of a matrix $\mathbf{A}$ of order 3,3 with the matrix $\mathbf{r}$
of order 3,1 is a matrix of order 3,1 and so is again a vector.

If the determinant formed from the coefficients of a given square
matrix is different from zero, then the equations by which the matrix
represents the $y_k$ as functions of the $x_k$ may be solved for the $x_k$ as
functions of the $y_k$. The new scheme which results is the inverse, or
reciprocal, of the original matrix and is designated $\mathbf{A}^{-1}$. Since
application of $\mathbf{A}$ to the $x_k$ gives us the $y_k$, and application of
$\mathbf{A}^{-1}$ to the $y_k$ again gives us the $x_k$, it must be true
that

$$
\mathbf{A}^{-1}\mathbf{A} = \mathbf{A}\mathbf{A}^{-1} = \mathbf{E}. \tag{119}
$$

The second equality may be verified by starting with the $y_k$. Since the
inverse of the inverse matrix must again be the original matrix,

$$
(\mathbf{A}^{-1})^{-1} = \mathbf{A}. \tag{120}
$$

Moreover,

$$
(\mathbf{A}\mathbf{B})^{-1} = \mathbf{B}^{-1}\mathbf{A}^{-1}. \tag{121}
$$

This is true because if we multiply both sides of the equation by
$\mathbf{A}\mathbf{B}$,

$$
\mathbf{E} = \mathbf{A}\mathbf{B}\mathbf{B}^{-1}\mathbf{A}^{-1} = \mathbf{A}\mathbf{A}^{-1} = \mathbf{E}, \tag{122}
$$

on account of the associative law.

An orthogonal matrix is one in which the sum of the products of
corresponding members of two rows has the following properties:

$$
\sum_k a_{ik}a_{lk} =
\begin{cases} 0 & \text{for } l \neq i,\\ 1 & \text{for } l = i. \end{cases}
\tag{123}
$$

The definition may be expressed in another way. The product sums
defined above represent the product of the matrix $\mathbf{A}$ with its
transpose $\tilde{\mathbf{A}}$. If a matrix is orthogonal, then

$$
\mathbf{A}\tilde{\mathbf{A}} = \mathbf{E}. \tag{124}
$$

However, this is also the definition of the matrix $\mathbf{A}^{-1}$ which is
reciprocal to $\mathbf{A}$. Thus an orthogonal matrix has the property that its
transpose is the same as its reciprocal. An example of an orthogonal
matrix is the set of direction cosines of two rectangular co-ordinate
systems that differ in orientation (equation (64), p. 33).
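As a small check of the orthogonality property, the Python sketch below builds a matrix of direction cosines from two successive rotations and verifies that its product with its transpose is the unit matrix to within rounding error. The rotation angles and helper names are assumptions for illustration.

```python
import math


def matmul(a, b):
    return [[sum(a[i][l] * b[l][k] for l in range(len(b)))
             for k in range(len(b[0]))]
            for i in range(len(a))]


def transpose(a):
    return [list(column) for column in zip(*a)]


def rotation_about_z(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]


def rotation_about_x(angle):
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, s], [0.0, -s, c]]


# Direction cosines for a rotation about z followed by a rotation about x.
A = matmul(rotation_about_x(0.3), rotation_about_z(1.1))

product = matmul(A, transpose(A))     # should be the unit matrix E
unit = [[1.0 if i == k else 0.0 for k in range(3)] for i in range(3)]
max_deviation = max(abs(product[i][k] - unit[i][k])
                    for i in range(3) for k in range(3))
```

Because the transpose plays the part of the reciprocal, the inverse rotation is obtained simply by interchanging rows and columns.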

The matrix calculus may be extended to include matrices with 
complex elements. The transposed matrix is then one where, in addition 
to the interchange of rows and columns, each element is replaced by 
its complex conjugate. A symmetric matrix then corresponds with the 
so-called Hermitian matrix, and the orthogonal matrix, because of 
$U\tilde{U} = E$, becomes identical with the unitary matrix. 

Ex. 22a. The question posed in Ex. 18 (p. 37) may be formulated in matrix 
notation as follows: Given an n-dimensional symmetric square matrix A. Find 
the vectors r (single-column matrices) for which 

$$A\mathbf{r} = \lambda\mathbf{r} \quad\text{or}\quad (A - \lambda E)\mathbf{r} = 0.$$

Show that (a) this leads to an equation of nth order in $\lambda$; (b) the r corresponding 
to various values of $\lambda$ are mutually perpendicular (since it is only the ratios of the 
components of r that enter, their magnitude may be taken equal to unity); 
(c) the matrix formed by using these vector components as rows is an orthogonal 
matrix, S; (d) the matrix $S^{-1}AS$ is a diagonal matrix with the roots $\lambda_i$ occupying 
the diagonal. 



CHAPTER II 


Mathematical Representation of Periodic Phenomena; 

Theory of Vibrations and Waves 

1. Simple Harmonic Vibrations. 

The simplest temporally periodic process (vibration) is described 
mathematically by the sine or cosine function. If the process repeats 
itself $\nu$ times per second, the function is either 

$$u = A\sin 2\pi\nu t \quad\text{or}\quad u = A\cos 2\pi\nu t. \tag{1}$$

A is called the amplitude and $\nu$ the frequency of the vibration. Besides 
the frequency $\nu$, we introduce the angular frequency $\omega = 2\pi\nu$ 
to simplify the notation. A phenomenon described by a simple sine 
or cosine function is not only the simplest vibration mathematically, 
but physically as well. It is called a simple harmonic vibration. 

The computations are considerably simplified by using imaginary 
exponentials instead of the trigonometric functions with their cumbersome 
addition theorems. The connexion between the two types of 
functions is given by the Euler Formula 

$$e^{i\omega t} = \cos\omega t + i\sin\omega t. \tag{2}$$

This notation also leads to an extremely important representation of 
a vibration, using the complex plane (cf. Chap. III); for 

$$z = Ae^{i\omega t} \tag{3}$$

represents a complex number whose representative point, or index, 
as we shall call it, describes a circle of radius A with angular velocity 
$\omega$. The projections on the real and imaginary axes are respectively 

$$x = R(z) = A\cos\omega t; \qquad y = I(z) = A\sin\omega t. \tag{4}$$

But since physics deals with real quantities, the final results of a 
computation using complex numbers must admit of translation into 
real magnitudes. This is very simple; an equation involving complex 
numbers means that the real as well as the imaginary parts satisfy 
the equation. We can therefore take either the real or the imaginary 
part of the equation as the physical statement. 
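
As an illustration, the complex representation can be tried out numerically. The sketch below, in Python, assumes arbitrary values for the amplitude, the frequency and the instant of time; the function name `index` merely echoes the term used above and is not taken from any library.

```python
import cmath
import math

def index(amplitude, omega, t):
    """Complex index z = A exp(i omega t) of a simple harmonic vibration."""
    return amplitude * cmath.exp(1j * omega * t)

A, nu = 2.0, 5.0              # illustrative amplitude and frequency (assumed)
omega = 2 * math.pi * nu      # angular frequency
t = 0.013                     # an arbitrary instant

z = index(A, omega, t)
real_projection = z.real      # should equal A cos(omega t)
imag_projection = z.imag      # should equal A sin(omega t)
radius = abs(z)               # the index stays on the circle of radius A
```

Either projection may be read as the physical vibration, in keeping with the remark that the real and the imaginary parts separately satisfy the equation.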

The square of the amplitude is often of importance. It is most 
readily obtained from the complex representation by multiplying the 
vibration function by its conjugate: 

$$A^2 = zz^{*}. \tag{5}$$


The great advantage of using complex numbers manifests itself in 
the case where two vibrations of the same frequency, but of different 
phase, are to be combined. If there is but one vibration function, the 
point at which we begin to count the time is of no consequence, and 
so we write this function in the form (1) or (3). If, however, we are 
required to add a second vibration to this one, the new oscillation will 
not, in general, attain its maximum value at the same instant as the 
original one, but at a certain interval before or after. In this case 
we say that there is a difference of phase $\delta$ between the two vibrations. 
In the real representation, then, 

$$u_1 = A_1\cos\omega t, \qquad u_2 = A_2\cos(\omega t + \delta),$$

and the resultant of the two oscillations is 

$$u_1 + u_2 = A_1\cos\omega t + A_2\cos\delta\cos\omega t - A_2\sin\delta\sin\omega t. \tag{6}$$

The right member may be written as a single cosine vibration by 
transforming the coefficients of $\sin\omega t$ and $\cos\omega t$ so that they assume 
the character of the sine and cosine of an angle $\varphi$, multiplied by the 
same coefficient: 

$$(A_1 + A_2\cos\delta)\cos\omega t - A_2\sin\delta\sin\omega t = \sqrt{(A_1 + A_2\cos\delta)^2 + A_2^2\sin^2\delta}\,\left\{\frac{A_1 + A_2\cos\delta}{\sqrt{(A_1 + A_2\cos\delta)^2 + A_2^2\sin^2\delta}}\cos\omega t - \frac{A_2\sin\delta}{\sqrt{(A_1 + A_2\cos\delta)^2 + A_2^2\sin^2\delta}}\sin\omega t\right\}. \tag{7}$$

Put 

$$\frac{A_1 + A_2\cos\delta}{\sqrt{(A_1 + A_2\cos\delta)^2 + A_2^2\sin^2\delta}} = \cos\varphi; \tag{8}$$

$$\frac{A_2\sin\delta}{\sqrt{(A_1 + A_2\cos\delta)^2 + A_2^2\sin^2\delta}} = \sin\varphi. \tag{9}$$

Then 

$$u_1 + u_2 = \sqrt{A_1^2 + A_2^2 + 2A_1A_2\cos\delta}\,\cos(\omega t + \varphi) = A\cos(\omega t + \varphi). \tag{10}$$

According to the expression under the radical, the amplitude A of the 
resulting vibration is given as the third side of the triangle having the 
amplitudes $A_1$ and $A_2$ as adjacent sides, and $\delta$ as exterior angle (fig. 1). 
The difference in phase $\varphi$ between the resultant vibration and $u_1$ is 
the angle between the sides $A$ and $A_1$ of this triangle. 

This construction corresponds exactly with the addition, in the 
Gauss plane, of two complex numbers, which add like vectors. To 
obtain the resultant of two vibrations having equal frequencies, but 
differing in phase, add the corresponding complex numbers. The amplitude 
and phase of the resultant are given, respectively, by the length and 
direction of the radius vector representing this sum. 
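
The rule may be tested against formula (10). The following Python sketch assumes illustrative values of the two amplitudes and of the phase difference, adds the two indices at t = 0, and compares the result both with the radical in (10) and with the direct sum of the two cosine vibrations at an arbitrary instant.

```python
import cmath
import math

def resultant(a1, a2, delta):
    """Amplitude and phase of a1 cos(wt) + a2 cos(wt + delta) by complex addition."""
    z = a1 + a2 * cmath.exp(1j * delta)   # indices at t = 0; the triangle rotates rigidly
    return abs(z), cmath.phase(z)

a1, a2, delta = 3.0, 2.0, math.pi / 3     # illustrative values (assumed)
amplitude, phi = resultant(a1, a2, delta)

# Third side of the triangle, as given by the radical in (10).
amplitude_formula = math.sqrt(a1**2 + a2**2 + 2 * a1 * a2 * math.cos(delta))

# Direct comparison of the real vibrations at an arbitrary instant.
omega, t = 7.0, 0.4
direct_sum = a1 * math.cos(omega * t) + a2 * math.cos(omega * t + delta)
single_cosine = amplitude * math.cos(omega * t + phi)
```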

In electrical engineering, where this method is most frequently 
used, this representation is referred to as the “ vector diagram ”, on 
account of the way in which the quantities are combined. It must be 
remembered, however, that the complex numbers by means of which 
the vibration phenomenon is represented are not vectors in the physical 
sense; the latter are spatial, and are not confined to a plane. For this 
reason, it would be more correct to speak of the “ index diagram ” 
than of the vector diagram. This diagram may be drawn for any 
instant of time; but since the phase difference is constant, the entire 
triangle rotates as a rigid figure with the angular speed $\omega$, as the 
time advances. We may therefore choose any position — for example, 
the real axis — for the line of the first vibration. 




If more than two vibrations are to be compounded, we obtain the 
index of the resultant as before by laying off the individual indices in 
succession, like vectors, and connecting the initial point of the first 
index with the terminus of the last. A case occurring frequently in 
practice is the combination of a large number of vibrations of equal 
amplitude, there being a constant difference of phase $\Delta\varphi$ between 
successive vibrations. The vector diagram then becomes a portion of 
a polygon, which approaches a circular arc as the individual amplitudes 
decrease (fig. 2). The phase difference between the first and the Nth 
vibration is $(N - 1)\Delta\varphi$, which may be written $N\Delta\varphi$ if N is large. 
The greatest value of the resultant amplitude is $Na$, obtained when 
$\Delta\varphi = 0$. As $\Delta\varphi$ increases, the broken line becomes a circular arc of 
greater and greater curvature. If the arc closes to form a full circle, 
i.e. if $\Delta\varphi = 2\pi/N$, the resulting amplitude vanishes. As the phase 
difference further increases, a small arc remains after the circle is 
closed. This arc again yields a finite value of the resultant, until, at 
$\Delta\varphi = 4\pi/N$, a second zero point is reached, and so on. 

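For arbitrary phase differences $\Delta\varphi$ we obtain, from fig. 2, a 
resultant amplitude 

$$A = 2r\sin\frac{N\Delta\varphi}{2},$$

where $rN\Delta\varphi = Na$, i.e. 

$$A = Na\,\frac{\sin(N\Delta\varphi/2)}{N\Delta\varphi/2},$$

or, putting $N\Delta\varphi = \varphi$, 

$$A = Na\,\frac{\sin(\varphi/2)}{\varphi/2}.$$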
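
The arc construction can be compared with the exact sum of the N indices. The sketch below, in Python, assumes N = 200 vibrations of individual amplitude a = 0.01; both numbers, and the function names, are chosen here for illustration only.

```python
import cmath
import math

def polygon_resultant(n, a, dphi):
    """Exact amplitude of n equal vibrations with constant phase step dphi."""
    return abs(sum(a * cmath.exp(1j * k * dphi) for k in range(n)))

def arc_resultant(n, a, dphi):
    """Circular-arc value A = N a sin(N dphi / 2) / (N dphi / 2)."""
    half = n * dphi / 2
    return n * a if half == 0 else n * a * math.sin(half) / half

N, a = 200, 0.01                                    # illustrative values (assumed)
in_phase = polygon_resultant(N, a, 0.0)             # greatest value, N a
closed_circle = polygon_resultant(N, a, 2 * math.pi / N)   # first zero point
half_way_exact = polygon_resultant(N, a, math.pi / N)
half_way_arc = arc_resultant(N, a, math.pi / N)
```

For a large number of small amplitudes the polygon and the arc differ only slightly, and the first zero falls where the arc closes to a full circle.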




The representation by complex numbers also gives a simple 
construction for the derivative. If 

$$z = Ae^{i\omega t},$$

then 

$$\frac{dz}{dt} = i\omega Ae^{i\omega t} = i\omega z.$$

This is a complex number obtained by multiplying z by $\omega$ and turning 
it through an angle $\pi/2$ in the counter-clockwise direction. 
Differentiating again, 

$$\frac{d^2z}{dt^2} = -\omega^2 Ae^{i\omega t} = -\omega^2 z.$$

This is a complex number whose index again falls in the line Oz, but 
in the opposite direction. We see, in addition, that a simple harmonic 
vibration satisfies the differential equation 

$$\frac{d^2z}{dt^2} + \omega^2 z = 0, \tag{12}$$

which is called the differential equation of an (undamped) simple 
harmonic vibration. Similarly, the indefinite integral 

$$\int z\,dt = \frac{1}{i\omega}Ae^{i\omega t} = -\frac{i}{\omega}z$$

is found by rotating the index z through an angle $\pi/2$ in the clockwise 
direction, and dividing by $\omega$. 
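
The two constructions may be verified numerically. In the following Python sketch the amplitude, angular frequency, instant and step length are illustrative assumptions; the derivative is approximated by a central difference.

```python
import cmath
import math

AMPLITUDE, OMEGA = 1.5, 4.0        # illustrative values (assumed)

def z(t):
    """Index of the vibration at time t."""
    return AMPLITUDE * cmath.exp(1j * OMEGA * t)

t, h = 0.3, 1e-6
numerical_derivative = (z(t + h) - z(t - h)) / (2 * h)   # central difference
rotated_index = 1j * OMEGA * z(t)                        # i omega z
turn_of_derivative = cmath.phase(rotated_index / z(t))   # expected +pi/2

integral_index = z(t) / (1j * OMEGA)                     # equals -(i/omega) z
turn_of_integral = cmath.phase(integral_index / z(t))    # expected -pi/2
```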


2. Representation of more complicated Periodic Phenomena by Series 
of Harmonic Terms. Fourier Series. The Fourier Integral. 

Let a purely arbitrary process, which is repeated $\nu$ times per second, 
be represented by u = f(t). That is to say, we have 

$$f\left(t + \frac{1}{\nu}\right) = f(t). \tag{13}$$

Let us make the proviso that f(t) is not to pass through infinitely many 
maxima and minima within the time of one vibration, and that the 
integral $\int f(t)\,dt$ is to converge even if the function has a finite number 
of infinite values in this range. A finite number of discontinuities is 
also permissible. Under these conditions the function f(t) may be 
represented over a complete period, and thus from $t = -\infty$ to 
$t = +\infty$, except at the discontinuities, by a series of simple harmonic 
functions, the frequencies in which are integral multiples of the 
fundamental frequency. Such series are called Fourier Series, after their 
discoverer. For a proof of the possibility of developing f(t) in a Fourier 
series under these very general conditions, we must refer to textbooks on 
mathematics. Here we shall give only the method of determining the 
coefficients. It is most convenient to start from the complex 
representation and write 


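$$f(t) = a_0 + a_1e^{i\omega t} + a_2e^{2i\omega t} + \dots + a_ne^{in\omega t} + \dots + a_{-1}e^{-i\omega t} + a_{-2}e^{-2i\omega t} + \dots + a_{-n}e^{-in\omega t} + \dots = \sum_{n=-\infty}^{+\infty} a_ne^{in\omega t}. \tag{14}$$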

Since the left member is real, the coefficients of the series on the right 
must be such that no imaginary terms occur. To determine $a_0$, we 
integrate both sides over one complete period, i.e. from 0 to $2\pi/\omega$. 
Because of the periodicity of the complex exponential function, all 
contributions from the terms on the right vanish, with the exception 
of $a_0$, for 

$$\int_0^{2\pi/\omega} a_ne^{in\omega t}\,dt = \frac{a_n}{in\omega}\Big[e^{in\omega t}\Big]_0^{2\pi/\omega} = \frac{a_n}{in\omega}\left(e^{2\pi in} - e^{0}\right) = 0. \tag{15}$$

On the other hand, $a_0$ is determined by the equation 

$$\int_0^{2\pi/\omega} f(t)\,dt = \frac{2\pi}{\omega}a_0, \quad\text{or}\quad a_0 = \frac{\omega}{2\pi}\int_0^{2\pi/\omega} f(t)\,dt = \overline{f(t)}. \tag{16}$$

$\overline{f(t)}$ is the usual notation for the mean value of f(t). 




To determine the other coefficients, we multiply both sides by $e^{-in\omega t}$ 
and integrate, as before, from 0 to $2\pi/\omega$. Again, all terms on the right 
cancel out (because of the periodicity of the imaginary exponentials) 
except the $a_n$-term, which contains no exponential factor, and which 
gives the value $(2\pi/\omega)a_n$ on integration. Thus we have 

$$a_n = \frac{\omega}{2\pi}\int_0^{2\pi/\omega} f(t)\,e^{-in\omega t}\,dt, \tag{17}$$

and similarly 

$$a_{-n} = \frac{\omega}{2\pi}\int_0^{2\pi/\omega} f(t)\,e^{in\omega t}\,dt. \tag{18}$$
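
Formulae (17) and (18) lend themselves to direct numerical evaluation. The following Python sketch replaces the integral by a simple rectangle sum over one period (a numerical scheme assumed here, not prescribed by the text) and applies it to a hypothetical test function whose coefficients are known beforehand.

```python
import cmath
import math

def fourier_coefficient(f, n, omega, samples=1024):
    """a_n = (omega / 2 pi) * integral over one period of f(t) exp(-i n omega t) dt,
    approximated by a rectangle sum with `samples` equal steps."""
    period = 2 * math.pi / omega
    dt = period / samples
    total = sum(f(k * dt) * cmath.exp(-1j * n * omega * k * dt)
                for k in range(samples))
    return total * dt / period

OMEGA = 3.0                      # illustrative fundamental angular frequency

def example(t):
    """Hypothetical test function 1 + 2 cos(wt) + 0.5 sin(2wt)."""
    return 1.0 + 2.0 * math.cos(OMEGA * t) + 0.5 * math.sin(2 * OMEGA * t)

a0 = fourier_coefficient(example, 0, OMEGA)         # mean value, 1
a1 = fourier_coefficient(example, 1, OMEGA)         # 1
a_minus1 = fourier_coefficient(example, -1, OMEGA)  # 1
a2 = fourier_coefficient(example, 2, OMEGA)         # -0.25 i
a_minus2 = fourier_coefficient(example, -2, OMEGA)  # +0.25 i
```

The coefficients with indices n and −n come out as complex conjugates, which is the condition for the series (14) to represent a real function.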


The reduction to the usual real form of the Fourier series, 

$$f(t) = b_0 + b_1\cos\omega t + \dots + b_n\cos n\omega t + \dots + c_1\sin\omega t + \dots + c_n\sin n\omega t + \dots = \sum_{0}^{\infty} b_n\cos n\omega t + \sum_{1}^{\infty} c_n\sin n\omega t, \tag{19}$$

is very simple. If the real and imaginary parts of each term in the 
complex form are separated, and the terms in $a_n$ and $a_{-n}$ are paired, 
then we have 

$$a_ne^{in\omega t} = \frac{\omega}{2\pi}(\cos n\omega t + i\sin n\omega t)\left\{\int_0^{2\pi/\omega} f(t)\cos n\omega t\,dt - i\int_0^{2\pi/\omega} f(t)\sin n\omega t\,dt\right\},$$

$$a_{-n}e^{-in\omega t} = \frac{\omega}{2\pi}(\cos n\omega t - i\sin n\omega t)\left\{\int_0^{2\pi/\omega} f(t)\cos n\omega t\,dt + i\int_0^{2\pi/\omega} f(t)\sin n\omega t\,dt\right\},$$

$$a_ne^{in\omega t} + a_{-n}e^{-in\omega t} = \frac{\omega}{\pi}\cos n\omega t\int_0^{2\pi/\omega} f(t)\cos n\omega t\,dt + \frac{\omega}{\pi}\sin n\omega t\int_0^{2\pi/\omega} f(t)\sin n\omega t\,dt,$$

i.e. 

$$b_0 = \overline{f(t)}; \qquad b_n = \frac{\omega}{\pi}\int_0^{2\pi/\omega} f(t)\cos n\omega t\,dt; \qquad c_n = \frac{\omega}{\pi}\int_0^{2\pi/\omega} f(t)\sin n\omega t\,dt. \tag{20}$$

It is to be noted that the summation extends from $n = -\infty$ to 
$n = +\infty$ in the complex representation, but from 0 or from 1 to $+\infty$ 
if the real representation is used.* 

* In most physical problems f(t) is not given by an analytical expression, but by an 
empirical curve, so that the integrals must be evaluated numerically or graphically. 
Computing methods as well as mechanical devices called harmonic analysers 
have been worked out for this purpose. 

By inserting the expressions for the coefficients, we can represent 
the Fourier Series by a single summation. However, it is necessary to 
assign a new symbol for the variable of integration, since otherwise 
t would be used for two different purposes. If the integration variable 
is called $\alpha$, then 

$$f(t) = \sum_{n=-\infty}^{+\infty} \frac{\omega}{2\pi}\,e^{in\omega t}\int_0^{2\pi/\omega} f(\alpha)\,e^{-in\omega\alpha}\,d\alpha = \sum_{n=-\infty}^{+\infty} \frac{\omega}{2\pi}\int_0^{2\pi/\omega} f(\alpha)\,e^{in\omega(t-\alpha)}\,d\alpha. \tag{21}$$

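We introduce the time of one complete vibration, the period 
$T = 2\pi/\omega$. Also, on account of the periodicity of the function f(t), 
we can take the range of integration from $-T/2$ to $+T/2$ instead of 
from 0 to T. Then there results 

$$f(t) = \sum_{n=-\infty}^{+\infty} \frac{1}{T}\int_{-T/2}^{+T/2} f(\alpha)\,e^{2\pi in(t-\alpha)/T}\,d\alpha. \tag{22}$$

At this point we can pass to non-periodic functions, which may be 
looked upon as extreme cases of periodic functions whose period 
extends from $-\infty$ to $+\infty$. If we put $1/T = \Delta s$, then 

$$f(t) = \sum_{n=-\infty}^{+\infty} \Delta s\int_{-T/2}^{+T/2} f(\alpha)\,e^{2\pi in\Delta s(t-\alpha)}\,d\alpha. \tag{23}$$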


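The integral $\int_0^{\infty}\varphi(s)\,ds$ is defined as the limit, for $\Delta s$ 
infinitely small, of the sum 

$$\sum_{n=0}^{\infty}\varphi(n\Delta s)\,\Delta s.$$

Also, 

$$\int_{-\infty}^{+\infty}\varphi(s)\,ds = \lim_{\Delta s\to 0}\sum_{n=-\infty}^{+\infty}\varphi(n\Delta s)\,\Delta s.$$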

From this result it follows that, as T grows beyond all bound, the 
expression (23) passes over into the Fourier Integral 

$$f(t) = \int_{-\infty}^{+\infty} ds\int_{-\infty}^{+\infty} f(\alpha)\,e^{2\pi is(t-\alpha)}\,d\alpha = \int_{-\infty}^{+\infty} e^{2\pi ist}\,ds\int_{-\infty}^{+\infty} f(\alpha)\,e^{-2\pi is\alpha}\,d\alpha.$$

The second form discloses the meaning of this identity: the 
non-periodic function f may be expressed by a continuous series of 
harmonic periodic functions whose amplitudes are given as functions of 
the sequence number s by 

$$\int_{-\infty}^{+\infty} f(\alpha)\,e^{-2\pi is\alpha}\,d\alpha.$$

The possibility of such a representation is of great importance in the 
analytical treatment of functions which are otherwise not expressible 
by a unified mathematical expression. Let there be given, for 
example, a simple harmonic vibration which, however, is not of 
indefinite duration, i.e. we suppose it to start at t = 0 and cease at 
t = T. Such a vibration cannot be represented by $e^{2\pi i\nu t}$, since this is 
the expression for a vibration existing from $t = -\infty$ to $t = +\infty$. 
We can, nevertheless, insert the function f, so defined, in the Fourier 
Integral and determine the distribution of frequencies therein. This 
distribution has, naturally, a maximum for $s = \nu$. The shorter the time 
the vibration lasts, the wider the “frequency band” covered by it. 
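
The widening of the frequency band may be illustrated numerically. The Python sketch below evaluates the amplitude integral for the truncated vibration $e^{2\pi i\nu t}$ lasting from 0 to T by a midpoint sum; the frequency of 50 per second and the two durations are assumptions made for the example.

```python
import cmath
import math

def spectrum_amplitude(s, nu, duration, samples=2000):
    """Modulus of the integral of exp(2 pi i nu a) exp(-2 pi i s a) da from 0 to T,
    approximated by the midpoint rule."""
    step = duration / samples
    total = sum(cmath.exp(2j * math.pi * (nu - s) * (k + 0.5) * step)
                for k in range(samples))
    return abs(total * step)

NU = 50.0                          # illustrative frequency (assumed)
SHORT, LONG = 0.1, 0.5             # two illustrative durations T (assumed)

peak_short = spectrum_amplitude(NU, NU, SHORT)            # maximum at s = nu
first_zero_short = spectrum_amplitude(NU + 1 / SHORT, NU, SHORT)

# Relative amplitude two cycles per second away from the maximum.
off_peak_short = spectrum_amplitude(NU + 2.0, NU, SHORT) / peak_short
off_peak_long = spectrum_amplitude(NU + 2.0, NU, LONG) / spectrum_amplitude(NU, NU, LONG)
```

For the short vibration the amplitude two cycles per second away from the maximum is still more than nine-tenths of the peak value, whereas for the longer vibration it has already dropped to zero.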

By taking the quantities in pairs and applying the Euler Formula, 
as in the case of the Fourier Series (p. 55), we can reduce the complex 
form of the Fourier Integral to a real form: 


$$f(t) = 2\int_0^{\infty} ds\int_{-\infty}^{+\infty} f(\alpha)\cos 2\pi s(t-\alpha)\,d\alpha. \tag{26}$$

Ex. 23. Develop, in a Fourier Series, the periodic function which has the value 
0 from 0 to T/2, and the value 1 from T/2 to T, then 0 once more, and so on 
(sometimes called a “square wave”). 

Ex. 24. Express $e^{-at}\cos\omega t$ by a continuous series of undamped harmonic 
vibrations, and find the frequency of the component vibration for which the 
value of the square of the amplitude is half the maximum value. 


3. Modulated Vibrations and Beats. 

In wireless telephony we deal with “modulated” vibrations in 
the transmitter. These are oscillations in which the peak amplitude 
itself is a periodic function of the time. The amplitude, however, 
changes slowly compared with the frequency of the actual vibration. 
The latter vibration, the so-called “carrier wave”, has a frequency 
of the order of 1,000,000 sec$^{-1}$, while the frequency of modulation, 
which is the frequency of the radiated tone, is around 500 sec$^{-1}$. 
Using real functions, a modulated vibration of this kind may be 
represented by 

$$u = A\cos\omega_1 t\cos\omega_2 t, \qquad (\omega_2 \gg \omega_1). \tag{27}$$

But, by a familiar formula of trigonometry, 

$$2\cos\omega_1 t\cos\omega_2 t = \cos(\omega_2 + \omega_1)t + \cos(\omega_2 - \omega_1)t. \tag{28}$$

The use of this formula enables us to write (27) in the form 

$$u = \frac{A}{2}\cos(\omega_2 + \omega_1)t + \frac{A}{2}\cos(\omega_2 - \omega_1)t. \tag{29}$$

This signifies the combination of two vibrations of equal amplitudes, 
and having frequencies $\omega_2 + \omega_1$ and $\omega_2 - \omega_1$, which are very close 
together. In practice, this type of vibration may be produced in 
either of these two ways, i.e. by modulating a carrier frequency, or 
by combining two vibrations which differ but little in frequency. The 
latter case, because of its well-known acoustical manifestation, is 
known as the phenomenon of beats. The angular frequency of the 
beats is $\omega_2 - \omega_1$. It is to be remembered, however, that the 
phenomenon itself is correctly represented by either (27) or (29). 
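
The equivalence of (27) and (29) may be confirmed numerically. The Python sketch below uses the orders of magnitude quoted above for the carrier and the modulation; the amplitude and the sampling instants are assumptions made for the example.

```python
import math

def modulated(t, amplitude, w1, w2):
    """Form (27): carrier of angular frequency w2 with slowly varying amplitude."""
    return amplitude * math.cos(w1 * t) * math.cos(w2 * t)

def two_vibrations(t, amplitude, w1, w2):
    """Form (29): two vibrations of equal amplitude and neighbouring frequencies."""
    return 0.5 * amplitude * (math.cos((w2 + w1) * t) + math.cos((w2 - w1) * t))

W1 = 2 * math.pi * 500.0        # modulation, about 500 per second
W2 = 2 * math.pi * 1.0e6        # carrier, about 1,000,000 per second
times = [k * 1.0e-7 for k in range(2000)]   # sampling instants (assumed)

largest_gap = max(abs(modulated(t, 1.0, W1, W2) - two_vibrations(t, 1.0, W1, W2))
                  for t in times)
```

The two forms agree to rounding error at every sampled instant, so that either may be taken as the description of the phenomenon.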

The occurrence of beats represents the simplest case of an almost-periodic 
phenomenon. If, for example, $\omega_1$ and $\omega_2$ are not commensurable, then, strictly 
speaking, the function u has no definite period, i.e. there is no fixed interval T 
such that the value of u at any time t is repeated at time t + T. Nevertheless, 
within a certain latitude of departure $\varepsilon$, a “time of recurrence” may be defined. 


4. Combination of Vibrations along different Axes. Lissajous’ Figures. 

Thus far we have considered perfectly general periodic functions u , 
which could be represented either in a co-ordinate system with t as 
abscissa and u as ordinate, or in the Argand diagram of the complex 
plane. Now we wish to give our function a particular meaning: let u 
be the co-ordinate x of a point moving along the x-axis. If, now, the 
other co-ordinates (components of the radius vector) are also periodic 
functions of the time, we inquire the nature of the curve described 
by P, the extremity of the radius vector. We restrict ourselves to the 
important case of motion in a plane, the z-co-ordinate being omitted. 
The motion in the xy-plane is to be distinguished, of course, from 
rotation of the index in a vector diagram. If we desire to employ the 
latter representation here also, we must introduce a complex plane 
with rotating index for each co-ordinate. For the discussion of the 
orbit, we thus find it simpler to start from the real representation and 
write 

$$x = a\cos\omega t, \qquad y = b\cos(\omega t - \delta). \tag{30}$$

It is thus specified, for the present, that the two vibrations have the 
same frequency. We obtain widely different vibration figures for 
differing values of the phase difference $\delta$. First, let us take $\delta = 0$. In 
this case, the point remains on the straight line 

$$\frac{y}{x} = \frac{b}{a},$$

which is the diagonal of a rectangle with sides a and b. This motion 
is called the linear form of vibration. If we next take $\delta$ between 0 and 
$\pi/2$, elimination of t in (30) gives the equation of the orbit as 

$$y = b\,\frac{x}{a}\cos\delta + b\sqrt{1 - \frac{x^2}{a^2}}\,\sin\delta.$$

This is the equation of a second degree curve. From the parametric 
form (30), it is evident that neither the x- nor the y-co-ordinate can 
become infinite, i.e. the curve must be an ellipse. Its principal axes 
may be determined by the methods of analytical geometry. For 
$\delta = \pi/2$ we have 

$$x = a\cos\omega t, \qquad y = b\sin\omega t.$$

This, however, is the familiar parametric representation of the ellipse 

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1,$$

whose principal axes are the co-ordinate axes. 

The direction in which the ellipse (30) is described may be seen 
from the parametric representation. If we start at t = 0, then P is at 
its greatest distance from O in the direction of the x-axis, i.e. the x of 
P is a maximum. As t increases, x decreases, but (if $\delta$ is less than $\pi$) 
y increases in the direction of the positive y-axis. This form of 
vibration is called a left-handed elliptic vibration. If $\delta$ is allowed to 
increase to the value $\pi$, 

$$x = a\cos\omega t, \qquad y = -b\cos\omega t, \qquad \frac{y}{x} = -\frac{b}{a}.$$

This is the original linear vibration reflected in the y-axis. Assigning 
$\delta$ the value $3\pi/2$ gives the same ellipse as for $\delta = \pi/2$; but this curve 
is traversed in the opposite sense, and we have a case of right-handed 
elliptic motion. We again obtain the linear motion with which we 
started by putting $\delta = 2\pi$. The intermediate ellipses may be drawn with 
the aid of the above considerations (cf. fig. 3). If the amplitudes a and 
b are equal, we have the special case of motion in a circle. This 
may be either right-handed or left-handed. 

If we employ exponentials instead of trigonometric functions, 

$$x = z_1 = ae^{i\omega t}, \qquad y = z_2 = be^{i(\omega t - \delta)}. \tag{32}$$

This shows that the ratio of the components of the radius vector is 
real only when the phase difference is 0 or $\pi$, i.e. when the vibration 
is linear, for 

$$\frac{y}{x} = \frac{z_2}{z_1} = \frac{b}{a}\,e^{-i\delta}.$$

Thus the complex number representation provides a simple criterion of an 
elliptic vibration: the ratio of the components x and y is complex. 
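
Both statements can be checked numerically. The Python sketch below assumes illustrative amplitudes and phase differences. It tests the points of (30) against the second-degree form $x^2/a^2 + y^2/b^2 - 2(xy/ab)\cos\delta = \sin^2\delta$, which follows on squaring the orbit equation and is written out here for convenience, and it evaluates the ratio of the complex components.

```python
import cmath
import math

def orbit_point(a, b, delta, phase):
    """Point of the orbit (30) for a given value of omega t."""
    return a * math.cos(phase), b * math.cos(phase - delta)

def conic_residual(a, b, delta, x, y):
    """x^2/a^2 + y^2/b^2 - 2 (x y / a b) cos(delta) - sin^2(delta); zero on the orbit."""
    return ((x / a) ** 2 + (y / b) ** 2
            - 2 * (x * y / (a * b)) * math.cos(delta) - math.sin(delta) ** 2)

def component_ratio(a, b, delta):
    """Ratio z2 / z1 = (b / a) exp(-i delta) of the complex components."""
    return (b / a) * cmath.exp(-1j * delta)

a, b = 2.0, 1.0                                   # illustrative amplitudes (assumed)
delta = math.pi / 3                               # illustrative phase difference
phases = [2 * math.pi * k / 24 for k in range(24)]
worst_residual = max(abs(conic_residual(a, b, delta, *orbit_point(a, b, delta, p)))
                     for p in phases)

linear_ratios = [component_ratio(a, b, d) for d in (0.0, math.pi)]
elliptic_ratio = component_ratio(a, b, delta)
```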

If the two frequencies are not equal, the phase difference is 
constantly changing. For the case where the frequency difference is 
small compared to the individual frequencies, the phase relation 
changes slowly, and we can write 

$$x = a\cos\omega_1 t,$$

$$y = b\cos[\omega_1 t + (\omega_2 - \omega_1)t] = b\cos(\omega_1 t + \delta).$$

[Fig. 4] 

The phase difference $\delta$ thus becomes a linear function of the time, 
and the point P traverses in smooth succession all the elliptical curves 
corresponding to the various values of $\delta$. The resulting pattern, called 
a Lissajous figure, after its discoverer, has the property of gradually 
covering the entire rectangle of sides a and b (cf. fig. 4). The Lissajous 
figure is the simplest example of a conditionally periodic system. In this 
type of system, which is of importance in Celestial Mechanics and, to a 
certain extent, also in Atomic Mechanics, each co-ordinate of P is, in 
fact, a periodic function of the time. The orbit itself is closed only 
when a condition of commensurability connects the two periods. In this 
case alone can we speak of a periodic motion. 

Since any vector (e.g. the vector representing the electric field 
strength) may be represented as a radius vector, our considerations 
are perfectly general, and hold for the motion of the terminus of a 
vector, as well as for the motion of a material particle. It is further 
of importance to note that a right-handed and a left-handed circular 
motion of the same frequency and amplitude always combine to form 
a linear vibration, whose axis is determined by the phase difference 
of the two motions. If the left-handed motion is 


$$x_1 = a\cos\omega t, \qquad y_1 = a\sin\omega t, \tag{35}$$

and the right-handed motion is 

$$x_2 = a\cos(\omega t - \delta), \qquad y_2 = -a\sin(\omega t - \delta), \tag{36}$$

then a simple trigonometric transformation gives 

$$x_1 + x_2 = 2a\cos\frac{\delta}{2}\cos(\omega t - \delta/2). \tag{37}$$

Similarly, 

$$y_1 + y_2 = 2a\sin\frac{\delta}{2}\cos(\omega t - \delta/2). \tag{38}$$

This shows that the x- and y-components of the resulting motion do 
not differ in phase, and the vibration is linear. The direction of the 
axis is given by division of the two equations: 

$$\frac{y_1 + y_2}{x_1 + x_2} = \tan\frac{\delta}{2}. \tag{39}$$

This result is still more easily obtained by transferring the circular 
vibrations to the complex plane, where they are represented by 

$$z_1 = ae^{i\omega t} \quad\text{and}\quad z_2 = ae^{-i(\omega t - \delta)}. \tag{40}$$

The sum becomes 

$$z_1 + z_2 = ae^{i\delta/2}\left\{e^{i(\omega t - \delta/2)} + e^{-i(\omega t - \delta/2)}\right\} = 2ae^{i\delta/2}\cos(\omega t - \delta/2). \tag{41}$$

This is a vibration taking place in a line making an angle $\delta/2$ with the 
real axis. 

This method, which is always applicable to circular vibrations, is not 
to be confused with the vector diagram introduced in § 1 (p. 52). There, 
only the real part of the complex number had physical significance; in 
circular vibration, the complex number itself gives the correct picture 
of the process. 

The same result is obtained diagrammatically by combining the 
instantaneous radius vectors, and again the geometry of the figure gives 
the angle $\delta/2$ (fig. 5). 

Ex. 25. Draw, for several values of the phase difference, the closed vibration 
figure resulting from combination of an x- and a y-vibration having the ratio 
of frequencies 1:2. 



5. The Propagation of Periodic Disturbances in the form of Waves. 

So far we have confined our attention to the particular point 
in space at which a given vibration is taking place; in what follows 
we shall consider all points of the region. First we take the simple 
case in which motion in a certain direction (call this the x-axis) is 
independent of the y- and z-co-ordinates, i.e. is a function of x alone. At 
x = 0 let 

$$u(t, 0) = Ae^{i\omega t}. \tag{42}$$

We call a process a wave if the same state of vibration exists at x = x 
as at x = 0, but with a difference of phase corresponding to the finite 
velocity of propagation v of the phase.* 

* v is also referred to as the “wave velocity”. 

That is, at the time t, the same condition obtains at x as held at 0 at 
the time t − x/v. We thus write * 

$$u(t, x) = Ae^{i\omega(t - x/v)}. \tag{43}$$

In this way u becomes not only a function of the time, but also of 
the co-ordinate x. If we imagine a snapshot of the process to be made 
at a time $t_0$, then the corresponding equation is 

$$u(t_0, x) = Ae^{i\omega t_0}\,e^{-i\omega x/v}.$$

The instantaneous value at 0 recurs if x increases by an integral 
multiple of 

$$\frac{2\pi v}{\omega} = \frac{v}{\nu},$$

for in this case the argument of the imaginary exponential increases 
by $2\pi$, and the function remains unchanged. This distance, which gives 
the separation of the points of equal phase, is called the wave-length $\lambda$. 
From the above considerations, $\lambda$ obeys the fundamental equation of 
wave theory: 

$$\lambda\nu = v: \tag{45}$$

Wave-length × Frequency = velocity of propagation of the phase. 
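
As a numerical illustration of (43) and (45), the Python sketch below assumes a frequency of 440 per second and a phase velocity of 340 metres per second (values chosen here, roughly those of a musical tone in air) and confirms that the wave function is unchanged when x is increased by one wave-length.

```python
import cmath
import math

def plane_wave(t, x, amplitude, nu, v):
    """u(t, x) = A exp(i omega (t - x / v)), equation (43)."""
    omega = 2 * math.pi * nu
    return amplitude * cmath.exp(1j * omega * (t - x / v))

def wavelength(nu, v):
    """Fundamental equation of wave theory: lambda * nu = v."""
    return v / nu

NU, V = 440.0, 340.0              # illustrative frequency and phase velocity (assumed)
lam = wavelength(NU, V)

t0, x0 = 0.002, 0.15              # an arbitrary instant and position
here = plane_wave(t0, x0, 1.0, NU, V)
one_wavelength_on = plane_wave(t0, x0 + lam, 1.0, NU, V)
half_wavelength_on = plane_wave(t0, x0 + lam / 2, 1.0, NU, V)
```

At half a wave-length the function takes the opposite sign, as is to be expected for points whose phases differ by $\pi$.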


A wave of the kind described, in which u depends, in addition to 
the time, only upon the co-ordinate of the direction of propagation, 
is called a plane wave, since u is constant in any plane perpendicular 
to the direction of propagation. The simple harmonic plane wave is 
a particular integral of a partial differential equation which is readily 
deduced. If u is differentiated twice with respect to t, and twice with 
respect to x, we obtain 

$$\frac{\partial^2 u}{\partial t^2} = -\omega^2 Ae^{i\omega(t - x/v)},$$

and 

$$\frac{\partial^2 u}{\partial x^2} = -\frac{\omega^2}{v^2}\,Ae^{i\omega(t - x/v)},$$

so that 

$$\frac{\partial^2 u}{\partial x^2} = \frac{1}{v^2}\frac{\partial^2 u}{\partial t^2}.$$

The general integral of this second order partial differential equation 
must contain two arbitrary functions. As may be verified readily, 
any function f of (t − x/v) satisfies the equation, but so does any 
function g of (t + x/v). The latter corresponds to a wave moving in 
the direction of the negative x-axis. The general integral is then 

$$u = f(t - x/v) + g(t + x/v). \tag{48}$$
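
The statement that arbitrary functions of (t − x/v) and (t + x/v) satisfy the equation can be tried numerically. The Python sketch below assumes two hypothetical functions, a bell-shaped pulse for f and a sine for g, and compares the second derivatives, approximated by central differences, at an arbitrary point.

```python
import math

def pulse(s):
    """Hypothetical choice for f: a bell-shaped pulse."""
    return math.exp(-s * s)

def u(t, x, v):
    """General integral (48) with f = pulse and g = 0.5 sin (both assumed)."""
    return pulse(t - x / v) + 0.5 * math.sin(t + x / v)

def second_difference(func, at, h):
    """Central-difference approximation to the second derivative."""
    return (func(at + h) - 2 * func(at) + func(at - h)) / (h * h)

V = 2.0                          # illustrative phase velocity (assumed)
T0, X0, H = 0.3, 0.7, 1e-3       # point of evaluation and step length

def u_of_t(t):
    return u(t, X0, V)

def u_of_x(x):
    return u(T0, x, V)

u_tt = second_difference(u_of_t, T0, H)
u_xx = second_difference(u_of_x, X0, H)
mismatch = abs(u_xx - u_tt / V**2)    # vanishes up to the discretization error
```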


* In the simplest case as taken here, the amplitude A is independent of the position 
in space. Nevertheless, A may also be a function of x. Thus, in an absorbing medium 
the amplitude falls off exponentially with x. Here, however, we consider only undamped 
waves which, in our case of dependence solely on x, have constant amplitude. 

For a plane wave whose wave normal n (i.e. the perpendicular to 
the planes of equal phase) is arbitrarily oriented with respect to the 
axes, the planes of equal phase have the equation $\mathbf{r}\cdot\mathbf{n} = \text{const.}$, and 
the plane wave is represented by 

$$u = Ae^{i\omega\left(t - \frac{\mathbf{r}\cdot\mathbf{n}}{v}\right)} = Ae^{i\omega\left(t - \frac{x\cos\alpha + y\cos\beta + z\cos\gamma}{v}\right)}. \tag{43'}$$

If this equation is differentiated twice with respect to t and twice with 
respect to each co-ordinate, we can form the most general differential 
equation defining an undamped wave, of which (43') is only one 
particular integral. This equation turns out to be 

$$\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} + \frac{\partial^2 u}{\partial z^2} = \Delta u = \frac{1}{v^2}\frac{\partial^2 u}{\partial t^2}. \tag{49}$$


Another very important special case is that of a spherical wave; here 
the function depends only on the distance r from a centre O. The 
transformation of $\Delta u$ to this case (the special case of spherical polar 
co-ordinates when u depends on r only) is not difficult if we use the 
vector meaning of $\Delta u$, viz. div grad u. We have 

$$\operatorname{grad} u(r) = \frac{du}{dr}\,\frac{\mathbf{r}}{r},$$

$$\operatorname{div}\operatorname{grad} u(r) = \frac{du}{dr}\,\frac{1}{r}\operatorname{div}\mathbf{r} + \mathbf{r}\cdot\operatorname{grad}\left(\frac{du}{dr}\,\frac{1}{r}\right) = \frac{3}{r}\frac{du}{dr} + r\left(\frac{d^2u}{dr^2}\,\frac{1}{r} - \frac{du}{dr}\,\frac{1}{r^2}\right) = \frac{d^2u}{dr^2} + \frac{2}{r}\frac{du}{dr}.$$

Then (49) becomes 

$$\frac{\partial^2 u}{\partial r^2} + \frac{2}{r}\frac{\partial u}{\partial r} = \frac{1}{v^2}\frac{\partial^2 u}{\partial t^2}. \tag{50}$$

A particular integral is 

$$u = \frac{A}{r}\,e^{i\omega(t - r/v)}, \tag{51}$$

as may be seen readily by substitution, or by writing the equation in 
the form 

$$\frac{\partial^2}{\partial r^2}(ru) = \frac{1}{v^2}\frac{\partial^2}{\partial t^2}(ru).$$

This is a simple harmonic wave spreading equally along radii in 
all directions from the origin. In contrast with plane waves, the 
amplitude falls off as 1/r, corresponding to the increase of the wave 
surfaces. The surfaces of constant phase are given by r = const., 
and are therefore spheres. The general solution, consisting of one 
wave emanating from O and another closing in towards O, is 

$$u = \frac{1}{r}\{f(t - r/v) + g(t + r/v)\}. \tag{52}$$

Ex. 26. Give the differential equation of a cylindrical wave, and find its 
integral. 


6. Combination of several Waves having the same direction of Propaga- 
tion; Linearly and Elliptically Polarized Waves; Group Velocity. 


(a) Phase velocity independent of the frequency; absence of dispersion 

If u at x = 0 is a complicated function of the time, representable 
by a Fourier Series, and if there is no dispersion, then the phenomenon 
is repeated at every point with a phase difference corresponding to the 
velocity of propagation. In order to represent such a wave, we have 
only to replace t by t — x/v in the Fourier Series. If the quantity 
satisfying the wave equation is of the nature of a vector, we speak of 
pure longitudinal waves if this vector is along the direction of propa- 
gation, i.e. the x-axis. In this case everything is cylindrically sym- 
metrical around the x-axis, and there is no uniquely distinguished direc- 
tion in the planes perpendicular to this axis. If, on the other hand, 
the vector has no component in the x-direction, we speak of pure 
transverse waves. If there is no phase difference at 0 between the two 
components of the vector — i.e. if the vibration is linear, according to 
§ 4 (p. 58)— it remains linear at every point, if the phase velocity is 
the same for both components. In this case we speak of linearly 
polarized waves , and refer to the plane determined by the directions of 
propagation and vibration as the plane of vibration. If the vibration 
is elliptic at 0, it is elliptic, under the same conditions, at all points in 
space, and we have an elliptically polarized wave , of which a special 
case is the circularly polarized wave. 

The case in which the phase velocity is somewhat different for the 
two components is of importance. If we again represent the vectorial 
point function by means of a radius vector in the planes perpendicular 
to the x-axis, 

$$y = a\cos\omega\left(t - \frac{x}{v_1}\right), \qquad z = b\cos\omega\left(t - \frac{x}{v_2}\right).$$


Comparison with formulae (30) (p. 58) shows that the phase difference 
varies with the position along the x-axis, independently of the time 
(cf. fig. 6). Points of equal phase difference, and therefore having the 
same vibration figure (e.g. a straight line), are apart a distance A, 
given by 






1 




Similarly, we obtain a phase difference varying with the co-ordinate 
if we have two oppositely directed circularly polarized waves of equal 
amplitude, but having different velocities of propagation. According 
to § 4 (p. 60), two such vibrations always combine to give a linear 
vibration, and the position of the axis of vibration is given by the 
phase difference. If the latter varies from place to place, this signifies 
a gradual azimuthal rotation of the axis of vibration as we move in 
the direction of propagation. 


(b) Dispersion; phase velocity dependent upon frequency

If the partial vibrations composing a complicated vibrational 
phenomenon are propagated with different velocities, we obtain in 
general, by superposition of all these vibrations, a non-periodic con- 
dition, concerning which little can be said. It is only when the 
partial vibrations are close together in frequency that the nature of 
the result is clear. The simplest case is that of two slightly different 
vibrations of equal amplitude which produce a modulated vibration 
by means of beats. As long as there is no dispersion, the modulation 
(in radio, the audio-frequency note) is, of course, transmitted with the 
same velocity as the carrier wave. If, however, dispersion exists, we 
obtain another velocity — the group velocity — as the following simple 
computation shows: 

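Let

\[ \omega_1 = \omega + \Delta\omega \quad\text{and}\quad v_1 = v + \frac{dv}{d\omega}\,\Delta\omega. \]

Then the modulated wave becomes

\[ F(x,t) = \cos\omega_1\left(t - \frac{x}{v_1}\right) + \cos\omega\left(t - \frac{x}{v}\right). \]

Combining the two cosines into a product, we may set ω₁ = ω and v₁ = v
in the half-sum of the arguments, without appreciable error. Then

\[ F(x,t) = 2\cos\omega\left(t - \frac{x}{v}\right)\cos\tfrac{1}{2}\left\{\Delta\omega\,t - x\left(\frac{\omega_1}{v_1} - \frac{\omega}{v}\right)\right\}. \]

Neglecting higher terms,

\[ \frac{\omega_1}{v_1} = \frac{\omega}{v} + \left(\frac{1}{v} - \frac{\omega}{v^2}\frac{dv}{d\omega}\right)\Delta\omega. \]

Thus one obtains

\[ F(x,t) = 2\cos\frac{\Delta\omega}{2}\left(t - \frac{x}{V}\right)\cos\omega\left(t - \frac{x}{v}\right), \]

where, for brevity, we have set

\[ \frac{1}{V} = \frac{1}{v} - \frac{\omega}{v^2}\frac{dv}{d\omega} = \frac{d}{d\omega}\left(\frac{\omega}{v}\right). \]
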

The meaning of this result is as follows: We have a wave of
angular frequency ω; its amplitude is subject to a wave-like periodic
variation * of angular frequency Δω. While the phase of the wave is
propagated with velocity v, the velocity of propagation of the modulation
has the value V, determined by

\[ \frac{1}{V} = \frac{d}{d\omega}\left(\frac{\omega}{v}\right) = \frac{d}{d\nu}\left(\frac{\nu}{v}\right). \]   (54)

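It is useful to express the group velocity also in terms of λ. Since v/ν = λ,
(54) may be written

\[ V = \frac{d\nu}{d(1/\lambda)} = -\lambda^2\frac{d\nu}{d\lambda}, \]

and since νλ = v,

\[ \lambda\frac{d\nu}{d\lambda} + \nu = \frac{dv}{d\lambda}. \]

Substituting the value of λ(dν/dλ) into the above, we obtain

\[ V = v - \lambda\frac{dv}{d\lambda}. \]   (54')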
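
The relation V = v − λ dv/dλ obtained above can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes, purely for illustration, the dispersion law v = sqrt(gλ/2π) of gravity waves on deep water, for which the group velocity should come out as half the phase velocity. The function names and the difference step h are likewise hypothetical choices.

```python
import math

def phase_velocity(lam, g=9.81):
    """Assumed example law (deep-water gravity waves): v = sqrt(g*lam/(2*pi))."""
    return math.sqrt(g * lam / (2.0 * math.pi))

def group_velocity(v_of_lam, lam, h=1e-6):
    """V = v - lam * dv/dlam, with dv/dlam taken from a central difference."""
    dv_dlam = (v_of_lam(lam + h) - v_of_lam(lam - h)) / (2.0 * h)
    return v_of_lam(lam) - lam * dv_dlam

lam = 10.0                              # wavelength, arbitrary illustrative value
v = phase_velocity(lam)                 # phase velocity
V = group_velocity(phase_velocity, lam) # group velocity
print("phase velocity:", v)
print("group velocity:", V)
print("ratio V/v:", V / v)
```

For a medium without dispersion (v independent of λ) the same function returns V = v, in agreement with the statement that the modulation then travels with the carrier wave.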


The velocity V is, in general, identical with the signal velocity,
i.e. the velocity of propagation of a signal. If we have a vibration at
the origin O starting at time t = 0 and terminating at t = T, the
function which represents it is in general non-periodic, but may be
represented by a continuous sequence of harmonic functions, according
to § 2 (p. 56). If the duration T of the original vibration is
small, we obtain a narrow band of side frequencies, for which, if the
dispersion is not too great, the derivative dv/dω may be regarded
as constant. In the same way as a modulated wave results from
superposition of two slightly different primary waves, a vibration
lasting for a limited time at any one point results in this case by
superposition of all the partial frequencies. The velocity of propagation
of the limited wave train is then given by V. Since signals can
be given only by initiating or terminating a vibration process, and
not by individual crests or troughs of an unlimited wave, V is the
velocity of propagation of acoustical or optical signals, as obtained
by measurement.

* Because the algebraic sign of the amplitude is of no concern, the frequency is not
Δω/2 but Δω (cf. p. 56).

In the region of anomalous dispersion, the signal velocity, i.e. the velocity
of propagation of the energy, no longer coincides with the group velocity V, which
loses its significance in this instance.

Ex. 27. How must the phase velocity v depend on ν, in order that the group
velocity V be proportional to the reciprocal of the phase velocity?

7. Combination of Waves having the same Frequency but different 
directions of Propagation. Standing Waves. 

(a) Superposition of two waves moving in opposite directions 

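Let

\[ u_1 = A e^{i\omega(t - x/v)} \]

represent a wave travelling in the direction of the positive x-axis,
and let

\[ u_2 = A e^{i\omega(t + x/v) + i\alpha} \]

represent a wave of the same amplitude and frequency travelling
in the direction of the negative x-axis. The sum is

\[ u_1 + u_2 = A e^{i(\omega t + \alpha/2)}\left\{ e^{-i(\omega x/v + \alpha/2)} + e^{i(\omega x/v + \alpha/2)} \right\}
= 2A\cos\left(\frac{\omega x}{v} + \frac{\alpha}{2}\right) e^{i(\omega t + \alpha/2)}. \]   (55)

This result is to be interpreted as follows: The factor

\[ e^{i(\omega t + \alpha/2)} \]

represents a vibration having the same phase all through space. The
factor

\[ 2A\cos\left(\frac{\omega x}{v} + \frac{\alpha}{2}\right) \]

indicates that the amplitude is a periodic function of x. The period
is v/ν = λ. The chief characteristic of a wave, viz. the finite velocity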
of propagation of the phase, is entirely absent. For this reason, the 
phenomenon is called a standing wave. If the two amplitudes are 
unequal, there exists, in addition to the standing wave, a wave travel- 
ling in the direction of the stronger partial wave, whose amplitude is 
the difference of the amplitudes of the components.

(b) Superposition of two plane waves meeting at an angle

A case arising especially frequently in optics is that of a plane wave
reflected from a plane surface which we shall take to be the xy-plane
of a co-ordinate system. Call the angle between the wave-normal and
the z-axis α. If we let the wave-normal n fall in the yz-plane, as in
fig. 7, then, by the laws of optics and acoustics, the normal to the
reflected wave, n′, will also lie in the yz-plane, and will also make an
angle α with the z-axis. For the incident wave, the surfaces of equal
phase are the planes perpendicular to n. These planes have the
equation r·n = const. The equation of a wave moving in the direction
of the unit vector

\[ \mathbf{n} = (\cos\alpha)\,\mathbf{i} + (\cos\beta)\,\mathbf{j} + (\cos\gamma)\,\mathbf{k} \]

is

\[ u = A e^{i\omega\left(t - \frac{\mathbf{r}\cdot\mathbf{n}}{v}\right)} = A e^{i\omega\left(t - \frac{x\cos\alpha + y\cos\beta + z\cos\gamma}{v}\right)}. \]

From this equation there follows at once, by putting n = i, the previous
form of a wave moving in the x-direction. In our case we have
(cf. fig. 7),

\[ \mathbf{n} = (\sin\alpha)\,\mathbf{j} - (\cos\alpha)\,\mathbf{k}; \qquad \mathbf{n}' = (\sin\alpha)\,\mathbf{j} + (\cos\alpha)\,\mathbf{k}. \]

By superposition of the two waves we obtain

\[ u_1 + u_2 = A e^{i\omega\left(t - \frac{y\sin\alpha - z\cos\alpha}{v}\right)} + A e^{i\omega\left(t - \frac{y\sin\alpha + z\cos\alpha}{v}\right)}
= 2A e^{i\omega\left(t - \frac{y\sin\alpha}{v}\right)}\cos\left(\frac{\omega z\cos\alpha}{v}\right). \]

This is a wave travelling in the direction of the y-axis, the wave-front
being modulated with the period λ/cos α. The phase velocity of this
combination wave is v/sin α. For α = 0 we again have the case of a
standing wave.
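
The superposition formula for two plane waves meeting at an angle can be verified numerically. The sketch below is an illustration only: it adds the incident and the reflected wave directly and compares the sum with the closed form 2A exp{iω(t − y sin α/v)} cos(ωz cos α/v). All numerical values and function names are arbitrary assumptions, not taken from the text.

```python
import cmath
import math

def plane_wave(A, omega, v, n, r, t):
    """A * exp(i*omega*(t - r.n/v)) for a unit wave-normal n and a position r."""
    r_dot_n = sum(ri * ni for ri, ni in zip(r, n))
    return A * cmath.exp(1j * omega * (t - r_dot_n / v))

# Illustrative values (assumptions): amplitude, angular frequency, velocity, angle.
A, omega, v, alpha = 1.5, 2.0, 3.0, 0.4
n_incident = (0.0, math.sin(alpha), -math.cos(alpha))   # n  = sin(a) j - cos(a) k
n_reflected = (0.0, math.sin(alpha), math.cos(alpha))   # n' = sin(a) j + cos(a) k

def superposed(r, t):
    """Direct sum of the incident and the reflected wave."""
    return (plane_wave(A, omega, v, n_incident, r, t)
            + plane_wave(A, omega, v, n_reflected, r, t))

def closed_form(r, t):
    """Travelling wave along y, modulated along z."""
    x, y, z = r
    travelling = cmath.exp(1j * omega * (t - y * math.sin(alpha) / v))
    return 2 * A * travelling * math.cos(omega * z * math.cos(alpha) / v)

r, t = (0.3, -1.2, 0.7), 0.9
print("difference:", abs(superposed(r, t) - closed_form(r, t)))
```

Setting alpha = 0 in the same sketch removes the dependence on y from the exponential, which is the standing-wave case mentioned in the text.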




CHAPTER III

Selected Topics in the Theory of Functions
of a Complex Variable

1. Conformal Mapping of one Plane on another.

Since many problems of theoretical physics may be solved particularly
simply with the aid of the Theory of Complex Functions,
we give here a rapid account of some of the most useful results of
function theory. For rigorous deductions of these results, however,
the reader should consult textbooks on the subject.



If the argument of a function, e.g. the sine function, is a complex
number z = x + iy, then, in general, the function will also be a complex
number w = u + iv. If we represent the complex number z by
a point in a plane (the Gauss plane or Argand diagram) and the
complex number w by a point in a second plane, then the function
f(z) establishes a correspondence of the points of the w-plane with those
of the z-plane, and we say that one plane is mapped on the other.
The function f(z) is said to be monogenic at those points where w is
finite and where the limit

\[ f'(z) = \lim_{\Delta z \to 0} \frac{\Delta w}{\Delta z} \]

is finite and independent of the way in which the point z is approached
as Δz tends to zero. We now state a theorem: Within a region in
which w is regular, the z-plane may be mapped on the w-plane, with
the exception of those points where f′(z) = 0, by means of a function
w = f(z), in such a way that corresponding small portions of each
plane are similar and angles are preserved (conformal mapping). For
a proof, we consider a small triangle in the z-plane, determined by
the points $z_0$, $z_0 + \Delta_1 z$ and $z_0 + \Delta_2 z$ (fig. 1). These points correspond
to the points $w_0$, $w_0 + \Delta_1 w = w_0 + f'(z_0)\,\Delta_1 z$ and
$w_0 + \Delta_2 w = w_0 + f'(z_0)\,\Delta_2 z$, in the w-plane. Thus, the quantities $\Delta_1 w$ and $\Delta_2 w$
are obtained by multiplying the complex numbers $\Delta_1 z$ and $\Delta_2 z$ by the
same complex number $f'(z_0)$. But this means that $\Delta_1 z$ and $\Delta_2 z$ are
turned through the same angle and elongated in the same ratio.*
[Fig. 2: the z-plane and the w-plane]

The similarity of the two triangles is thus demonstrated; all angles
are preserved in the mapping. The type of mapping represented by
the function w = f(z) is therefore termed conformal. The lines
x = const. and y = const. in the z-plane form a rectilinear rectangular
co-ordinate system. On account of the preservation of angles, these
straight lines correspond to two families of mutually perpendicular
curves in the w-plane. As an example, consider the function

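\[ u + iv = w = \cos z = \tfrac{1}{2}\left(e^{iz} + e^{-iz}\right) = \tfrac{1}{2}e^{ix - y} + \tfrac{1}{2}e^{-ix + y}
= \cosh y\cos x - i\sinh y\sin x, \]

that is

\[ u = \cosh y\cos x, \qquad v = -\sinh y\sin x. \]

Elimination of x yields

\[ \frac{u^2}{\cosh^2 y} + \frac{v^2}{\sinh^2 y} = 1. \]
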

* We see at once that the multiplication of two complex numbers represents a
rotation and a change of length. If $z_1 = \rho_1 e^{i\phi_1}$ and $z_2 = \rho_2 e^{i\phi_2}$, then $z_1 z_2 = \rho_1\rho_2 e^{i(\phi_1 + \phi_2)}$.

Similarly, by eliminating y,

\[ \frac{u^2}{\cos^2 x} - \frac{v^2}{\sin^2 x} = 1. \]

The curves x = const. thus become, in the w-plane, hyperbolas with
foci at −1 and +1; the curves y = const. become ellipses having the
same foci (fig. 2). It is known from analytic geometry that these confocal
conics form an orthogonal system. The strip of the z-plane
between x = 0 and x = π maps itself into the entire w-plane. To
represent the entire z-plane requires a w-plane of infinitely many
sheets.
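
The statements about the mapping w = cos z lend themselves to a quick numerical check. The following sketch (an illustration under assumed sample values, not part of the original text) confirms that points on a line y = const. land on the ellipse, that points on a line x = const. land on the hyperbola, and that a small step along x and a small step along y are mapped into two steps which are still perpendicular.

```python
import cmath
import math

def image(x, y):
    """Return (u, v) for w = cos(x + i*y)."""
    w = cmath.cos(complex(x, y))
    return w.real, w.imag

# Points on the line y = y0 should satisfy u^2/cosh^2(y0) + v^2/sinh^2(y0) = 1.
y0 = 0.8
ellipse_residuals = []
for k in range(1, 8):
    u, v = image(0.4 * k, y0)
    ellipse_residuals.append(u**2 / math.cosh(y0)**2 + v**2 / math.sinh(y0)**2 - 1.0)

# Points on the line x = x0 should satisfy u^2/cos^2(x0) - v^2/sin^2(x0) = 1.
x0 = 0.6
hyperbola_residuals = []
for k in range(1, 8):
    u, v = image(x0, 0.3 * k)
    hyperbola_residuals.append(u**2 / math.cos(x0)**2 - v**2 / math.sin(x0)**2 - 1.0)

# Angle preservation: images of small steps along x and along y at the point z0.
z0, h = complex(x0, y0), 1e-6
dw_x = cmath.cos(z0 + h) - cmath.cos(z0)
dw_y = cmath.cos(z0 + 1j * h) - cmath.cos(z0)
angle = cmath.phase(dw_y / dw_x)   # angle between the two image steps

print("largest ellipse residual:", max(abs(e) for e in ellipse_residuals))
print("largest hyperbola residual:", max(abs(e) for e in hyperbola_residuals))
print("angle between image steps / (pi/2):", angle / (math.pi / 2))
```

The angle test fails only where f′(z) = −sin z vanishes, i.e. at the excepted points of the theorem.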


Ex. 28. Discuss the transformation w = z + …



2. The Cauchy-Riemann Conditions and the Differential Equation of 
Laplace. 

The real part u and the imaginary part v of w = u + iv are functions
of the two variables x and y. Nevertheless, two arbitrary functions
u(x, y) and v(x, y) cannot, in general, be considered to be the
real and imaginary parts of a function of a complex variable. If the
functions u and v originate in a complex function, they satisfy certain
special conditions. If we denote by f′(z) the derivative of the function
f with respect to its argument, then, since u + iv = f(x + iy) = f(z),

\[ \frac{\partial u}{\partial x} + i\frac{\partial v}{\partial x} = f'(z)\frac{\partial z}{\partial x} = f'(z), \qquad
\frac{\partial u}{\partial y} + i\frac{\partial v}{\partial y} = f'(z)\frac{\partial z}{\partial y} = i f'(z). \]   (1)

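Comparison gives the Cauchy-Riemann Differential Equations

\[ \frac{\partial u}{\partial x} = \frac{\partial v}{\partial y}, \qquad \frac{\partial u}{\partial y} = -\frac{\partial v}{\partial x}. \]   (2)
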

These relationships must hold between u and v, so that neither of the
two functions may be chosen at will. If we differentiate the first of
the above equations with respect to x, the second with respect to y,
and add, we obtain Laplace's Differential Equation

\[ \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} = \Delta u = 0. \]   (3)

Similarly, by differentiating the first with respect to y and the second
with respect to x, and subtracting,

\[ \frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2} = \Delta v = 0. \]

Hence both the real and the imaginary parts of the complex function
w satisfy Laplace's equation: Δu = 0, Δv = 0.
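
A short worked example may make these conditions concrete. The function w = z² is chosen here purely for illustration and does not appear in the original text; the second part shows, as a contrast, a function u that cannot be the real part of any monogenic function.

```latex
% Illustrative example (assumed, not from the source): w = z^2.
\[
  w = z^2 = (x + iy)^2 \quad\Longrightarrow\quad u = x^2 - y^2, \qquad v = 2xy .
\]
% Cauchy-Riemann equations:
\[
  \frac{\partial u}{\partial x} = 2x = \frac{\partial v}{\partial y}, \qquad
  \frac{\partial u}{\partial y} = -2y = -\frac{\partial v}{\partial x} .
\]
% Laplace's equation for both parts:
\[
  \Delta u = 2 - 2 = 0, \qquad \Delta v = 0 + 0 = 0 .
\]
% Counter-example: u = x^2 + y^2 gives \Delta u = 4 \neq 0,
% so no function v exists that makes u + iv monogenic.
```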





3. Line Integrals in the Gauss Plane; the Cauchy Integral Theorem. 

Form the integral $\int_{P_0}^{P_1} f(z)\,dz$ between two points P₀ and P₁ of a
curve C in the z-plane (fig. 3). Cauchy's
theorem is: The integral along a second curve
C′, between the same points, has the same
value if the function f(z) is monogenic at all
points in the region bounded by the two
curves; or, what is the same thing, the contour
integral $\oint f(z)\,dz$ vanishes if the function
is monogenic in the enclosed region. To
prove this statement, we separate real and imaginary parts:

\[ \oint (u + iv)(dx + i\,dy) = \oint (u\,dx - v\,dy) + i\oint (v\,dx + u\,dy)
= \oint \mathbf{F}\cdot d\mathbf{s} + i\oint \mathbf{F}'\cdot d\mathbf{s}. \]

The first integral represents a plane line integral $\oint \mathbf{F}\cdot d\mathbf{s}$ if we put
$F_x = u$ and $F_y = -v$. But by Stokes's Theorem (p. 27), since dS
has a z-component only,

\[ \oint \mathbf{F}\cdot d\mathbf{s} = \int \operatorname{curl}\mathbf{F}\cdot d\mathbf{S}
= -\iint \left(\frac{\partial v}{\partial x} + \frac{\partial u}{\partial y}\right) dx\,dy, \]

and similarly, for the second integral, with $F'_x = v$ and $F'_y = u$,

\[ \oint \mathbf{F}'\cdot d\mathbf{s} = \int \operatorname{curl}\mathbf{F}'\cdot d\mathbf{S}
= \iint \left(\frac{\partial u}{\partial x} - \frac{\partial v}{\partial y}\right) dx\,dy. \]

However, by the Cauchy-Riemann equations (2), the integrands in
both double integrals vanish identically, which proves the theorem.
Further, the following theorem may be proved: If the line integral
along a closed curve C does not vanish, but has a finite value, on
account of the existence of a singularity of the function in the region
enclosed by C, then the same value is obtained by integrating along
another closed curve C′, provided the function is monogenic in the
region between the two curves. The two curves C and C′ (fig. 4) form
the boundary of a region to which Stokes's Theorem may be applied,
since the function f (and therefore also u and v) is regular within
this region.* It is to be remembered, however, that in applying Stokes's
Theorem, the boundary curve is to be traversed in such a direction that
the bounded surface is always on the left. Hence the direction in which
the inner curve is described must be reversed; or, if the curves are
traversed in the same sense,

\[ \oint_C f(z)\,dz = \oint_{C'} f(z)\,dz. \]


If there is but one singular point z₀, we may therefore integrate around
a small circle of radius ρ, with centre at z₀. In the greater number
of cases, the singularity at z₀ consists in the function becoming
infinite to a certain definite order n; i.e. a power series development
at z₀ contains terms $(z - z_0)^{-1}, \ldots, (z - z_0)^{-n}$. It may be shown
at once that the value of the integral depends only upon the term
$(z - z_0)^{-1}$, for if we put (cf. fig. 5)

\[ z - z_0 = \rho e^{i\phi}, \qquad dz = i\rho e^{i\phi}\,d\phi, \]

then

\[ \oint (z - z_0)^n\,dz = i\int_0^{2\pi} \rho^{n+1} e^{i(n+1)\phi}\,d\phi
= \frac{\rho^{n+1}}{n+1}\Big[e^{i(n+1)\phi}\Big]_0^{2\pi}. \]

On account of the periodicity of the complex exponential function,
the integrals vanish for n ≠ −1, but for n = −1,

\[ \oint \frac{dz}{z - z_0} = i\int_0^{2\pi} d\phi = 2\pi i. \]


* From a strict mathematical viewpoint it is allowable to apply Stokes's
Theorem only after joining the two curves by a “cut”. In forming the line integral
this cut will be traversed twice in opposite directions, hence contributes nothing to
the integral. By inserting the cut we get a single, continuous, bounding curve that
may be traversed in one operation.

If the path of integration encloses a singularity z₀ of the function f, the
value of the complete line integral is 2πi times the coefficient of the term
$(z - z_0)^{-1}$ of the power series development (in powers of z − z₀), which
holds in the neighbourhood of z₀. This coefficient is called the residue
of the function at z₀. If several singularities are enclosed, it is readily
shown that the value of the integral is equal to the sum of the separate
residues multiplied by 2πi. If the point z = ∞ is enclosed, the substitution
z = 1/t, dz = −dt/t² is made, which reduces the point at
infinity to the zero-point. Again the essential coefficient is that of
the term in $z^{-1}$, for this yields the term in 1/t upon making the above
substitution.
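
The behaviour of the integral of (z − z₀)ⁿ around a circle can be checked numerically. The following sketch is an illustration only: it uses the parametrization z = z₀ + ρ exp(iφ) from the text and a simple equally spaced sum over φ. The function name, the centre z₀, the radius and the number of steps are arbitrary assumptions.

```python
import cmath
import math

def circle_integral(f, z0, rho, steps=2000):
    """Approximate the integral of f once counter-clockwise around |z - z0| = rho,
    using z = z0 + rho*exp(i*phi) and dz = i*rho*exp(i*phi)*dphi."""
    dphi = 2.0 * math.pi / steps
    total = 0j
    for k in range(steps):
        e = cmath.exp(1j * k * dphi)
        total += f(z0 + rho * e) * 1j * rho * e * dphi
    return total

z0 = 0.5 + 0.25j   # assumed position of the singular point
for n in (-3, -2, -1, 0, 1, 2):
    value = circle_integral(lambda z, n=n: (z - z0) ** n, z0, rho=0.7)
    print(n, value)
```

Only the power n = −1 gives a value different from zero (up to rounding), namely 2πi, independently of the radius chosen; this is the content of the residue rule stated above.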

Ex. 29. Using the Cauchy Theorem, evaluate the integral

\[ \int_0^{2\pi} \frac{d\phi}{1 + \epsilon\cos\phi} \]

by expressing cos φ in terms of $e^{i\phi}$ and introducing $e^{i\phi} = z$ as a new variable.


CHAPTER IV


The Fundamental Problem of the Calculus of
Variations and its Solution


1. Statement of the Problem of the Calculus of Variations.

The question “For what values of the independent variable has
a given function y = f(x) a turning value?” is treated in elementary
differential calculus. A necessary condition is the vanishing of the
first derivative. Similarly, a necessary condition for an extreme value
(i.e. a maximum or minimum, or turning value, or extremum) of a
function of several variables is the vanishing of all partial derivatives
of the first order ∂f/∂x, ∂f/∂y, ... The following question is much
more difficult to answer: “Given a definite integral whose integrand
is a function of x, y and of the first derivative y′,

\[ I = \int_{x_0}^{x_1} F(x, y, y')\,dx; \]

for what function y(x) is the value of the integral a maximum or a
minimum?” In contrast with the simple extreme-value problem of
differential calculus, the function y(x) is not known here, but is to be
determined in such a way that the integral is an extremum. In geometry
and in physics problems of this type meet us at every turn.
The simplest example is furnished by the question: “ What is the 
shortest curve which can be drawn joining two given points?” In a 
plane, the answer may be given immediately: “ The straight line.” 
But if the two points and their connecting curve are to lie on a given 
arbitrary surface, then the analytic equation of this curve — which is 
called a geodesic — is found only by solution of the above extreme- 
value problem, which is the fundamental problem of the Calculus of 
Variations. 


2. Derivation of the Euler-Lagrange Differential Equation. 

We shall now show how the extremal problem of the calculus of
variations may be reduced to the determination of the extreme value
of a known function. For this purpose we consider functions ȳ of x,
which are “neighbouring” functions to y. The function ȳ is arrived
at as follows: Let ε be a small quantity, and let η(x) be an arbitrary
function of x, continuous, along with its first two derivatives, in the
range of integration. Then we introduce into the integral, in place of
y and y′, the neighbouring functions ȳ = y + εη and ȳ′ = y′ + εη′.
We stipulate, however, that these comparison functions ȳ coincide with
the function y(x) at the end points of the range of integration (fig. 1).
We must therefore require of the arbitrary function η that it vanish at
the ends of the interval; for if the various curves connecting two given
points are compared, an obvious condition is that all curves to be compared
shall pass through these two points, and that no curves which do
not pass through these points shall be considered. If we substitute the
neighbouring functions in the integral I, the integral becomes a function
of ε. We then require that y(x) make the integral an extremum, i.e. the
function I(ε) must have an extreme value for ε = 0.

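\[ I(\epsilon) = \int_{x_0}^{x_1} F(x,\; y + \epsilon\eta,\; y' + \epsilon\eta')\,dx = \text{extremum for } \epsilon = 0. \]   (1)

Fig. 1

This gives us a simple method of determining the extreme value of a
given integral. The condition is

\[ \left(\frac{dI}{d\epsilon}\right)_{\epsilon = 0} = 0. \]   (2)
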

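We expand the integrand function F in a Taylor's Series, according
to powers of ε, and obtain

\[ I(\epsilon) = \int_{x_0}^{x_1} \left\{ F(x, y, y') + \epsilon\eta\frac{\partial F}{\partial y} + \epsilon\eta'\frac{\partial F}{\partial y'} + \text{terms in } \epsilon^2, \epsilon^3, \ldots \right\} dx. \]   (3)

If we differentiate with respect to ε under the integral sign, as is allowable,
we obtain

\[ \frac{dI}{d\epsilon} = \int_{x_0}^{x_1} \left\{ \eta\frac{\partial F}{\partial y} + \eta'\frac{\partial F}{\partial y'} + \text{terms in } \epsilon, \epsilon^2, \ldots \right\} dx. \]   (4)

This expression is to vanish for ε = 0. The terms in ε do
this of themselves. Thus there remains the condition for an
extremum:

\[ \int_{x_0}^{x_1} \left( \eta\frac{\partial F}{\partial y} + \eta'\frac{\partial F}{\partial y'} \right) dx = 0. \]   (5)

The second term may be transformed by integration by parts:

\[ \int_{x_0}^{x_1} \eta'\frac{\partial F}{\partial y'}\,dx = \left[ \eta\frac{\partial F}{\partial y'} \right]_{x_0}^{x_1} - \int_{x_0}^{x_1} \eta\,\frac{d}{dx}\frac{\partial F}{\partial y'}\,dx. \]

The first term vanishes since η is zero at the limits. Then

\[ \int_{x_0}^{x_1} \eta \left\{ \frac{\partial F}{\partial y} - \frac{d}{dx}\frac{\partial F}{\partial y'} \right\} dx = 0. \]   (6)

On account of the fact that η is perfectly arbitrary, the integral can
vanish only if the bracketed expression in the integrand is zero.
This gives the Euler-Lagrange Differential Equation:

\[ \frac{d}{dx}\frac{\partial F(x, y, y')}{\partial y'} - \frac{\partial F(x, y, y')}{\partial y} = 0, \]   (7)
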

which must be satisfied by y, if y is to make the integral an extremum. 
The essentially difficult investigation of the nature of the extremum 
(whether maximum or minimum) seldom arises in physics, and need 
not be treated here. On the other hand, the case where F is a function 
of several dependent variables y k and their derivatives is of great 
importance. In analogy with the above, we introduce as neighbouring 

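functions

\[ \bar y_1 = y_1 + \epsilon_1\eta_1, \;\ldots,\; \bar y_k = y_k + \epsilon_k\eta_k, \;\ldots,\; \bar y_f = y_f + \epsilon_f\eta_f, \]

where the $\eta_k$ are again to vanish at the limits of the integral. The
integral then becomes a function of the f variables $\epsilon_1, \ldots, \epsilon_f$. The
condition for an extremum is, of course, the vanishing of the partial
derivatives

\[ \frac{\partial I(\epsilon_1, \epsilon_2, \ldots, \epsilon_f)}{\partial \epsilon_k}
= \int_{x_0}^{x_1} \eta_k \left\{ \frac{\partial F}{\partial y_k} - \frac{d}{dx}\frac{\partial F}{\partial y_k'} \right\} dx = 0 \]

when $\epsilon_1 = \epsilon_2 = \ldots = \epsilon_f = 0$. It follows as before that the coefficient
of each of the functions $\eta_k$ within the integral sign must vanish, i.e.

\[ \frac{d}{dx}\frac{\partial F}{\partial y_k'} - \frac{\partial F}{\partial y_k} = 0 \qquad (k = 1, 2, \ldots, f). \]

The Euler-Lagrange equation therefore holds for each of the dependent
variables.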
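
The reduction of the variation problem to an ordinary extreme-value problem in ε can be illustrated numerically. The sketch below is an assumption-laden example, not part of the original text: it takes the arc-length integrand F = sqrt(1 + y′²), the straight line y = x joining (0, 0) and (1, 1), and the comparison function η = sin πx, which vanishes at both end points. The integral is evaluated by the midpoint rule.

```python
import math

def length_integral(eps, steps=2000):
    """I(eps) = integral over [0, 1] of sqrt(1 + (y' + eps*eta')^2) dx
    for y = x and eta = sin(pi*x), evaluated by the midpoint rule."""
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * h
        slope = 1.0 + eps * math.pi * math.cos(math.pi * x)   # y' + eps*eta'
        total += math.sqrt(1.0 + slope * slope) * h
    return total

I0 = length_integral(0.0)                                      # length of the straight line
dI = (length_integral(1e-4) - length_integral(-1e-4)) / 2e-4   # dI/d(eps) at eps = 0

print("I(0):", I0)
print("dI/d(eps) at eps = 0:", dI)
print("I(+0.1), I(-0.1):", length_integral(0.1), length_integral(-0.1))
```

The derivative at ε = 0 vanishes to within rounding, and both neighbouring curves are longer than the straight line, as expected for the Euler-Lagrange solution y″ = 0 of this integrand.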


The difference

\[ I(\epsilon) - I(0) = \epsilon\left(\frac{dI}{d\epsilon}\right)_{\epsilon = 0}, \]

for ε small, is usually termed the variation of the integral, and is written

\[ \delta \int_{x_0}^{x_1} F(x, y, y')\,dx. \]

We then write, instead of

\[ \int_{x_0}^{x_1} F(x, y, y')\,dx = \text{extremum}, \]

the equation expressing the vanishing of the variation:

\[ \delta I = \delta \int_{x_0}^{x_1} F(x, y, y')\,dx = 0, \]

for which the Euler-Lagrange equation must be satisfied, just as for (1) above.
We call δF the variation of F. An important conclusion may be drawn
from this formulation of the variation problem. Let F be the total derivative
of a function G(x, y):

\[ F(x, y, y') = \frac{d}{dx} G(x, y). \]

In this case the integral may be evaluated immediately:

\[ \int_{x_0}^{x_1} F\,dx = \int_{x_0}^{x_1} \frac{dG}{dx}\,dx = G(x_1, y_1) - G(x_0, y_0). \]

Since the varied function coincides with the original function at the limits, we
have for every connexion between y and x:

\[ \delta \int_{x_0}^{x_1} F\,dx = \delta G(x_1, y_1) - \delta G(x_0, y_0) = 0. \]

That is, any arbitrary functional relationship y(x) satisfies the condition that
the variation of the integral shall vanish, and the extremal problem no longer
has any meaning.

Ex. 30. Prove the following: If, in the integral whose extreme value is sought,
the integrand F does not contain x explicitly, then a first integral is

\[ F - y'\frac{\partial F}{\partial y'} = \text{const.} \]




PART II 

Mechanics 



CHAPTER V 

The Mechanics of a Single Particle 



1. The Fundamental Concepts of Kinematics.

We shall concern ourselves, in this first chapter on mechanics, with
the motion of bodies whose extension in space may be neglected, and
which may therefore be represented schematically by means of particles,
or points endowed with mass (cf. § 2, p. 84). The position of a particle
is indicated by a radius vector r drawn from a fixed point O. If
necessary, we may resolve this vector into components. At the time
t, let the particle P be at a point in space specified by the vector r.
At the time t + dt the vector is r + dr (fig. 1). The velocity
is the vector v resulting from taking the limit of dr/dt for dt infinitely
small, i.e.

\[ \mathbf{v} = \frac{d\mathbf{r}}{dt} = \dot{\mathbf{r}}.\,{}^{*} \]   (1)

[Fig. 1: orbit and hodograph]


The direction of the velocity vector is thus given by the small vector 
dr having the direction of the tangent to the path. Now, in general, 
the vector v will itself be a function of the time. We obtain a vivid 
picture of the trend of the velocity if we draw the vectors v from a
new centre O₁ as radius vectors. Then there is a definite terminus of
a radius vector v corresponding to every instant of time. If all these 
points are joined, we obtain a new curve, called the hodograph of the 
motion. While the particle P traverses the actual orbit, the “ image 
point of the velocity ”, as defined above, moves over the hodograph. 

* When not explicitly written out, each differentiation with respect to time will be
indicated by a dot over the appropriate quantity.

The acceleration is the limit of the change in v per unit time, i.e.

\[ \mathbf{a} = \frac{d\mathbf{v}}{dt} = \dot{\mathbf{v}} = \frac{d^2\mathbf{r}}{dt^2} = \ddot{\mathbf{r}}. \]   (2)

Since v is the radius vector of the point on the hodograph, the acceleration
of the particle itself is equal to the velocity of its image point on
the hodograph.

The velocity v is said to be uniform only if the vector v is constant. 
This definition requires that the motion be rectilinear. Motion on a 
curve in which the magnitude of the velocity is constant — i.e. where 
equal arcs are traversed in equal times — can nevertheless not be said 
to have uniform velocity. 

We may resolve the acceleration in various ways: 


(a) Tangential and normal acceleration:

\[ \mathbf{v} = v\,\mathbf{t}, \]   (3)

where t is the unit tangent vector (p. 18) and v is the magnitude of v.
Differentiation gives

\[ \mathbf{a} = \frac{dv}{dt}\,\mathbf{t} + v\,\frac{d\mathbf{t}}{ds}\frac{ds}{dt} = \frac{d^2 s}{dt^2}\,\mathbf{t} + \frac{v^2}{\rho}\,\mathbf{n} \]   (4)

[cf. eqn. (32), p. 19]. Here n signifies the unit normal vector pointing
toward the concave side of the curve, and ρ is the radius of curvature.
The tangential acceleration has the magnitude

\[ a_t = \frac{dv}{dt} = \frac{d^2 s}{dt^2}. \]   (5)

The normal acceleration, directed toward the inside of the path, is

\[ a_n = \frac{v^2}{\rho}. \]   (6)

From (4) it is evident that the acceleration is always in the osculating
plane.

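(b) In rectangular co-ordinates:

\[ \mathbf{r} = x\,\mathbf{i} + y\,\mathbf{j} + z\,\mathbf{k}, \qquad
\dot{\mathbf{r}} = \dot x\,\mathbf{i} + \dot y\,\mathbf{j} + \dot z\,\mathbf{k}, \qquad
\ddot{\mathbf{r}} = \ddot x\,\mathbf{i} + \ddot y\,\mathbf{j} + \ddot z\,\mathbf{k}. \]   (7)

That is,

\[ v_x = \dot x, \quad v_y = \dot y, \quad v_z = \dot z; \qquad a_x = \ddot x, \quad a_y = \ddot y, \quad a_z = \ddot z. \]   (8)

(c) The resolution in plane polar co-ordinates r, φ is also of importance.
For this purpose we introduce two mutually perpendicular
unit vectors $\mathbf{e}_r$ and $\mathbf{e}_\phi$. These vectors have, respectively, the directions
of increasing r and increasing φ, and unlike the vectors i, j, k,
change their direction with time. Since the derivative of a unit vector
is perpendicular to the vector, we have (cf. fig. 2)

\[ \frac{d\mathbf{e}_r}{dt} = \frac{d\phi}{dt}\,\mathbf{e}_\phi, \qquad \frac{d\mathbf{e}_\phi}{dt} = -\frac{d\phi}{dt}\,\mathbf{e}_r, \]

and we obtain, from $\mathbf{r} = r\,\mathbf{e}_r$,

\[ \dot{\mathbf{r}} = \dot r\,\mathbf{e}_r + r\,\dot{\mathbf{e}}_r = \dot r\,\mathbf{e}_r + \dot\phi\, r\,\mathbf{e}_\phi \]

and, by a second differentiation,

\[ \ddot{\mathbf{r}} = \ddot r\,\mathbf{e}_r + \dot r\,\dot{\mathbf{e}}_r + \ddot\phi\, r\,\mathbf{e}_\phi + \dot\phi\,\dot r\,\mathbf{e}_\phi + \dot\phi\, r\,\dot{\mathbf{e}}_\phi
= (\ddot r - r\dot\phi^2)\,\mathbf{e}_r + (\ddot\phi\, r + 2\dot\phi\,\dot r)\,\mathbf{e}_\phi. \]

Thus

\[ a_r = \ddot r - r\dot\phi^2, \qquad a_\phi = \ddot\phi\, r + 2\dot\phi\,\dot r. \]   (9)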
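
The polar components of the acceleration can be compared with a direct numerical differentiation in rectangular co-ordinates. The sketch below is an illustration only: the motion r = 1 + 0.5 t, φ = t² is an arbitrary assumption, as are the function names and the step h; the Cartesian acceleration is obtained by central second differences and then projected on the unit vectors of increasing r and increasing φ.

```python
import math

def r_of(t):
    return 1.0 + 0.5 * t      # assumed radial motion

def phi_of(t):
    return t * t              # assumed angular motion

def position(t):
    """Rectangular co-ordinates (x, y) of the particle at time t."""
    r, phi = r_of(t), phi_of(t)
    return r * math.cos(phi), r * math.sin(phi)

def cartesian_acceleration(t, h=1e-4):
    """Second derivative of (x, y) by central differences."""
    (xa, ya), (xb, yb), (xc, yc) = position(t - h), position(t), position(t + h)
    return (xa - 2 * xb + xc) / h**2, (ya - 2 * yb + yc) / h**2

t = 0.7
ax, ay = cartesian_acceleration(t)
phi = phi_of(t)

# Projection on e_r = (cos phi, sin phi) and e_phi = (-sin phi, cos phi).
a_r_numeric = ax * math.cos(phi) + ay * math.sin(phi)
a_phi_numeric = -ax * math.sin(phi) + ay * math.cos(phi)

# Polar formulae, with the derivatives of the assumed motion written out by hand.
r, r_dot, r_ddot = r_of(t), 0.5, 0.0
phi_dot, phi_ddot = 2.0 * t, 2.0
a_r = r_ddot - r * phi_dot**2
a_phi = phi_ddot * r + 2.0 * phi_dot * r_dot

print("a_r:  ", a_r_numeric, a_r)
print("a_phi:", a_phi_numeric, a_phi)
```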



2. Newton’s Second Law of Motion.

(a) The law of inertia

The first of Newton’s laws states that every body tends to remain at
rest or in uniform motion in a straight line, unless acted upon by external
forces. This law, commonly called the Law of Inertia, is by no ...
but represents an ingenious extrapolation of our ...
for we cannot free a body completely from ...
the gravitational effects of the earth and ...
present.

(b) Force and acceleration

If we wish to alter the state of motion of a body, e.g. a sphere 
resting on a horizontal plane, by the use of our muscles, then a cer- 
tain effort, increasing with the resulting change in velocity, is re- 
quired. This effort, which we call force, is a direct sensory impression, 
and hence not further definable. The direction of the acceleration is 
determined by the direction in which we allow our muscles to work, 
so that force, like acceleration, is a vector. But from common 
experience we know that different amounts of force are necessary to 
produce a given acceleration, depending upon the nature of the body 
acted upon, e.g. a sphere of iron or of wood. Reciprocally, the effect 
of a given force depends upon the nature of the body to which it is 
applied. The direction of the acceleration, however, is the same for all 
bodies. We must therefore assign to each body a scalar property
which we call the inertial mass m. The simplest equation embracing
these factors is

\[ \mathbf{F} = m\mathbf{a}. \]   (10)

Since we have introduced no factor of proportionality, the unit
of force is determined by using the fundamental C.G.S. units,
i.e. length in cm., mass in gm., and time in sec. The force which
imparts to 1 gm. of mass an acceleration of 1 cm./sec.² is called
1 dyne.

Throughout Newtonian Mechanics, to which we limit ourselves for
the present, it is tacitly assumed that the inertial mass m is a constant
individual property of a body. A more cautious way of writing the
fundamental law (10) leaves this question open:

\[ \mathbf{F} = \frac{d}{dt}(m\mathbf{v}) = \frac{d\mathbf{u}}{dt}. \]   (10′)

The quantity u = mv introduced here is called the momentum. Newton’s
second law, stated in terms of momentum, is then: The force is
equal to the change in momentum per second.*

All about us in nature we see changes taking place in the state of 
motion of bodies, without exertion of muscular effort on our part. 
We interpret the cause of these changes as “ forces ” which operate 
on the bodies in the same way as our muscles do. The investigation 
of these forces, which may be of a very diverse nature, is an important 
problem in physics. In mechanics, however, we take the force to be 


given, and confine ourselves to computing the effects. If several forces
act on a body, each produces its own acceleration:

\[ \mathbf{F}_i = m\mathbf{a}_i. \]

But since the resultant acceleration is the vector sum of the separate
accelerations, and the inertial mass m is the same for all,

\[ m\mathbf{a} = m\sum_i \mathbf{a}_i = \sum_i \mathbf{F}_i = \mathbf{F}. \]

We may therefore replace all the separate applied forces $\mathbf{F}_i$ by their
resultant F in order to calculate the acceleration.

* A more complete general statement reads: “Rate of change of momentum is proportional
to the force producing it, and this change takes place in the direction in which the
force acts.”

Like most physical laws, the Newtonian Law of Motion has the 
form of a differential equation. The variation in any interval of time 
is obtained from this equation by continuously repeated application 
of the law, starting with given initial conditions, i.e. by integration of 
the differential equation. The unique power of theoretical physics is to 
be found in the circumstance that the laws of nature can be formu- 
lated in terms of differential equations. The fact that we can deduce 
descriptions of such apparently diverse phenomena as free fall, pro- 
jectile motion and planetary motion from a single law depends on the 
circumstance that what happens in a short interval of time in any of 
these cases is deducible from the same relation. 

In most cases eqn. (10) (p. 84) must be expressed in terms of the 
co-ordinates appropriate to the problem. Thus we obtain, in general, 
a system of three differential equations. Since the solution of three 
differential equations of the second order contains six constants, a 
complete description of the phenomenon is obtained only if these six 
constants are known. These are uniquely determined by giving the 
initial position and initial velocity, each having three components. 
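
The "continuously repeated application of the law" can be imitated by stepping the equation of motion through many short intervals of time. The following is a minimal sketch under stated assumptions, not a method given in the text: the force is the constant weight of Ex. 32, so that the acceleration is (0, 0, −g); the six constants of integration enter as the initial position r0 and the initial velocity v0; the function name, the step dt and the numerical values are hypothetical.

```python
def integrate_projectile(r0, v0, g=9.81, dt=1e-3, t_end=1.0):
    """Step the motion under the constant force (0, 0, -m*g).
    The mass cancels, so the acceleration is (0, 0, -g)."""
    x, y, z = r0
    vx, vy, vz = v0
    for _ in range(round(t_end / dt)):
        new_vz = vz - g * dt              # velocity change in the interval dt
        x += vx * dt                      # no horizontal force
        y += vy * dt
        z += 0.5 * (vz + new_vz) * dt     # mean velocity over the interval
        vz = new_vz
    return (x, y, z), (vx, vy, vz)

r0 = (0.0, 0.0, 0.0)      # three constants: initial position
v0 = (3.0, 0.0, 4.0)      # three constants: initial velocity
r1, v1 = integrate_projectile(r0, v0)

# Closed-form result of integrating the same equations for t = 1.
t = 1.0
exact_z = r0[2] + v0[2] * t - 0.5 * 9.81 * t * t
print("stepped position:", r1)
print("exact height:", exact_z)
```

Changing v0 while keeping the same law yields the different orbits asked for in Ex. 32; the stepping rule itself is not altered.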

Ex. 31. A point moves subject to a periodic force F = A sin ωt. Compute the
motion for various initial velocities.

Ex. 32. The force acting on every mass m at the earth’s surface is propor- 
tional to m and is directed toward the centre of the earth. In a limited region 
we can take the force to be constant, and the earth’s surface to be plane. Inte- 
grate the equations of motion for this case, for various values of the initial velo- 
city, and draw the orbit and the hodograph of the motion. (Theory of projectiles 
in vacuo.) 

3. Time Integral and Path Integral of the Force. Work and Energy. 

(a) The time integral 

We can make two different statements for the effect of a force 
acting on a particle for a certain length of time, each of which leads 
to relationships having meaning. First, we can multiply the force 
by the time during which it acts; secondly, we can take the scalar 
product of the force by the vector displacement which it causes. 
If the force itself is variable, then instead of the finite time interval
we have the sum of the products of the instantaneous time values
of the force by the time element, i.e. the integral ∫F dt. A similar
formulation holds for the second product. If we apply the fundamental
law, the time integral becomes

\[ \int_{t_0}^{t_1} \mathbf{F}\,dt = \int_{t_0}^{t_1} \frac{d(m\mathbf{v})}{dt}\,dt = m\mathbf{v}_1 - m\mathbf{v}_0. \]   (11)

The time integral of the force (also called the impulse * of the force) is
equal to the change of momentum.

The impulse law (11) is especially important for a large force which
acts for a very short time Δt, so that the product FΔt remains finite,
and thus causes a finite change of momentum. Observation of the
change of momentum gives the product FΔt immediately, without
requiring a knowledge of the way the force varies. Here F represents
a mean value of the force, viz. the time average.


( 6 ) The jpath integral of the force 


If a force F acts on a particle, the work done by F when the
particle moves through a displacement dr is defined to be the scalar
product F·dr.

For a finite displacement along a curve C, the work is the sum of
such contributions F·dr, i.e. we have, for the work, the integral

    $W = \int_C \mathbf{F}\cdot d\mathbf{r}.$   (12)

If we substitute for F the equal vector m(d²r/dt²), and introduce the
time as variable of integration by using dr = (dr/dt) dt, then we
obtain

    $W = \int_{t_1}^{t_2} m\,\frac{d^2\mathbf{r}}{dt^2}\cdot\frac{d\mathbf{r}}{dt}\,dt = \frac{m}{2}\int_{t_1}^{t_2}\frac{d}{dt}\,(v^2)\,dt = \frac{m}{2}\,(v_2^2 - v_1^2).$   (13)

The entire amount of work done by the force is thus given by the
difference between the initial and final values of the expression ½mv²,
which we call the kinetic energy of the particle. Thus the total work
done by the force is equal to the difference of the kinetic energy in the
initial and final states. From the definition of work, and the fundamental
property of the scalar product, it follows that a force which
is always perpendicular to the orbit of a particle can do no work, i.e.
can cause no change in the kinetic energy. Such a force can change the
direction of the velocity, but not its amount.

* In view of equation (11), the term “impulse” has come to be used not only for [...]
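
Equation (13) can be checked numerically. The sketch below assumes one-dimensional motion under the periodic force F = A sin ωt of Ex. 31, with arbitrarily chosen illustrative values; it compares the work ∫F v dt with the change of ½mv².

```python
import math

def work_and_kinetic_change(m, A, omega, v0, t_end, n=20_000):
    """Return (W, dT) for one-dimensional motion under F = A sin(omega t)."""
    def force(t):
        return A * math.sin(omega * t)

    def velocity(t):
        # Exact integral of F/m with v(0) = v0.
        return v0 + A / (m * omega) * (1.0 - math.cos(omega * t))

    dt = t_end / n
    # W = integral of F dx = integral of F v dt (trapezoidal rule).
    W = 0.5 * (force(0.0) * velocity(0.0) + force(t_end) * velocity(t_end))
    for i in range(1, n):
        t = i * dt
        W += force(t) * velocity(t)
    W *= dt
    dT = 0.5 * m * (velocity(t_end) ** 2 - velocity(0.0) ** 2)
    return W, dT

# Illustrative values only (not from the text).
W, dT = work_and_kinetic_change(m=2.0, A=3.0, omega=5.0, v0=1.0, t_end=0.7)
```

The two numbers agree to within the error of the numerical integration.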

Ex. 33. A coin lies on top of a card which in turn rests on a drinking-glass.
Why does the coin remain on the card if the latter is slowly drawn away, while 
if the card is snapped away suddenly, the coin falls into the glass? (Remember 
the empirical fact that the friction between solid bodies is independent of the 
relative velocity.) 

Ex. 34. Decide whether or not the fatigue experienced in holding a weight 
with arm outstretched has any connexion with work done. 

4. Conservative Forces; Potential. 

[Fig. 3]

Consider a particle P in a field of force, i.e. in a portion of space
at every point of which there exists a force F, uniquely determined
by the co-ordinates. Let the particle P be moved from P_0 to P_1 (fig. 3).
We divide forces into two classes, according as the work done in the
displacement does not or does depend upon the path taken. This
difference may be characterized somewhat differently if we consider
also a return path P_1P_0. With the first kind of force, the integral
along P_1P_0 has the opposite value, since the curve is traversed in the
opposite sense. If, then, we traverse the closed curve P_0P_1P_0, the
integral vanishes if the force is of the first kind. Forces of this type, to
which, for example, the force of gravity belongs, are called conservative
forces. If a body is projected vertically upward in a vacuum,
with a given initial velocity, it will return to the starting-point with
the same speed; the kinetic energy has not changed, and no work
has been done on the complete path. The situation is quite different
if the resistance of air or of any fluid operates. In this case the body
returns to its point of projection with greatly reduced speed, and the
integral ∮F·dr cannot vanish, according to equation (13), p. 86.
Forces belonging to this second class are termed non-conservative. The
names originate from the fact that, in the first case, the total mechanical
energy is conserved (there is merely an interchange between
kinetic and potential energy; cf. below), while in the second instance,
a transformation to other energy forms (principally heat) takes
place.
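
The distinction between the two classes can be made concrete by evaluating the work around a closed curve. The following sketch assumes the unit circle as the closed path and two illustrative plane fields: uniform gravity, and a hypothetical "swirl" field (−y, x) that is not derivable from a potential. The latter is chosen only because it is simple; it is not the air resistance discussed in the text.

```python
import math

def closed_path_work(force, n=20_000):
    """Work done by force(x, y) -> (Fx, Fy) once around the unit circle."""
    W = 0.0
    dphi = 2.0 * math.pi / n
    for i in range(n):
        phi = (i + 0.5) * dphi                 # midpoint of each small arc
        x, y = math.cos(phi), math.sin(phi)
        dx = -math.sin(phi) * dphi             # components of the path element
        dy = math.cos(phi) * dphi
        Fx, Fy = force(x, y)
        W += Fx * dx + Fy * dy
    return W

def uniform_gravity(x, y):
    return (0.0, -9.81)        # conservative field

def swirl(x, y):
    return (-y, x)             # hypothetical non-conservative field

W_gravity = closed_path_work(uniform_gravity)   # vanishes
W_swirl = closed_path_work(swirl)               # equals 2*pi
```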

When forces are conservative, their discussion can be simplified
by introducing the “potential”. If the work done on a particle by
the field of force during a displacement is independent of the path,
then we can assign to every point of space the value of the work done
in a displacement from a “standard position” to the point P.
Thus, besides the vector field of the force (determined by three numbers),
we have a much simpler scalar field. For practical reasons we
take the negative of the work done by the field of force and call it the
Potential U. The change in U corresponding to a small displacement
dr is dr·grad U. The work done is then

    $dW = \mathbf{F}\cdot d\mathbf{r} = -dU = -(\operatorname{grad} U)\cdot d\mathbf{r},$   (14)

i.e.

    $\mathbf{F} = -\operatorname{grad} U.$   (14')

Every conservative force may be represented as the negative of the gradient
of a scalar point function, the potential. For a finite displacement we
have, by equation (13),

    $W = \int_{r_1}^{r_2} \mathbf{F}\cdot d\mathbf{r} = \tfrac12 m\,(v_2^2 - v_1^2) = -\int_{r_1}^{r_2} (\operatorname{grad} U)\cdot d\mathbf{r} = U_1 - U_2.$   (15)

Denoting the kinetic energy ½mv² by T, we have

    $T_2 + U_2 = T_1 + U_1 = \text{const.}$   (16)


This is the Principle of the Conservation of Energy for mechanics. 
U, which has the dimensions of energy, is called the Potential Energy.
Equation (16) thus means: The sum of the kinetic and potential energies 
is constant for conservative forces. 

This energy principle represents a first integral of Newton’s Second 
Law, since in (16) only v (i.e. ds/dt, the first derivative) appears. But 
since the Second Law, being a vector equation, is equivalent to three 
scalar differential equations, we need two additional first order equa- 
tions in order to complete the first step in the integration of the 
equations of motion. 

With a given field of force it is, as a rule, not difficult to write down 
the potential at once. F may then be found by taking the gradient. 
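Mathematically, this means the integration of the system

    $F_x = -\frac{\partial U}{\partial x}, \qquad F_y = -\frac{\partial U}{\partial y}, \qquad F_z = -\frac{\partial U}{\partial z},$

to which one must have recourse in case of doubt.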


Ex. 35. A particle is bound to a fixed centre by an elastic force, i.e. one
proportional to the distance from the centre. What is the potential correspond- 
ing to this force? 

5. Central Forces; the Law of Areas. 

A central force is one whose representative vector always points
toward a fixed point O. The magnitude of the force may be an arbitrary
point function. Naturally, we choose the point O as origin, so
that F = f(x, y, z)r.

In the case of central forces, an important theorem may be derived
immediately. This law furnishes two additional first integrals of the
fundamental equations. We form the vector product of r with each
side of the equation representing Newton’s Second Law. Since F is
in the direction of r, and since the vector product of two parallel
vectors is zero, we have


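    $0 = m\left[\mathbf{r}\,\frac{d^2\mathbf{r}}{dt^2}\right] \qquad\text{or}\qquad \left[\mathbf{r}\,\frac{d^2\mathbf{r}}{dt^2}\right] = 0.$

The integral may be given at once:

    $\left[\mathbf{r}\,\frac{d\mathbf{r}}{dt}\right] = \text{const.},$   (17)

for differentiation gives

    $\frac{d}{dt}\left[\mathbf{r}\,\frac{d\mathbf{r}}{dt}\right] = \left[\frac{d\mathbf{r}}{dt}\,\frac{d\mathbf{r}}{dt}\right] + \left[\mathbf{r}\,\frac{d^2\mathbf{r}}{dt^2}\right] = 0.$

[Fig. 4]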


The vector product [r dr/dt] has a simple geometric significance.
The magnitude of [r dr] is equal to twice the area swept out by the
radius vector in the time dt (fig. 4). If we divide the area for dt by
dt we obtain the area swept over by the radius vector in unit time,
the so-called areal velocity c. Equation (17) thus states that the areal
velocity is constant for motion under a central force. This is Kepler's
Second Law of Planetary Motion. But equation (17) says more. If the
product [r (dr/dt)] is equal to a constant vector, the normal to the
plane determined by r and dr has a fixed direction, i.e. the orbit
is plane. For this reason equation (17) represents only two scalar
equations which, however, together with the energy equation, are
the complete first integral of the equations of motion.

The vector m[r dr/dt] is usually called the moment of momentum or
angular momentum P, about O.
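
The constancy of the areal velocity can be observed in a numerical integration. The sketch below assumes an attractive inverse-square central force of unit strength and illustrative initial values, and uses the velocity-Verlet stepping scheme (a choice made here, not taken from the text); the quantity ½(x v_y − y v_x) is recorded at every step.

```python
def central_acc(x, y):
    """Acceleration of an attractive inverse-square central force (unit strength)."""
    r3 = (x * x + y * y) ** 1.5
    return -x / r3, -y / r3

def areal_velocities(x, y, vx, vy, dt=1.0e-3, steps=5000):
    """Integrate with velocity Verlet; return 0.5*(x*vy - y*vx) at every step."""
    out = []
    ax, ay = central_acc(x, y)
    for _ in range(steps):
        out.append(0.5 * (x * vy - y * vx))
        vx += 0.5 * dt * ax        # half step in velocity
        vy += 0.5 * dt * ay
        x += dt * vx               # full step in position
        y += dt * vy
        ax, ay = central_acc(x, y)
        vx += 0.5 * dt * ax        # second half step in velocity
        vy += 0.5 * dt * ay
    return out

# Illustrative start: unit distance, velocity 0.8 perpendicular to r.
c_values = areal_velocities(1.0, 0.0, 0.0, 0.8)
spread = max(c_values) - min(c_values)
```

The spread of the recorded values stays at the level of rounding error, since each sub-step changes the velocity only along r or the position only along v.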

Ex. 36. Verify the following generalization of the Law of Areas: if the force
acting on a particle always passes through a certain fixed axis, the Law of Areas
holds for the projection of the orbit on a plane perpendicular to the axis.

Ex. 37. From the equation of the orbit r = f(φ) and the areal velocity c deduce
the Law of Force. Use equation (9). Application of this line of thought to the
empirically known Kepler ellipses led Newton to the discovery of the Law of
Gravitation.
6. Gravitational Forces; Planetary Motion.

Starting with Kepler’s Laws, at that time known as empirical
facts, Newton derived his general law of attraction, specifying the
forces which determine the motions of the heavenly bodies. The
Newtonian Law of Universal Gravitation states that there is a force
of attraction between any two masses m_1 and m_2 which is directly
proportional to the product of the masses and inversely proportional
to the square of their distance apart. That is

    $F = \gamma\,\frac{m_1 m_2}{r^2}.$   (18)


Up to this point we have encountered the mass of a body only in
connexion with inertia; here it appears in an entirely different form,
as gravitational mass (weight). It does not follow that inertial and
gravitational mass are identical; i.e. more accurately stated, that the
ratio of inertial to gravitational mass is a constant (contained in γ)
for all bodies. If we assume the two kinds of mass to be the same,*
the constant γ has the value 6.66 × 10⁻⁸ C.G.S. units.

If we neglect small perturbations, the gravitational field of the 
sun alone determines the orbits of bodies in the solar system. On 
account of its predominantly large mass M, we can consider the sun 
fixed at the origin of a co-ordinate system. Strictly speaking, we 
should investigate the motion about the common centre of gravity, 
which is at rest (cf. Chap. VI). The force acting upon the body of
mass m is

    $\mathbf{F} = -\gamma\,\frac{Mm}{r^2}\,\frac{\mathbf{r}}{r}.$   (19)

The corresponding potential is

    $U = -\gamma\,\frac{Mm}{r},$   (20)

as may be verified by forming the gradient.
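
The verification by forming the gradient can also be carried out numerically. The sketch below assumes units in which γMm = 1 (an illustrative choice) and compares −grad U, formed by central differences, with the inverse-square force directed toward the origin.

```python
import math

GAMMA_M_m = 1.0   # product gamma*M*m in assumed illustrative units

def potential(x, y, z):
    return -GAMMA_M_m / math.sqrt(x * x + y * y + z * z)

def force_from_potential(x, y, z, h=1.0e-5):
    """F = -grad U, each component by a central difference of step h."""
    Fx = -(potential(x + h, y, z) - potential(x - h, y, z)) / (2.0 * h)
    Fy = -(potential(x, y + h, z) - potential(x, y - h, z)) / (2.0 * h)
    Fz = -(potential(x, y, z + h) - potential(x, y, z - h)) / (2.0 * h)
    return Fx, Fy, Fz

def force_direct(x, y, z):
    """Inverse-square attraction toward the origin."""
    r = math.sqrt(x * x + y * y + z * z)
    f = -GAMMA_M_m / r**3
    return f * x, f * y, f * z

point = (1.0, 0.5, -0.3)
numeric = force_from_potential(*point)
exact = force_direct(*point)
```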

Since the force is central, the orbit is plane. We can describe it
most simply in plane polar co-ordinates with the sun at the centre.
The first integrals of the equation of motion, the Energy Principle
and the Law of Areas, may be written down at once:

    $\tfrac12 mv^2 - \gamma\,\frac{Mm}{r} = \tfrac12 mv_0^2 - \gamma\,\frac{Mm}{r_0}$   (Conservation of Energy),

    $\left|\left[\mathbf{r}\,\frac{d\mathbf{r}}{dt}\right]\right| = r^2\dot\phi = 2c$   (Law of Areas).



* The equality of gravitational and inertial mass is a fundamental postulate of
the Generalized Theory of Relativity. All experimental attempts to detect individual
differences with various substances have been unsuccessful.
If we wish to describe the course of events in time, we must determine
r and φ as functions of t, by integrating again. Instead of doing this,
however, we limit ourselves to the derivation of the equation of the
orbit described by the body. For this purpose we eliminate the time
from the two equations. Using

    $v^2 = \dot r^2 + r^2\dot\phi^2$ (cf. p. 83),   $\dot r = \frac{dr}{d\phi}\,\dot\phi = \frac{dr}{d\phi}\,\frac{2c}{r^2},$

this is readily accomplished. From the energy equation,

    $\frac{4c^2}{r^4}\left(\frac{dr}{d\phi}\right)^2 + \frac{4c^2}{r^2} - \frac{2\gamma M}{r} = v_0^2 - \frac{2\gamma M}{r_0}.$

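The variables r and φ are immediately separable and we have

    $d\phi = \frac{2c\,dr}{r^2\sqrt{v_0^2 - \dfrac{2\gamma M}{r_0} + \dfrac{2\gamma M}{r} - \dfrac{4c^2}{r^2}}}.$

If we introduce a new variable u = 1/r, we obtain

    $d\phi = -\frac{2c\,du}{\sqrt{v_0^2 - \dfrac{2\gamma M}{r_0} + 2\gamma M u - 4c^2u^2}}.$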

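This integral is of the familiar arc-cosine form

    $\int \frac{dx}{\sqrt{a + 2bx - hx^2}} = \frac{1}{\sqrt{h}}\,\cos^{-1}\!\left(\frac{b - hx}{\sqrt{b^2 + ah}}\right),$

here with x = u, a = v_0² − 2γM/r_0, b = γM and h = 4c². If we
introduce the abbreviation

    $\epsilon = \frac{2c}{\gamma M}\sqrt{v_0^2 - \frac{2\gamma M}{r_0} + \frac{\gamma^2M^2}{4c^2}}$

and take the cosine of each member, we obtain, on absorbing a shift
of π into the arbitrary constant of integration C,

    $r = \frac{4c^2/\gamma M}{1 + \epsilon\cos(\phi + C)} = \frac{p}{1 + \epsilon\cos(\phi + C)}.$   (21)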


We may measure φ from the maximum r, so that C = π. This gives
the familiar polar equation of a conic with origin at one focus. This is
Kepler’s First Law, which states that the orbits of bodies in the solar
system are conics, with the sun occupying one focus.

The orbit is an ellipse, parabola or hyperbola, according as the
numerical value of the eccentricity ε is less than, equal to, or
greater than unity. Putting in the value of ε, we have an
ellipse if

    $\frac{2c}{\gamma M}\sqrt{v_0^2 - \frac{2\gamma M}{r_0} + \frac{\gamma^2M^2}{4c^2}} < 1, \quad\text{or}\quad v_0^2 < \frac{2\gamma M}{r_0}, \quad\text{or}\quad \tfrac12 mv_0^2 < \frac{\gamma Mm}{r_0};$

a parabola if

    $\tfrac12 mv_0^2 = \frac{\gamma Mm}{r_0};$

and a hyperbola if

    $\tfrac12 mv_0^2 > \frac{\gamma Mm}{r_0}.$

That is, the orbit is an ellipse, parabola or hyperbola according as
initially, the kinetic energy is less than, equal to, or greater than the
potential energy with sign changed. Thus, for an elliptic orbit, the total energy
is negative.
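
The classification can be put into a few lines of code. The following sketch assumes that γM, the initial distance r_0, the initial speed v_0 and the areal velocity c are given in consistent units; the function name and the test values are illustrative. It evaluates the parameter p and the eccentricity ε of equation (21) and decides the type of conic from the sign of the total energy per unit mass.

```python
import math

def orbit_elements(gamma_M, r0, v0, c):
    """Return (p, eps, kind) for the conic of equation (21)."""
    p = 4.0 * c * c / gamma_M
    # eps^2 = (4c^2 / gamma^2 M^2) (v0^2 - 2 gamma M / r0 + gamma^2 M^2 / 4c^2)
    eps_sq = 1.0 + (4.0 * c * c / gamma_M**2) * (v0 * v0 - 2.0 * gamma_M / r0)
    eps = math.sqrt(max(eps_sq, 0.0))
    energy_per_mass = 0.5 * v0 * v0 - gamma_M / r0
    if energy_per_mass < 0.0:
        kind = "ellipse"
    elif energy_per_mass == 0.0:      # exact equality is fragile in floating point
        kind = "parabola"
    else:
        kind = "hyperbola"
    return p, eps, kind

# Velocity perpendicular to r0, so that c = r0 * v0 / 2 (illustrative values).
circular = orbit_elements(gamma_M=1.0, r0=1.0, v0=1.0, c=0.5)
fast = orbit_elements(gamma_M=1.0, r0=1.0, v0=2.0, c=1.0)
```

In the first case ε vanishes and the ellipse degenerates into a circle; in the second the kinetic energy exceeds the potential energy with sign changed and the orbit is a hyperbola.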

The following additional relations may be derived for the case of
an ellipse. The parameter of the ellipse

    $p = \frac{b^2}{a} = \frac{(\text{semi-minor axis})^2}{\text{semi-major axis}} = a\,(1 - \epsilon^2)$

is equal to 4c²/γM, by equation (21). Hence

    $1 - \epsilon^2 = \frac{4c^2}{\gamma Ma}.$

If we denote the period by T, the area of the ellipse is

    $cT = \pi ab = \pi a^2\sqrt{1 - \epsilon^2} = \frac{2\pi a^2 c}{\sqrt{\gamma Ma}}.$

Thus

    $T^2 = \frac{4\pi^2 a^3}{\gamma M}.$   (22)


This is Kepler’s Third Law: The squares of the periods of the planets 
are proportional to the cubes of their orbital major axes. 
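
As a numerical illustration of equation (22), the sketch below computes the period of the earth's orbit. The values of γM for the sun and of the semi-major axis are rounded figures assumed here for illustration; they are not taken from the text.

```python
import math

def kepler_period(a, gamma_M):
    """Period from equation (22): T = 2*pi*sqrt(a^3 / (gamma*M))."""
    return 2.0 * math.pi * math.sqrt(a**3 / gamma_M)

GAMMA_M_SUN = 1.327e26   # cm^3 s^-2, assumed rounded value of gamma*M for the sun
A_EARTH = 1.496e13       # cm, assumed rounded semi-major axis of the earth's orbit

T_earth_days = kepler_period(A_EARTH, GAMMA_M_SUN) / 86400.0

# Quadrupling the major axis multiplies the period by 4**1.5 = 8.
ratio = kepler_period(4.0 * A_EARTH, GAMMA_M_SUN) / kepler_period(A_EARTH, GAMMA_M_SUN)
```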

If instead of using the parameter p to compute 1 − ε² we use the
value of the eccentricity itself from (21), we have, on setting the two
equal,

    $\frac{4c^2}{\gamma Ma} = 1 - \frac{4c^2}{\gamma^2M^2}\left(v_0^2 - \frac{2\gamma M}{r_0} + \frac{\gamma^2M^2}{4c^2}\right),$

or

    $W = \tfrac12 mv_0^2 - \frac{\gamma Mm}{r_0} = -\frac{\gamma Mm}{2a}.$   (23)

This means that the total energy in an elliptical orbit is a function only
of the size of the major axis, and all ellipses having the same major axis
(including the circle) have the same energy. This proposition simplifies
many computations in the older atomic mechanics. The total energy
has a negative sign because of the dominating magnitude of the potential
energy, which was so specified upon introduction of the potential
that it vanishes at infinity but is negative at finite distances.

7. Quasi-elastic Forces and Harmonic Vibrations. 

If a body is bound to a fixed point by means of a spring or a rubber
band, any small displacement from the position of rest calls forth a
restoring force proportional to the displacement and directed toward
the position of rest, which we take as origin. That is,

    $\mathbf{F} = -k\mathbf{r}.$

We shall call all central forces of this form quasi-elastic forces, even
though, as may well happen, the force is not of an elastic nature (cf.
Ex. 38, p. 94). The differential equation becomes

    $m\,\frac{d^2\mathbf{r}}{dt^2} + k\mathbf{r} = 0.$   (24)


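We might now, as before, use the Principle of Energy and the Law of
Areas to get first integrals, but direct integration, twice repeated, is
just as simple here. If we try the solution

    $\mathbf{r} = \mathbf{a}\,e^{\lambda t}, \qquad \ddot{\mathbf{r}} = \lambda^2\mathbf{a}\,e^{\lambda t},$

we obtain for λ the “characteristic equation”

    $m\lambda^2 + k = 0,$

whose roots are

    $\lambda_1 = +i\sqrt{\frac{k}{m}} \qquad\text{and}\qquad \lambda_2 = -i\sqrt{\frac{k}{m}}.$

The general solution, containing two constant vectors, is

    $\mathbf{r} = \mathbf{a}\,e^{i\sqrt{k/m}\,t} + \mathbf{b}\,e^{-i\sqrt{k/m}\,t}.$   (25)

Using real functions (cf. p. 47), this is

    $\mathbf{r} = \mathbf{A}\cos\sqrt{\frac{k}{m}}\,t + \mathbf{B}\sin\sqrt{\frac{k}{m}}\,t.$   (25')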


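The meaning of the vectors A and B is readily ascertained. For
t = 0,

    $\mathbf{r} = \mathbf{r}_0 = \mathbf{A}, \qquad \frac{d\mathbf{r}}{dt} = \mathbf{v}_0 = \sqrt{\frac{k}{m}}\,\mathbf{B},$

so that

    $\mathbf{r} = \mathbf{r}_0\cos\sqrt{\frac{k}{m}}\,t + \mathbf{v}_0\sqrt{\frac{m}{k}}\,\sin\sqrt{\frac{k}{m}}\,t.$

The vector r is thus the resultant of two vectors which have the
directions of the initial radius vector and initial velocity, respectively,
and whose magnitudes are simple periodic functions of the time. To
obtain the orbit, we introduce the unit vectors e_r and e_v. In this
oblique co-ordinate system, the co-ordinates of P are

    $\xi = r_0\cos\sqrt{\frac{k}{m}}\,t, \qquad \eta = v_0\sqrt{\frac{m}{k}}\,\sin\sqrt{\frac{k}{m}}\,t.$

Elimination of t gives

    $\frac{\xi^2}{r_0^2} + \frac{k}{m}\,\frac{\eta^2}{v_0^2} = 1.$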


This is the equation of an ellipse in oblique co-ordinates, the co-ordinate
axes being in the directions of two conjugate diameters. The general
orbit of a particle moving under the influence of a quasi-elastic force is
the ellipse having the initial position vector and the initial velocity vector
as conjugate radii, and having its centre at the centre of force. If
v_0 has the direction of r_0, the vibration is linear. According to equation
(25), the frequency of a harmonic vibration is given by 1/2π times
the square root of the quotient of the elastic constant by the mass:

    $\nu = \frac{1}{2\pi}\sqrt{\frac{k}{m}}.$   (26)
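
The solution and the frequency (26) can be checked numerically. The sketch below assumes plane motion with illustrative values of k, m, r_0 and v_0; it evaluates r(t) from the initial position and velocity, with B = v_0 √(m/k) as follows from the initial conditions, and provides the frequency of equation (26).

```python
import math

def harmonic_position(r0, v0, k, m, t):
    """r(t) = r0 cos(wt) + (v0/w) sin(wt), with w = sqrt(k/m); r0, v0 are tuples."""
    w = math.sqrt(k / m)
    c, s = math.cos(w * t), math.sin(w * t)
    return tuple(a * c + b / w * s for a, b in zip(r0, v0))

def frequency(k, m):
    """Equation (26): 1/(2 pi) times sqrt(k/m)."""
    return math.sqrt(k / m) / (2.0 * math.pi)

# Illustrative values only.
k, m = 4.0, 1.0
r0, v0 = (1.0, 0.0), (0.5, 2.0)

period = 1.0 / frequency(k, m)
back_again = harmonic_position(r0, v0, k, m, period)   # returns to r0
```

After one full period the particle is back at its initial position, whatever the direction of the initial velocity.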


Ex. 38. A particle constrained to move on a straight line is subjected to an
electrostatic force of attraction F_1 = −α/x² and a repulsive force F_2 = +β/x¹⁰.
Calculate the frequency of the vibration resulting from a small displacement
from the position of equilibrium x_0.

8. Harmonic Vibrations with Frictional Resistance. 

In addition to the quasi-elastic force, let the particle be subjected
to a frictional force, as is actually the case with vibratory motion in
practice. Let us assume that this resisting force is proportional to
the speed of the particle. A resistance of this kind exists when the
particle moves at low speeds in the air or in any other fluid. Friction
between solid bodies is, on the contrary, independent of the speed,
within wide limits. Whatever the law of dependence of the resisting
force on the velocity may be, this force is always opposite in direction
to the velocity of the particle. The total force acting on P is therefore

    $\mathbf{F} = -k\mathbf{r} - \beta\,\frac{d\mathbf{r}}{dt}.$   (27)

We limit ourselves, in what follows, to rectilinear motion, and call
the co-ordinate x. Then the equation of motion is

    $m\,\frac{d^2x}{dt^2} + \beta\,\frac{dx}{dt} + kx = 0.$   (27')

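To integrate, we again try

    $x = a\,e^{\lambda t},$

which leads to the “characteristic equation”

    $m\lambda^2 + \beta\lambda + k = 0.$

Its roots are

    $\lambda_1 = -\frac{\beta}{2m} + \sqrt{\frac{\beta^2}{4m^2} - \frac{k}{m}} \qquad\text{and}\qquad \lambda_2 = -\frac{\beta}{2m} - \sqrt{\frac{\beta^2}{4m^2} - \frac{k}{m}}.$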

Two cases must be distinguished here:

(a) β² > 4km. In this case only real quantities appear, and the
general solution is

    $x = e^{-\frac{\beta}{2m}t}\left(a\,e^{\sqrt{\frac{\beta^2}{4m^2}-\frac{k}{m}}\;t} + b\,e^{-\sqrt{\frac{\beta^2}{4m^2}-\frac{k}{m}}\;t}\right).$

In this aperiodic case, corresponding to large damping, the particle
gradually returns to its equilibrium position, which it reaches at t = ∞.


Even the limiting case β² = 4km results in an aperiodic solution. Superficially,
it would seem that a double root of the characteristic equation would yield only one
constant of integration. However, it is seen that here bt e^{λt} is also a solution, so
that the general solution is

    $x = (a + bt)\,e^{-\frac{\beta}{2m}t}.$

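(b) β² < 4km. There are now imaginary exponents:

    $x = e^{-\frac{\beta}{2m}t}\left(a\,e^{\,i\sqrt{\frac{k}{m}-\frac{\beta^2}{4m^2}}\;t} + b\,e^{-i\sqrt{\frac{k}{m}-\frac{\beta^2}{4m^2}}\;t}\right).$

Applying the Euler formula, and introducing new constants of
integration, we obtain the real form

    $x = e^{-\frac{\beta}{2m}t}\left(A\cos\sqrt{\frac{k}{m}-\frac{\beta^2}{4m^2}}\;t + B\sin\sqrt{\frac{k}{m}-\frac{\beta^2}{4m^2}}\;t\right),$

or

    $x = C\,e^{-\frac{\beta}{2m}t}\cos\left(\sqrt{\frac{k}{m}-\frac{\beta^2}{4m^2}}\;t - \gamma\right).$   (30)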
We again have a harmonic vibration whose amplitude, however,
decreases exponentially (cf. fig. 5). This damped vibration has the
following important properties: The frequency

    $\nu = \frac{1}{2\pi}\sqrt{\frac{k}{m} - \frac{\beta^2}{4m^2}}$

is less than

    $\nu_0 = \frac{1}{2\pi}\sqrt{\frac{k}{m}},$

that of the undamped vibration. The ratio of two successive maximum
deflections in the same direction is

    $\rho = e^{\frac{\beta T}{2m}}.$   (31)

The natural logarithm of this ratio is called the logarithmic decrement δ.
If the damping is small, we make no serious error by putting the period
of the undamped motion in place of the period T. Thus is obtained the
simple expression

    $\delta = \frac{\beta}{2m}\cdot 2\pi\sqrt{\frac{m}{k}} = \frac{\pi\beta}{\sqrt{km}}.$

[Fig. 5]

Note: In mechanics, especially in treating the problem of the pendulum, the 
period is often taken to be half the quantity here designated by T. It is thus 
taken as the time between an extreme deflection on one side and the next one 
on the opposite side. Correspondingly, the logarithmic decrement is often referred 
to the half-period. 
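
The distinction of cases and the logarithmic decrement can be sketched in code. The function below assumes that m, β and k are given in consistent units; its name and the dictionary keys are illustrative. For the periodic case it returns the frequency, the decrement formed with the true period T, and the small-damping approximation πβ/√(km).

```python
import math

def damped_oscillator(m, beta, k):
    """Classify the damping and, for case (b), return frequency and decrement."""
    disc = beta * beta - 4.0 * k * m
    if disc > 0.0:
        return {"case": "aperiodic"}
    if disc == 0.0:
        return {"case": "limiting aperiodic"}
    omega = math.sqrt(k / m - beta * beta / (4.0 * m * m))
    period = 2.0 * math.pi / omega
    return {
        "case": "damped vibration",
        "frequency": 1.0 / period,
        "decrement": beta * period / (2.0 * m),                 # ln of ratio (31)
        "decrement_small_damping": math.pi * beta / math.sqrt(k * m),
    }

# Illustrative values only.
weak = damped_oscillator(m=1.0, beta=0.1, k=4.0)
strong = damped_oscillator(m=1.0, beta=5.0, k=4.0)
```

For weak damping the two values of the decrement differ only by a fraction of a per cent.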




9. Forced Vibrations; Resonance. 

In addition to the forces previously taken to operate during a
harmonic vibration, let us assume that a periodic external force is
present. This force may have the form

    $F_a = F_0\cos\omega t.$

With these three forces (quasi-elastic, frictional, and applied) the
equation of motion becomes

    $m\,\frac{d^2x}{dt^2} + \beta\,\frac{dx}{dt} + kx = F_0\cos\omega t.$   (32)

Introducing the characteristic angular frequency of the free, undamped
vibration, ω_0² = k/m, we have

    $\frac{d^2x}{dt^2} + \frac{\beta}{m}\,\frac{dx}{dt} + \omega_0^2\,x = \frac{F_0}{m}\cos\omega t.$   (32')
To integrate this equation we use a theorem from the theory of linear
differential equations, which states: If g(t) is a particular integral of
this non-homogeneous equation, and if f(t, A, B) is the general integral
of the corresponding homogeneous equation,

    $\frac{d^2x}{dt^2} + \frac{\beta}{m}\,\frac{dx}{dt} + \omega_0^2\,x = 0,$

then the sum g(t) + f(t, A, B) is the general integral of the
non-homogeneous equation. On account of the linearity of the differential
equation (32'), i.e. the absence of products of the dependent variable
and its derivatives, the sum g(t) + f(t, A, B) satisfies the equation,
as may be seen by substitution. Since this solution contains two constants
of integration, it is also the general solution. The integral
of the homogeneous equation is known from the preceding paragraph,
and so we need only find a particular integral of the non-homogeneous
equation. Since the force F_a is a periodic function of angular frequency
ω, the trial of a periodic function of this kind is suggested.
But since the first as well as the second derivative of x appears, it is
evident that a sine or a cosine function alone cannot satisfy the equation.
We thus try

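    $x = p\cos\omega t + q\sin\omega t = \sqrt{p^2 + q^2}\,\cos(\omega t - \phi),$

where the coefficients p and q are still undetermined. If we substitute,
in equation (32'), the condition that the coefficients of cos ωt and of
sin ωt must vanish leads to the following equations for p and q:

    $p\,(\omega_0^2 - \omega^2) + q\,\frac{\beta\omega}{m} = \frac{F_0}{m},$

    $-p\,\frac{\beta\omega}{m} + q\,(\omega_0^2 - \omega^2) = 0.$

From these we find

    $p = \frac{mF_0\,(\omega_0^2 - \omega^2)}{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2}, \qquad q = \frac{\beta\omega F_0}{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2},$

    $\sqrt{p^2 + q^2} = \frac{F_0}{\sqrt{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2}}, \qquad \tan\phi = \frac{q}{p} = \frac{\beta\omega}{m\,(\omega_0^2 - \omega^2)}.$   (33)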
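
The expressions (33) are easily evaluated and checked. The sketch below assumes illustrative values of m, β, k, F_0 and ω; the function name is hypothetical. It returns p, q, the amplitude and the phase, and the substitution of x = p cos ωt + q sin ωt into the equation of motion can then be tested directly.

```python
import math

def steady_state(m, beta, k, F0, omega):
    """Coefficients p, q and amplitude, phase of the forced vibration, equation (33)."""
    w0_sq = k / m
    denom = m * m * (w0_sq - omega**2) ** 2 + beta**2 * omega**2
    p = m * F0 * (w0_sq - omega**2) / denom
    q = beta * omega * F0 / denom
    amplitude = math.hypot(p, q)
    phase = math.atan2(q, p)          # lies between 0 and pi for beta > 0
    return p, q, amplitude, phase

def residual(m, beta, k, F0, omega, t):
    """m x'' + beta x' + k x - F0 cos(omega t) for x = p cos(omega t) + q sin(omega t)."""
    p, q, _, _ = steady_state(m, beta, k, F0, omega)
    c, s = math.cos(omega * t), math.sin(omega * t)
    x = p * c + q * s
    x_dot = omega * (-p * s + q * c)
    x_ddot = -omega**2 * x
    return m * x_ddot + beta * x_dot + k * x - F0 * c

# Illustrative values only.
p, q, amp, phi = steady_state(m=1.0, beta=0.3, k=4.0, F0=2.0, omega=1.5)
at_resonance = steady_state(m=1.0, beta=0.3, k=4.0, F0=2.0, omega=2.0)
```

At ω = ω_0 the coefficient p vanishes and the phase is π/2.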


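The general solution for small damping [Case (b), p. 95] is

    $x = C\,e^{-\frac{\beta}{2m}t}\cos\left(\sqrt{\frac{k}{m}-\frac{\beta^2}{4m^2}}\;t - \gamma\right) + \frac{F_0}{\sqrt{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2}}\,\cos(\omega t - \phi).$   (34)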
The first part represents a damped vibration having the period of
the characteristic free vibration of the damped system, i.e. it corresponds
to a transient phenomenon determined by the initial conditions,
which determine C and γ. This part dies out after some time.
The second part represents a permanent vibration of the same period
as the exciting force. This is the steady state. For stronger damping
[Case (a)] the first part becomes aperiodic, while the form of the second
part remains unchanged.

We could have found the particular integral which describes the
steady state much more simply by using the vector diagram (p. 52)
which always leads to a result rapidly in problems involving the
determination of the permanent modes, or stationary states, in vibration
processes. If we write for the exciting force, F_a = F_0 e^{iωt}, and take
the solution to be of the form

    $x = a\,e^{i(\omega t - \phi)},$
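then, by equation (32') (p. 96) we must have

    $a\left(-\omega^2 + \frac{\beta}{m}\,i\omega + \omega_0^2\right)e^{i(\omega t - \phi)} = \frac{F_0}{m}\,e^{i\omega t},$

or

    $a\left(\omega_0^2 - \omega^2 + i\,\frac{\beta\omega}{m}\right) = \frac{F_0}{m}\,e^{i\phi};$   (33')

the addition of the complex numbers, or vectors, on the left (fig. 6)
gives a sum of modulus

    $a\sqrt{(\omega_0^2 - \omega^2)^2 + \frac{\beta^2\omega^2}{m^2}}$

from the vector diagram. This has to be equal to F_0/m, i.e.

    $a = \frac{F_0}{m\sqrt{(\omega_0^2 - \omega^2)^2 + \dfrac{\beta^2\omega^2}{m^2}}}.$   (33'')

[Fig. 6]

The two complex members (vectors) of equation (33') must agree in
direction as well as in magnitude, i.e.

    $\tan\phi = \frac{\beta\omega}{m\,(\omega_0^2 - \omega^2)}.$   (33''')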

Thus the difference in phase between the exciting force and the forced
vibration, as well as the amplitude of the latter, both depend upon
ω_0² − ω². The phase difference approaches zero for the case of very
slow vibrations, and increases to π/2 for ω = ω_0 (resonance; see below),
becoming π for infinitely rapid oscillations. When ω is given, the
amplitude is greatest when ω_0 = ω; i.e. when the frequency of the
exciting force coincides with the natural frequency of the undamped
system, not with the natural frequency of the actual system. In this
case we speak of resonance.* In the absence of all damping the
amplitude becomes infinite at resonance. To show how the amplitude
depends on the frequency of the exciting force, i.e. to draw the
so-called resonance curve, it is convenient to choose the square
of the amplitude as ordinate. If we use the value for a² from
equation (33''), it follows that

    $a^2 = \frac{F_0^2}{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2} = a_{\max}^2\,\frac{\beta^2\omega^2}{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2}, \qquad a_{\max}^2 = \frac{F_0^2}{\beta^2\omega^2}.$

[Fig. 7: resonance curve; abscissa x = ω − ω_0]

Introducing the “discord” ω − ω_0 as independent variable x and
putting, as an approximation,

    $\omega_0 + \omega = 2\omega, \qquad \omega_0^2 - \omega^2 = -2\omega x$

in the neighbourhood of the resonance point, we obtain this simple
equation for the resonance curve (fig. 7):

    $\frac{a^2}{a_{\max}^2} = \frac{\beta^2}{4m^2x^2 + \beta^2}.$   (35)

For x = β/2m, a²/a²_max becomes β²/(β² + β²) = ½; that is,

    $\omega - \omega_0 = \frac{\beta}{2m} = \frac{\delta}{T}$   (cf. equation (31)),

or

    $\frac{\omega - \omega_0}{\omega_0} = \frac{\delta}{2\pi}.$

We therefore have the theorem: Half the interval between the two frequencies
for which the square of the amplitude has half its maximum
value, these frequencies being measured in terms of the free undamped
frequency as unit, is approximately equal to the logarithmic decrement
divided by 2π. That is, the weaker the damping of the system, the
sharper the resonance. If we introduce the same simplification into
(33'''), we see that the phase differences at the positions of half-value
are π/4 and 3π/4 respectively.

* Strictly, this is the velocity resonance (cf. Ex. 39).
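
The theorem on the half-value width may be tried out numerically. The sketch below assumes that ω_0 is held fixed and ω varied, and that the ratio a²/a²_max is taken in the form β²ω²/[m²(ω_0² − ω²)² + β²ω²]; the numerical values are illustrative. The two frequencies at which this ratio equals ½ follow from m(ω² − ω_0²) = ±βω.

```python
import math

def half_value_frequencies(m, beta, k):
    """Angular frequencies at which the ratio a^2 / a_max^2 equals 1/2."""
    w0 = math.sqrt(k / m)
    root = math.sqrt(beta**2 / (4.0 * m**2) + w0**2)
    return root - beta / (2.0 * m), root + beta / (2.0 * m)

def amplitude_ratio_sq(m, beta, k, w):
    """beta^2 w^2 / (m^2 (w0^2 - w^2)^2 + beta^2 w^2)."""
    w0_sq = k / m
    return beta**2 * w**2 / (m**2 * (w0_sq - w**2) ** 2 + beta**2 * w**2)

# Illustrative values only.
m, beta, k = 1.0, 0.05, 4.0
w0 = math.sqrt(k / m)
w_lo, w_hi = half_value_frequencies(m, beta, k)

half_width_relative = 0.5 * (w_hi - w_lo) / w0
decrement_over_2pi = beta / (2.0 * m * w0)   # delta / (2 pi), with T taken as 2 pi / w0
```

Under these assumptions half the interval, in units of ω_0, coincides with δ/2π formed from the small-damping value of the decrement.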

Ex. 39. If the equation of the resonance curve is calculated without resorting 
to the approximations made in the text, it is found that the maximum of the 
square of the amplitude does not occur at the natural frequency of the damped 
motion, nor at that of the undamped vibration. It is true, however, that the 
particle has its maximum energy (defined as the kinetic energy while passing 
through the zero-point) at ω = ω_0. The maximum of the time average of the
energy comes at yet another place. Compute the positions of the maxima of the 
specified quantities. 


10. Non-harmonic Vibrations; Sudden Changes of Amplitude. 

Rigorously quasi-elastic forces, which are exactly proportional to
the amplitude, even when it is very large, do not occur in nature.
The actual binding forces are to be represented by power series in
powers of the displacement. In most cases, a single additional term,
along with the first power, is sufficient. Now there are two essentially
different kinds of possible laws of force. If the force is symmetric, i.e. of
equal magnitude at corresponding points on both sides of the position of
equilibrium, or rest-point, then only odd powers may occur in the law of
force; otherwise we have an unsymmetric law of force, and hence an
unsymmetric vibration. We restrict ourselves here to the more common
case of symmetric vibration, and add a term +εx³ to the quasi-elastic
force −kx. We further assume that ε is positive, which means
that the binding becomes looser with increasing amplitude. The
equation of motion for free vibrations of this system may be integrated
readily; however, the result is expressible only by means of elliptic
functions. The important fact is that the natural frequency decreases
with increasing amplitude. This becomes evident qualitatively if we
consider the actual restoring force replaced by a mean quasi-elastic
force. On account of the form of the law of force, the restoring force
will certainly become smaller with increasing amplitude x, and so,
by equation (26) (p. 94), the natural frequency will also become less.
The behaviour of such a system under the influence of a periodic
external force is very peculiar. Here, even with no damping, the amplitude
cannot increase without limit, for if resonance exists for infinitely
small amplitudes, an increase in amplitude must cause a departure
from the natural frequency, so that resonance no longer exists for larger
amplitudes. The amplitude in the stationary state may be calculated
by the following approximate method. The differential equation is
    $m\,\frac{d^2x}{dt^2} + kx - \epsilon x^3 = F_0\cos\omega t.$   (36)

The stationary solution is, at any rate, a periodic function of the time
having a fundamental frequency ω/2π, and is thus representable by a
Fourier series in multiples of ω. As a first approximation, we begin
with the fundamental frequency and set

    $x_1 = a\cos\omega t.$

The amplitude a is still to be determined. If we substitute this
expression for x in equation (36), and make use of the trigonometrical
formula

    $\cos^3\omega t = \tfrac14\cos 3\omega t + \tfrac34\cos\omega t,$

we obtain the equation

    $\left(-ma\omega^2 + ka - \tfrac34\,\epsilon a^3\right)\cos\omega t - \tfrac14\,\epsilon a^3\cos 3\omega t = F_0\cos\omega t.$

If the fundamental vibration is to satisfy equation (36), we must have

    $\frac34\,\frac{\epsilon}{m}\,a^3 + (\omega^2 - \omega_0^2)\,a + \frac{F_0}{m} = 0,$   (37)

where we have put ω_0 for √(k/m), the natural frequency at infinitesimal
amplitude. The determination of the roots of this equation will be
discussed below. At this point we wish to show how the next approximations
are obtained. If equation (36) is solved for d²x/dt², and if
the first approximation is substituted in the right member, there
results

    $\frac{d^2x}{dt^2} = \frac{F_0}{m}\cos\omega t - \frac{k}{m}\,a\cos\omega t + \frac34\,\frac{\epsilon a^3}{m}\cos\omega t + \frac{\epsilon a^3}{4m}\cos 3\omega t.$

By (37), this is equal to

    $-\omega^2 a\cos\omega t + \frac{\epsilon a^3}{4m}\cos 3\omega t.$

Integration gives

    $x_2 = a\cos\omega t - \frac{\epsilon a^3}{36\,m\omega^2}\cos 3\omega t.$   (38)

This second approximation may then be substituted in equation (36).
In this manner, any number of terms of the Fourier series may be
calculated. The convergence of this process requires a separate
investigation. It may be said here, however, that the series converges
for ε small.

It remains to discuss the solution of the cubic equation for a. We
obtain (fig. 8) a graphically in a y, a co-ordinate system as abscissa of
the intersection of the cubical parabola

    $y = \frac34\,\frac{\epsilon}{m}\,a^3$

with a straight line

    $y = (\omega_0^2 - \omega^2)\,a - \frac{F_0}{m}.$

[Fig. 8]

If we start with ω large, the slope of the straight line is negative, and
there is only one point of intersection P_0. There is also but one
intersection for ω = ω_0. If, now, ω decreases, the straight line assumes
a direction such that there are three points of intersection P_1,
P_2, P_3, corresponding to three possible amplitudes. The resonance
curve has the distorted form shown in fig. 9. A more thorough
analysis shows that if we approach from the low-frequency side, the
amplitude corresponding to the lower branch is the stable one. As ω
increases we arrive at the limiting point G; from this point onward, only
the upper branch yields a real point of intersection in fig. 9. Thus, with
continuous increase of ω, the amplitude will jump suddenly from the
lower branch to the upper, at or before G. Such discontinuities (jumps)
in amplitude, attributable to the non-harmonic nature of the restoring
force, are frequently observed in vibration processes, both mechanical
and electrical.

[Fig. 9]
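
The graphical discussion of the cubic (37) can be supplemented by a direct computation of its real roots. The sketch below assumes illustrative values of m, k, ε and F_0; the function name is hypothetical. It reduces (37) to the form a³ + Pa + Q = 0 and uses the trigonometric solution when there are three real roots, Cardano's formula otherwise.

```python
import math

def stationary_amplitudes(m, k, eps, F0, omega):
    """Real roots a of (3/4)(eps/m) a^3 + (omega^2 - omega0^2) a + F0/m = 0."""
    lead = 0.75 * eps / m
    P = (omega**2 - k / m) / lead
    Q = (F0 / m) / lead
    disc = -4.0 * P**3 - 27.0 * Q**2
    if disc > 0.0:
        # Three real roots: trigonometric form of the solution.
        R = 2.0 * math.sqrt(-P / 3.0)
        theta = math.acos(3.0 * Q / (P * R)) / 3.0
        return sorted(R * math.cos(theta - 2.0 * math.pi * j / 3.0) for j in range(3))
    # One real root: Cardano's formula with real cube roots.
    s = math.sqrt(Q * Q / 4.0 + P**3 / 27.0)

    def cbrt(u):
        return math.copysign(abs(u) ** (1.0 / 3.0), u)

    return [cbrt(-Q / 2.0 + s) + cbrt(-Q / 2.0 - s)]

# Illustrative values: omega0 = 1.
above = stationary_amplitudes(m=1.0, k=1.0, eps=1.0, F0=0.1, omega=1.5)
at_w0 = stationary_amplitudes(m=1.0, k=1.0, eps=1.0, F0=0.1, omega=1.0)
below = stationary_amplitudes(m=1.0, k=1.0, eps=1.0, F0=0.1, omega=0.5)
```

With these values there is a single amplitude for ω above and at ω_0, and three possible amplitudes for the lower frequency.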

Ex. 40. Treat in similar manner the unsymmetric non-harmonic vibration
having F = −kx + εx² as the law of force, and determine the mean value of
the displacement.

11. Mechanics of a Constrained Particle. The Simple Pendulum. 

The motion of a particle is often limited by geometric conditions. 
If, for example, we consider a small mass hanging by a thread of length
l, its motion is restrained to the extent that it cannot move beyond
the surface of a sphere of radius l. The whole interior of the sphere,
however, is accessible in this case. The geometric condition would be
represented by the inequality |r| ≤ l. But the commonest case is
one in which the motion is confined to a surface or to a curve (i.e. the 
intersection of two surfaces). The condition is thus given by one or 
by two equations. In the last analysis, however, the physical attain- 
ment of these geometric conditions is realized by means of “ forces ”. 
In the given example, these forces are the molecular forces between 
the particles of the thread and those at the junction of thread and 
weight. However, we need not concern ourselves with the atomic 
interpretation of these constraining forces. It is sufficient to replace 
the geometric condition by an equivalent constraining force in order 
to reduce the mechanics of a constrained particle to the case of the 
free particle. 

An essential characteristic of all constraining forces is that they 
are perpendicular to the surface to which the motion is confined by 
these forces. If this were not the case, the constraints alone (i.e. the 
mere existence of a surface upon which the particle rests) could set it 
in motion without the action of externally impressed forces. This 
is contrary to everyday experience. For example, this would mean 
that a sphere resting on a smooth horizontal plane could set itself in 
motion. It is true that a tangential force may exist if the surface is 
rough, i.e. if there is friction between the particle and the surface; 
but this frictional force is inherently not a constraining force. If the 
magnitude of the constraining force is Z, and if n is the unit normal 
vector to the surface, the equations of motion become 


$$ m\,\frac{d^2\mathbf r}{dt^2} = \mathbf F + Z\,\mathbf n. \qquad (39) $$


If the equation of the surface is $G(x, y, z) = 0$, then the gradient of
the function $G$ also has the direction of the normal. We may therefore
write the constraining force as

$$ \mathbf Z = \lambda\,\mathrm{grad}\,G, \qquad (40) $$

where $\lambda$ is a scalar quantity, as yet undetermined, and depending
upon the co-ordinates. Thus the equation of motion is

$$ m\,\frac{d^2\mathbf r}{dt^2} = \mathbf F + \lambda\,\mathrm{grad}\,G, \qquad (41) $$

or, in components,

$$ m\ddot x = F_x + \lambda\,\frac{\partial G}{\partial x}, \qquad m\ddot y = F_y + \lambda\,\frac{\partial G}{\partial y}, \qquad m\ddot z = F_z + \lambda\,\frac{\partial G}{\partial z}. \qquad (41') $$

The quantity $\lambda$ has entered as a new unknown. But since the equation
of the surface $G = 0$ must also be satisfied, there are as many equations
as there are unknowns. The same is true for the case of a particle
constrained to move on a curve which is the intersection of two surfaces
$G_1 = 0$ and $G_2 = 0$. The system of equations then consists of the
equation of motion

$$ m\ddot{\mathbf r} = \mathbf F + \lambda_1\,\mathrm{grad}\,G_1 + \lambda_2\,\mathrm{grad}\,G_2, \qquad (41'') $$

containing the two unknown quantities $\lambda_1$ and $\lambda_2$, along with the
equations of the two surfaces.

Since the motion is confined to the surface at all times, every 
element of path dr is perpendicular to the constraining force. For 
this reason no work is done by the constraints, and they do not con- 
tribute to the energy balance. In particular, if we are dealing with 
conservative forces, the equation of energy has the form derived for 
the case of the free particle: 

$$ \tfrac{1}{2}\,mv^2 + U = C. \qquad (42) $$

If a given problem involves but two co-ordinates — the case of motion 
in a plane — which are further connected by the equation of a curve to 
which the motion of the particle is confined, then the energy principle 
itself is sufficient for the integration; for one co-ordinate may be 
expressed in terms of the other by using the equation of the curve and 
the resulting derivatives, giving a differential equation in a single 
variable. 

An instance of this kind is furnished by the simple, or plane mathe- 
matical pendulum. We understand this to mean a particle m fastened 
to one end of a weightless thread of length l. The thread restricts the 
motion of the particle to the inside of a sphere. We confine the dis- 
cussion to plane motion, which may be realized, for example, by 
drawing the pendulum aside a given distance and then releasing it 
from rest. The plane of the motion is then determined by the initial 
position and equilibrium position (vertical) of the thread. The path 
still remains plane if the particle is given an initial velocity along the 
tangent to the circle cut from the sphere by the above plane. 

On the other hand, the orbit is a space curve if the particle receives
an impulse the line of which does not lie in the plane. The conservation
of energy alone is not sufficient for the solution of the motion
of a space pendulum; an additional first integral of the equations of 
motion is needed. In this instance, the generalized Law of Areas 
(cf. Ex. 36, p. 89) will serve. This case will not be treated further here. 

Returning to the discussion of the plane pendulum, the conservation
of energy may be stated, using polar co-ordinates in which
the constraint is expressed simply by $r = l$, as

$$ \frac{m}{2}\,l^2\dot\phi^2 + mgl\,(1 - \cos\phi) = C, \qquad (43) $$

the potential of gravity being given by $mgz$. As long as we rule out the
case where the pendulum turns completely over, there are always two
points at which the motion reverses, and where $\dot\phi$ becomes zero. If
we call the maximum amplitude, corresponding to such a reversal
point, the angle $\alpha$, then, since $\dot\phi = 0$, (43) gives for this point

$$ mgl\,(1 - \cos\alpha) = C, $$

and the energy equation becomes

$$ \dot\phi^2 + \frac{2g}{l}\,(\cos\alpha - \cos\phi) = 0. \qquad (44) $$

The integration is not difficult for small amplitudes. If we develop
the cosines in series, and drop all terms beyond the squares, we obtain

$$ \dot\phi^2 = \frac{g}{l}\,(\alpha^2 - \phi^2), $$

from which

$$ \frac{d\phi}{\sqrt{\alpha^2 - \phi^2}} = \sqrt{\frac{g}{l}}\;dt. \qquad (45) $$

Integration gives

$$ \sin^{-1}\!\left(\frac{\phi}{\alpha}\right) = \sqrt{\frac{g}{l}}\;t + k \quad\text{or}\quad \phi = \alpha\,\sin\!\left(\sqrt{\frac{g}{l}}\;t + k\right). \qquad (46) $$

The constant of integration $k$ may be made zero by specifying that
$t = 0$ when $\phi = 0$, i.e. when the particle passes through the position
of rest. We thus have a simple harmonic vibration of frequency

$$ \nu = \frac{1}{2\pi}\sqrt{\frac{g}{l}}. \qquad (47) $$

The period is thus

$$ T = 2\pi\sqrt{\frac{l}{g}}. $$

The actual equation of motion may be obtained by differentiating
equation (43) with respect to $t$; this equation might also have been
derived directly. It is

$$ \ddot\phi + \frac{g}{l}\,\sin\phi = 0. \qquad (48) $$


The first approximation, putting $\sin\phi = \phi$, gives the differential equation
of simple harmonic motion. If the sine is developed to one additional
term, we obtain an equation exactly like that of non-harmonic
vibration, which was treated at length in § 10 (p. 10U),*

$$ \ddot\phi + \frac{g}{l}\,\phi - \frac{g}{6l}\,\phi^3 = 0. $$

The pendulum, then, represents, for moderate amplitudes, a simple case
of non-harmonic vibration. The rigorous integration for arbitrarily
large amplitudes leads to elliptic functions. Thus, if we start with
the energy equation (44) (p. 105) and replace $\cos\phi$ by $1 - 2\sin^2(\phi/2)$
and $\cos\alpha$ by $1 - 2\sin^2(\alpha/2)$, we obtain, after separation of the
variables,

$$ \frac{d\phi}{\sqrt{\sin^2(\alpha/2) - \sin^2(\phi/2)}} = 2\sqrt{\frac{g}{l}}\;dt. $$

This is the exact equation corresponding to (45) (p. 105).
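The departure of the pendulum from simple harmonic motion can be checked numerically. The following is a minimal sketch, not part of the original treatment: it integrates the equation of motion (48) with a classical Runge-Kutta step, starting from rest at $\phi = \alpha$, and compares the resulting period with the small-amplitude value $2\pi\sqrt{l/g}$. The function names, the step size and the values of $g$ and $l$ are illustrative assumptions.

```python
import math

def pendulum_period(alpha, g=9.81, l=1.0, dt=1e-4):
    """Integrate phi'' = -(g/l) sin(phi) from rest at phi = alpha (0 < alpha < pi)
    and return the period, taken as four times the first zero crossing."""
    def acc(phi):
        return -(g / l) * math.sin(phi)

    phi, omega, t = alpha, 0.0, 0.0
    while True:
        # classical fourth-order Runge-Kutta step for the pair (phi, omega)
        k1p, k1w = omega, acc(phi)
        k2p, k2w = omega + 0.5 * dt * k1w, acc(phi + 0.5 * dt * k1p)
        k3p, k3w = omega + 0.5 * dt * k2w, acc(phi + 0.5 * dt * k2p)
        k4p, k4w = omega + dt * k3w, acc(phi + dt * k3p)
        new_phi = phi + dt * (k1p + 2 * k2p + 2 * k3p + k4p) / 6
        new_omega = omega + dt * (k1w + 2 * k2w + 2 * k3w + k4w) / 6
        if new_phi <= 0.0:
            # linear interpolation for the instant at which phi passes through zero
            t_cross = t + dt * phi / (phi - new_phi)
            return 4.0 * t_cross
        phi, omega, t = new_phi, new_omega, t + dt

def small_amplitude_period(g=9.81, l=1.0):
    """Period of the simple harmonic approximation, T = 2 pi sqrt(l/g)."""
    return 2 * math.pi * math.sqrt(l / g)
```

For a small amplitude such as $\alpha = 0.1$ the two periods agree to better than one part in a thousand, while for $\alpha = \pi/2$ the integrated period is longer by roughly 18 per cent.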


The integral is elliptic, and cannot be expressed in terms of elementary functions;
it is, however, readily brought into the normal form of an elliptic integral
of the first kind. If we put

$$ \sin\frac{\phi}{2} = \sin\frac{\alpha}{2}\,\sin u, $$

we obtain

$$ \frac{du}{\sqrt{1 - \sin^2(\alpha/2)\,\sin^2 u}} = \sqrt{\frac{g}{l}}\;dt. $$

Integrating between $t = 0$ and $t = t$, with the initial condition that for $t = 0$,
$\phi = 0$ and therefore $u = 0$,

$$ \int_0^u \frac{du}{\sqrt{1 - \sin^2(\alpha/2)\,\sin^2 u}} = \sqrt{\frac{g}{l}}\;t. $$

This integral is evaluated in many tables, where it is given in terms of $u$ and
$\alpha/2$. See, for example, E. Jahnke and F. Emde, Tables of Functions (New York,
Dover Publications, 1943). The graphs on p. 61 of this book show vividly the
departures from the limiting case of $\alpha = 0$.


* There we also took into account the presence of a periodic external driving
force.

The inverse of this integral is the Jacobian function

$$ \sin u = \operatorname{sn}\!\left(\sqrt{\frac{g}{l}}\;t\right), \quad\text{i.e.}\quad \sin\frac{\phi}{2} = \sin\frac{\alpha}{2}\,\operatorname{sn}\!\left(\sqrt{\frac{g}{l}}\;t\right), $$

the modulus being $\sin(\alpha/2)$.

The function sn (sine amplitude) is given in tables, with the help of which the
course of the solution of (53) may be traced.
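In place of tables, the complete elliptic integral can be evaluated with the arithmetic-geometric mean. The sketch below is offered only as an illustration and rests on the standard identity $K(k) = \pi / \bigl(2\,\mathrm{AGM}(1, \sqrt{1 - k^2})\bigr)$, which is not derived in the text. Since a quarter period corresponds to $u = \pi/2$, the exact period is $T = 2\pi\sqrt{l/g}\,/\,\mathrm{AGM}(1, \cos(\alpha/2))$. The function names and default values are assumptions.

```python
import math

def agm(a, b, tol=1e-14):
    """Arithmetic-geometric mean of two positive numbers."""
    while abs(a - b) > tol * a:
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return a

def exact_period(alpha, g=9.81, l=1.0):
    """Exact period of the plane pendulum for amplitude 0 <= alpha < pi."""
    return 2 * math.pi * math.sqrt(l / g) / agm(1.0, math.cos(0.5 * alpha))
```

The ratio `exact_period(alpha) / exact_period(0.0)` measures the departure from the limiting case $\alpha = 0$; it tends to unity for small amplitudes and grows without limit as $\alpha$ approaches $\pi$.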

We inquire, finally, into the magnitude of the restraining force which holds the
particle in its circular path. According to Newton’s Third Law concerning the
equality of action and reaction, this force is equal to that with which the particle
pulls on the cord. Since it is normal to the given surface, it must be radial. In
the neighbourhood of the equilibrium point, the constraining force is certainly
directed towards the centre.

The magnitude is readily found by writing the equation of motion obtained
by resolving along the normal. Taking, as usual, the direction toward the centre
of curvature as the positive direction of normal acceleration, we have

$$ ml\dot\phi^2 = -mg\cos\phi + Z. $$

Putting for $\dot\phi^2$ its value from the energy equation (44) (p. 105), we obtain

$$ Z = mg\,(3\cos\phi - 2\cos\alpha). \qquad (54) $$

The constraining force $Z$ changes its sign when

$$ 3\cos\phi_m = 2\cos\alpha. $$

This equation has a root which is compatible with the condition that $\phi < \alpha$
only when $\alpha > \pi/2$. This means that as long as $\alpha$ does not exceed $\pi/2$, the thread
remains taut. Arbitrarily large amplitudes are, however, not possible with a
pendulum consisting of a particle suspended by a thread, for there is a point
(viz. where $\phi = \phi_m$) where the pendulum collapses. If we take, for example,
an amplitude $\alpha = \pi$ (this means that the particle just reaches the top of its
circle) we have

$$ \cos\phi_m = -\tfrac{2}{3}, \qquad \phi_m = 131^\circ\,49'. $$

The reason for this peculiarity is that a thread does not secure the geometric
condition $r = l$ completely; it excludes points for which $r > l$, but not points
for which $r < l$. A satisfactory mechanism is provided by replacing the thread
by a rigid, weightless rod.
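The condition for the thread to go slack can be evaluated directly from (54). The following short sketch assumes a pendulum released from rest at the amplitude $\alpha$; the function names and the default values of $m$ and $g$ are illustrative only.

```python
import math

def thread_tension(phi, alpha, m=1.0, g=9.81):
    """Constraining force Z = m g (3 cos(phi) - 2 cos(alpha)), equation (54)."""
    return m * g * (3 * math.cos(phi) - 2 * math.cos(alpha))

def slack_angle(alpha):
    """Angle phi_m at which Z changes sign, or None if the thread stays taut.

    A root with phi_m < alpha exists only for alpha > pi/2."""
    if alpha <= math.pi / 2:
        return None
    return math.acos(2 * math.cos(alpha) / 3)
```

With $\alpha = \pi$ the function `slack_angle` returns about 2.3005 radians, i.e. the angle of roughly $131^\circ\,49'$ quoted above, while for any $\alpha \le \pi/2$ it returns `None`.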

Ex. 41. For what curve is the period of plane pendulum motion independent
of the amplitude? [Introduce the length of arc $s$ as a function of $z = l(1 - \cos\phi)$.]

Ex. 42. A motor-cyclist is able to take a curve of 10 m. radius on a flat,
unbanked road at 20 km./hr. without skidding. With what speed must he ride
on the vertical inside surface of a circular cylinder of 5 m. radius, made of the
same material, in order that he be able to ride in a horizontal path without
slipping down? Recall that the traction force due to friction is proportional to
the normal pressure and independent of the speed.



CHAPTER VI 


General Theorems on the Mechanics of Systems of Particles


We represent a physical system schematically by means of a num- 
ber of particles and apply the results of Chapter V to the individual 
particles. From these results we can deduce laws referring to the 
behaviour of the system as a whole. The range of validity of these 
laws is almost unlimited, since we can, in the last analysis, consider 
any body to be composed of a system of enormously many particles — 
electrons and protons. The forces acting upon a single particle are of 
two kinds: either they originate in the other particles of the system — 
internal forces — or they have their origin outside the system — external 
forces. Newton's Third Law of the equality of action and reaction is of
especial importance here. We understand this law to have the following
meaning: If a particle $P_1$ exerts on $P_2$ an attractive force directed
toward $P_1$, then it is at once evident that $P_2$ attracts $P_1$ toward $P_2$
with the same force, i.e. the force vectors of the two internal forces
resulting from the mutual attraction are equal and opposite.


1. Theorem concerning the Motion of the Centre of Mass. 

We start with a system of $N$ particles. The $k$th particle is subject
to the following forces: a number of external forces which we replace
by their resultant $\mathbf F_k$; further, the force $\mathbf F_{1k}$ due to the presence of the
first particle, $\mathbf F_{2k}$ from the second, &c.; in general, $\mathbf F_{ik}$ from the $i$th
particle. The equation of motion for the $k$th particle is thus

$$ m_k\,\frac{d^2\mathbf r_k}{dt^2} = \mathbf F_k + \sum_i \mathbf F_{ik}. \qquad (1) $$

There are $N$ such equations, one for each particle. Imagine them all
written down and added together:

$$ \sum_k m_k\,\frac{d^2\mathbf r_k}{dt^2} = \sum_k \mathbf F_k + \sum_k\sum_i \mathbf F_{ik}. \qquad (2) $$


Since internal forces having both subscripts alike do not exist, according
to our notation, the combinations $k = i$ are to be excluded from
the double sum. Now for every force $\mathbf F_{ik}$ (the force exerted by the
$i$th particle on the $k$th) there corresponds a force $\mathbf F_{ki}$ (that exerted
by the $k$th particle on the $i$th), and these two forces are equal and
opposite. Hence the double sum vanishes, and the internal forces of
the system cancel out in the summation. There remains in the right
member only the vector sum of the external forces acting on the individual
particles. We now define the centre of mass of a system to be a
point whose radius vector $\mathbf r$ (referred to an arbitrary centre) multiplied
by the total mass of the system is equal to the vector sum of the products
of individual radius vectors of the separate particles with the
corresponding masses:

$$ M\mathbf r = \sum_k m_k\,\mathbf r_k. \qquad (3) $$

If we substitute this expression in equation (2), we have the theorem

$$ M\,\frac{d^2\mathbf r}{dt^2} = \sum_k \mathbf F_k. \qquad (4) $$


The centre of mass of a system moves as if the entire mass of the system
were concentrated there, with the resultant of the externally applied forces
acting at that point. In particular, if there are no external forces, the
centre of mass remains at rest or in a state of uniform rectilinear
motion. As is well known, this theorem is the basis of the explanation
of recoil phenomena. For example, if a shot is fired from a cannon
standing upon a smooth horizontal plane, then the gun must spring
back with a velocity such that the common centre of mass of cannon
and projectile remains in the vertical line through the point of firing;
for, neglecting friction of the gun with the ground, the only external
force is gravity, which has no horizontal component.

Since the most universal external force is that of gravity, the centre of mass
is commonly referred to as the centre of gravity. Another name for it in equally
general use is the centre of inertia. The following elementary considerations are
useful in determining this point: If $\mathbf r$ is the radius vector of the centre of gravity
of two particles $m_1$ and $m_2$, then

$$ (m_1 + m_2)\,\mathbf r = m_1\mathbf r_1 + m_2\mathbf r_2, \quad\text{or}\quad m_1(\mathbf r - \mathbf r_1) = m_2(\mathbf r_2 - \mathbf r). $$

This means that the vectors $\mathbf r - \mathbf r_1$ and $\mathbf r_2 - \mathbf r$ are parallel. But since they have
the terminus of $\mathbf r$ in common, the three points $m_1$, $m_2$ and the centre of gravity
are collinear. The position of the centre of gravity is determined by

$$ \frac{|\mathbf r - \mathbf r_1|}{|\mathbf r_2 - \mathbf r|} = \frac{m_2}{m_1}. $$

We thus have the rule: The centre of gravity of two particles $m_1$ and $m_2$ divides the
distance between the particles in the ratio of the two masses, the centre of gravity
being nearer the larger mass. If, now, a third particle be added to the system,
the centre of gravity of the set will be the centre of gravity of $m_3$ and the original
centre of gravity, where both $m_1$ and $m_2$ may be considered concentrated. It is
readily seen that the centre of gravity, found in this way, is independent of
the order in which the particles are taken. The procedure is similar for additional
particles.
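The rule of successive combination can be compared with the direct definition (3). The sketch below is illustrative only; the function names and the sample masses and positions used to exercise it are assumptions, not data from the text.

```python
def centre_of_mass(masses, positions):
    """Direct evaluation of M r = sum_k m_k r_k, equation (3)."""
    total = sum(masses)
    dim = len(positions[0])
    return tuple(
        sum(m * p[i] for m, p in zip(masses, positions)) / total
        for i in range(dim)
    )

def centre_of_mass_pairwise(masses, positions):
    """Add one particle at a time, treating the particles already combined
    as a single mass concentrated at their centre of gravity."""
    m_acc, r_acc = masses[0], tuple(positions[0])
    for m, p in zip(masses[1:], positions[1:]):
        m_new = m_acc + m
        r_acc = tuple((m_acc * a + m * b) / m_new for a, b in zip(r_acc, p))
        m_acc = m_new
    return r_acc
```

For masses 1, 2, 3 placed at (0, 0, 0), (3, 0, 0) and (0, 6, 0) both functions give the point (1, 3, 0), and the result is unchanged if the particles are taken in another order.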


Ex. 43. Two particles $m_1$ and $m_2$ are in equilibrium when at a distance $a$
apart. If the distance be altered a small amount $x$, there arise restoring forces
proportional to $x$. Calculate the characteristic frequencies of the system.


2. Angular Momentum of a System of Particles. 

According to § 5, p. 88, the angular momentum (or moment of
momentum) $\mathbf p = m[\mathbf r\,\dot{\mathbf r}]$ of a particle is equal to $2m$ times the “areal”
velocity of the radius vector. The vector sum $\mathbf P = \sum_k m_k[\mathbf r_k\,\dot{\mathbf r}_k]$, called the
total angular momentum (or moment of momentum) of a system of particles,
is one of the most important quantities in mechanics. We shall
now investigate the properties of this quantity. For this purpose we
multiply the equations of motion for a particle vectorially by $\mathbf r_k$ and
sum over all the particles. The result is

$$ \sum_k m_k\left[\mathbf r_k\,\frac{d^2\mathbf r_k}{dt^2}\right] = \sum_k [\mathbf r_k\,\mathbf F_k] + \sum_k\sum_j [\mathbf r_k\,\mathbf F_{jk}]. \qquad (5) $$

As shown at p. 89, the left member represents the time derivative of
the quantity $\sum_k m_k[\mathbf r_k\,\dot{\mathbf r}_k]$, that is, of the total angular momentum.
Further, the vector product of the radius vector to the point of application
of a force by the force vector is called the moment of the force
$\mathbf F_k$. We denote it by $\mathbf M_k$. The magnitude of $\mathbf M_k$ corresponds to the
product of force by lever arm, in relation to turning about $O$. The
total moment of the external forces is given by $\mathbf M = \sum_k \mathbf M_k$. The second
term in the right member of (5), which represents the resultant of the
moments of the internal forces, vanishes if the internal forces between
two particles have the direction of the line joining the particles, i.e. if
the forces are central. Thus, since $\mathbf F_{jk} = -\mathbf F_{kj}$, we have for any pair
of particles

$$ [\mathbf r_j\,\mathbf F_{kj}] + [\mathbf r_k\,\mathbf F_{jk}] = [(\mathbf r_k - \mathbf r_j)\,\mathbf F_{jk}]. \qquad (6) $$

But the vector product on the right vanishes, since we are assuming
that $\mathbf F_{jk}$ is in the direction of $\mathbf r_k - \mathbf r_j$. There remains, therefore,

$$ \frac{d}{dt}\sum_k m_k[\mathbf r_k\,\dot{\mathbf r}_k] = \frac{d\mathbf P}{dt} = \sum_k [\mathbf r_k\,\mathbf F_k] = \mathbf M. \qquad (7) $$

For a system of particles in which the forces between any two particles
are in the direction of the line joining these particles, the rate of change
of the total angular momentum is equal to the sum of the moments of the
applied forces.

The limitation made above is actually of little importance. From considerations
of symmetry, it is difficult to imagine a force acting between two points
which does not coincide in direction with the line joining them, for there is no
other pre-eminent direction. If the Law of Biot-Savart (p. 303) seems an exception,
it must be remembered that this law deals with the force between a magnet
pole and an elementary segment (i.e. not a point or particle) of an electrical
conductor.

In the particular case in which there are no external forces acting, 
or if the total moment of the forces vanishes, then, according to equation 
(7), the total angular momentum of the system remains constant. In 
this form the theorem explains a large variety of phenomena of every- 
day life, e.g. the method by which a child sets a swing in motion, the 
ability of a falling cat to right itself before landing, the familiar turn- 
table experiments, &c. This law finds one of its most beautiful appli- 
cations in explaining the Einstein-de Haas Effect in Magnetism (cf. 
p. 464). 
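The constancy of the total angular momentum in the absence of external forces can be illustrated numerically. The sketch below is a hypothetical example, not taken from the text: two particles in a plane are joined by a spring, so that the internal force is central, and the motion is advanced with the velocity-Verlet scheme. The masses, spring constant, rest length and initial conditions are all assumed values.

```python
import math

def simulate_two_body(steps=5000, dt=1e-3, k=4.0, rest=1.0):
    """Return the total angular momentum (z-component) recorded after every step."""
    m = [1.0, 3.0]
    r = [[-0.9, 0.0], [0.3, 0.0]]
    v = [[0.0, -0.6], [0.1, 0.2]]

    def forces():
        dx, dy = r[1][0] - r[0][0], r[1][1] - r[0][1]
        d = math.hypot(dx, dy)
        f = k * (d - rest)                  # positive when the spring is stretched
        fx, fy = f * dx / d, f * dy / d     # force on particle 0, along the joining line
        return [[fx, fy], [-fx, -fy]]       # equal and opposite (Third Law)

    def angular_momentum():
        return sum(mi * (ri[0] * vi[1] - ri[1] * vi[0])
                   for mi, ri, vi in zip(m, r, v))

    history = [angular_momentum()]
    F = forces()
    for _ in range(steps):
        for i in range(2):
            for c in range(2):
                v[i][c] += 0.5 * dt * F[i][c] / m[i]   # half kick
                r[i][c] += dt * v[i][c]                # drift
        F = forces()
        for i in range(2):
            for c in range(2):
                v[i][c] += 0.5 * dt * F[i][c] / m[i]   # second half kick
        history.append(angular_momentum())
    return history
```

Because the internal force always lies along the line joining the particles, each step leaves the sum $\sum_k m_k[\mathbf r_k\,\dot{\mathbf r}_k]$ unchanged apart from rounding, so the recorded values stay at the initial value 0.72 throughout the run.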

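In general, the value of the total angular momentum depends upon the choice
of the reference point $O$. If, however, the centre of gravity of the system is at
rest, this quantity becomes independent of the choice of $O$. If we denote the
radius vector to a new centre $O'$ by $\mathbf r_0$ and a radius vector emanating from $O'$
by $\mathbf r_k'$, then

$$ \mathbf r_k = \mathbf r_0 + \mathbf r_k'. $$

The angular momentum, referred to $O$, is

$$ \mathbf P = \sum_k m_k[\mathbf r_k\,\dot{\mathbf r}_k], $$

and referred to $O'$,

$$ \mathbf P' = \sum_k m_k[\mathbf r_k'\,\dot{\mathbf r}_k']. $$

If we put $\mathbf r_0 + \mathbf r_k'$ for $\mathbf r_k$ in $\mathbf P$, we have, on account of $\dot{\mathbf r}_0 = 0$,

$$ \mathbf P = \sum_k m_k[(\mathbf r_0 + \mathbf r_k')\,\dot{\mathbf r}_k'] = \Bigl[\mathbf r_0\,\sum_k m_k\dot{\mathbf r}_k'\Bigr] + \mathbf P'. $$

But the first term vanishes if

$$ \sum_k m_k\,\frac{d\mathbf r_k'}{dt} = 0, $$

i.e. if the centre of gravity is at rest.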
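A small numerical check of this statement may be helpful. The sketch below evaluates the total angular momentum about an arbitrary reference point; the function name and the sample data are assumptions chosen for illustration.

```python
def total_angular_momentum(masses, positions, velocities, origin=(0.0, 0.0, 0.0)):
    """Sum of m_k [r_k' v_k], with r_k' measured from the chosen origin."""
    P = [0.0, 0.0, 0.0]
    for m, r, v in zip(masses, positions, velocities):
        x, y, z = (r[i] - origin[i] for i in range(3))
        P[0] += m * (y * v[2] - z * v[1])
        P[1] += m * (z * v[0] - x * v[2])
        P[2] += m * (x * v[1] - y * v[0])
    return tuple(P)
```

With masses 2 and 1 at (0, 1, 0) and (0, -1, 0) moving with velocities (1, 0, 0) and (-2, 0, 0), the total momentum is zero and the function returns (0, 0, -4) for every choice of `origin`. If the second particle is instead at rest, the centre of gravity moves and the result changes with the reference point.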


3. Total Energy of a System of Particles. 

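Let us multiply the equation of motion of the $k$th particle scalarly
with $d\mathbf r_k/dt$, and sum over all the particles. Then

$$ \sum_k m_k\,\frac{d^2\mathbf r_k}{dt^2}\cdot\frac{d\mathbf r_k}{dt} = \sum_k \mathbf F_k\cdot\frac{d\mathbf r_k}{dt} + \sum_k\sum_j \mathbf F_{jk}\cdot\frac{d\mathbf r_k}{dt}. \qquad (8) $$

Integrating between the times $t_0$ and $t$:

$$ \left[\sum_k \frac{m_k}{2}\left(\frac{d\mathbf r_k}{dt}\right)^2\right]_{t_0}^{t} = \sum_k \int_{\mathbf r_k(t_0)}^{\mathbf r_k(t)} \mathbf F_k\cdot d\mathbf r_k + \sum_k\sum_j \int_{\mathbf r_k(t_0)}^{\mathbf r_k(t)} \mathbf F_{jk}\cdot d\mathbf r_k. \qquad (9) $$

The left member represents the total change in kinetic energy of the
system, the right member gives the work done by the internal and
external forces. But it is by no means the case that the work done
by the internal forces cancels out in calculating the energy, as one
might expect it to do in view of the results of the two preceding
sections.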

The kinetic energy may be divided into two parts, each of which
has a physical meaning. If we introduce a second co-ordinate system,
whose origin $O'$ is at the centre of gravity of the system, and if we
denote all radius vectors referred to this system by primes, we have

$$ \mathbf r_k = \mathbf r + \mathbf r_k'. $$

Then, identically,

$$ \sum_k \frac{m_k}{2}\left(\frac{d\mathbf r_k}{dt}\right)^2 = \frac{M}{2}\left(\frac{d\mathbf r}{dt}\right)^2 + \frac{d\mathbf r}{dt}\cdot\sum_k m_k\,\frac{d\mathbf r_k'}{dt} + \sum_k \frac{m_k}{2}\left(\frac{d\mathbf r_k'}{dt}\right)^2. \qquad (10) $$

The second summand on the right vanishes, however, since $\sum_k m_k\mathbf r_k'/M$
is, by equation (3) (p. 109), the radius vector of the centre of gravity,
and this, by hypothesis, is zero in the primed co-ordinates. The first
term on the right represents the kinetic energy of the system, considering
the entire mass to be concentrated at the centre of gravity.
The last term gives the kinetic energy of motion of the system referred
to the centre of gravity, when considered as at rest. Thus, we may say:
The total kinetic energy is equal to the translational kinetic energy of the
entire mass, considered concentrated at the centre of gravity, plus the
energy of motion of the parts of the system relative to the centre of gravity.
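The division of the kinetic energy into a translational part and a part relative to the centre of gravity can be verified for any set of velocities. The following sketch is illustrative; the function name and the sample values are assumptions.

```python
def kinetic_energy_split(masses, velocities):
    """Return (total, translational, relative) kinetic energies."""
    M = sum(masses)
    dim = len(velocities[0])
    # velocity of the centre of gravity
    v_cm = [sum(m * v[i] for m, v in zip(masses, velocities)) / M
            for i in range(dim)]
    total = sum(0.5 * m * sum(c * c for c in v)
                for m, v in zip(masses, velocities))
    translational = 0.5 * M * sum(c * c for c in v_cm)
    relative = sum(0.5 * m * sum((v[i] - v_cm[i]) ** 2 for i in range(dim))
                   for m, v in zip(masses, velocities))
    return total, translational, relative
```

For masses 1 and 3 with velocities (4, 0) and (0, 0) the function returns 8, 2 and 6: the translational energy 2 and the relative energy 6 add up to the total.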

We further assume that the internal forces are such that they are
derivable from a potential. The potential of the force operating
between the points $j$ and $k$ is a function of the distance between the
two points, and therefore of their co-ordinates:

$$ U_{jk} = U_{jk}(r_{jk}) = U_{jk}\!\left(\sqrt{(x_j - x_k)^2 + (y_j - y_k)^2 + (z_j - z_k)^2}\right). \qquad (11) $$


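The force acting on $k$ is obtained by taking $j$ to be fixed, and considering
$k$ to move in the potential field given by the point function
$U_{jk}$; i.e. we consider the co-ordinates of $j$ to be fixed, those of $k$ to be
variable. Then,

$$ \mathbf F_{jk} = -\left(\frac{\partial U_{jk}}{\partial x_k},\; \frac{\partial U_{jk}}{\partial y_k},\; \frac{\partial U_{jk}}{\partial z_k}\right) = -\mathrm{grad}_k\,U_{jk}; \qquad (12) $$

in like manner,

$$ \mathbf F_{kj} = -\left(\frac{\partial U_{jk}}{\partial x_j},\; \frac{\partial U_{jk}}{\partial y_j},\; \frac{\partial U_{jk}}{\partial z_j}\right) = -\mathrm{grad}_j\,U_{jk} = -\mathbf F_{jk}. \qquad (13) $$

The work done in causing small displacements of $j$ and $k$ is

$$ \mathbf F_{jk}\cdot d\mathbf r_k + \mathbf F_{kj}\cdot d\mathbf r_j = -\left(\frac{\partial U_{jk}}{\partial x_k}dx_k + \frac{\partial U_{jk}}{\partial y_k}dy_k + \frac{\partial U_{jk}}{\partial z_k}dz_k + \frac{\partial U_{jk}}{\partial x_j}dx_j + \frac{\partial U_{jk}}{\partial y_j}dy_j + \frac{\partial U_{jk}}{\partial z_j}dz_j\right) = -dU_{jk}. \qquad (14) $$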


The negative of the sum of $\mathbf F_{jk}\cdot d\mathbf r_k$ and $\mathbf F_{kj}\cdot d\mathbf r_j$ is therefore obtained by
forming the total differential of $U_{jk}$, defined as a function of the six
co-ordinates of the two points, in (11). If, then, we wish to introduce
the internal potential into the right member of equation (9) (p. 111),
we must write

$$ \sum_k\sum_j \mathbf F_{jk}\cdot d\mathbf r_k = -\tfrac{1}{2}\sum_k\sum_j dU_{jk} \qquad (k \ne j). \qquad (15) $$

It is readily seen why the factor 1/2 enters: If we start with point
1, and calculate the mutual energy $U_{jk}$ between this and all the other
points, $k$ runs from 2 to $N$; but when we take point 2, we must start
counting with 3, since the mutual effect of points 1 and 2 was already
taken into account in dealing with point 1, and so on. Thus, in extending
the summation over all combinations $j$ and $k$, we must divide
by two.
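The counting argument can be made concrete by summing a pair potential in both ways. The sketch below assumes, purely for illustration, a potential $U(r) = -1/r$ between every pair of points; neither this choice nor the function names come from the text.

```python
import math
from itertools import combinations

def pair_potential(r):
    """Hypothetical mutual potential, chosen only for the example."""
    return -1.0 / r

def energy_over_pairs(points):
    """Each pair (j, k) counted once, as in the argument starting with point 1."""
    return sum(pair_potential(math.dist(a, b))
               for a, b in combinations(points, 2))

def energy_half_double_sum(points):
    """One half of the double sum over all j and k with j != k."""
    n = len(points)
    return 0.5 * sum(pair_potential(math.dist(points[j], points[k]))
                     for j in range(n) for k in range(n) if j != k)
```

Both functions give the same internal potential energy, since the unrestricted double sum contains every pair twice.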

If the external forces have also a potential, the energy equation (9)
becomes

$$ T + \sum_k U_k + \tfrac{1}{2}\sum_k\sum_j U_{jk} = T^{(0)} + \sum_k U_k^{(0)} + \tfrac{1}{2}\sum_k\sum_j U_{jk}^{(0)} = \text{const} \qquad (j \ne k), \qquad (16) $$

where $T$ denotes the kinetic energy. The sum of the kinetic energy
and of the external and internal potential energy of a system is constant,
if the external as well as the internal forces are conservative.

Ex. 44. A solid sphere of radius $r$ and density $\rho$ rolls without slipping down
a plane inclined at an angle $\alpha$ to the horizontal. What is the velocity of its centre
after rolling a distance $a$, and how does this velocity compare with that which
would be attained by the sphere in sliding without friction the same distance?

Ex. 45. Give the theory of the central impact of two spheres, masses $m_1$ and
$m_2$, whose velocities are along the line joining their centres: (a) for the case of
perfectly elastic spheres, where no mechanical energy is lost in the impact (i.e.
converted into heat); (b) for a case of inelastic impact in which $m_2$ acquires the
internal energy e.


4. The Principle of Virtual Displacements, d’Alembert’s Principle and
the Lagrangian Equations of the First Kind. 

Let any number of forces be applied to the $k$th particle of a system.
Assume that the individual particles are completely free, i.e. there are
no constraints imposed by geometric conditions. Then the condition
that the system is not to move under the influence of the forces is
evidently the condition for equilibrium, viz. that the resultant $\mathbf F_k$ is
to vanish:

$$ \mathbf F_k = 0. $$

If we imagine each particle displaced from its rest position by a
vector $\delta\mathbf r_k$, then $\sum_k \mathbf F_k\cdot\delta\mathbf r_k$ represents the total work done in such
a virtual displacement. Since each $\mathbf F_k$ vanishes, this sum is evidently
zero. If a system of free particles is in equilibrium, the total work done
by all forces, internal and external, in a small virtual displacement is zero.
If the forces have a potential $U = \sum_k U_k$, then

$$ \sum_k \mathbf F_k\cdot\delta\mathbf r_k = -\sum_k \delta U_k = -\delta U = 0. \qquad (17) $$

The vanishing of the variation $\delta U$ means, however, that the function
has a stationary value. In a state of equilibrium the potential energy of
a system of free particles has a stationary value.

At first sight these considerations seem trivial; their utility first
appears when limitations are imposed in the form of geometric conditions;
e.g. the distance between two particles is to remain constant,
or certain particles are to be confined to given surfaces, &c. The constraining
forces, as was shown in Chap. V (p. 104), can do no work.
Consequently the principle of virtual displacements holds also for the
equilibrium of a system with constraints, provided we understand that
the $\mathbf F_k$ are the actual given forces. On the other hand, it is no longer
true that $\mathbf F_k$ must be zero; on the contrary, an applied force may be
balanced by a constraining force. For example, the force of gravity
acting on a mass which rests upon a horizontal supporting surface is
balanced by the reaction of the surface. This difference between a
free and a constrained system is expressed mathematically by the
fact that the displacements $\delta\mathbf r_k$ are no longer independent, but are
such that certain auxiliary conditions are satisfied. If we write equation
(17) in terms of the co-ordinates, which we designate by consecutive
numbers instead of by $x$, $y$, $z$, then the sum

$$ \sum_i F_i\,\delta x_i = 0 \qquad (17') $$

has $3N$ terms, for a system of $N$ particles. Each equation of condition
reduces the number of $3N$ arbitrary variables by one. For example,
if one equation of condition exists, we can express the last displacement
in terms of the others, and substitute in equation (17), which
will then contain only independent displacements. However, the
following procedure, devised by Lagrange, affords a much better
general view of the process: let the conditions be given in the form
of finite equations connecting the co-ordinates $x_i$ (holonomous systems),
which are independent of the time (scleronomous systems).

Let there be $l$ equations of condition. The $k$th of these,

$$ f_k(x_1, x_2, \ldots, x_{3N}) = 0, $$

gives upon differentiating,

$$ \sum_i \frac{\partial f_k}{\partial x_i}\,\delta x_i = 0. $$

If we multiply these equations by certain provisionally undetermined
multipliers $\lambda_k$, and add them to equation (17'), we have

$$ \sum_i \left(F_i + \sum_k \lambda_k\,\frac{\partial f_k}{\partial x_i}\right)\delta x_i = 0. $$

In this sum of $3N$ terms we can select the multipliers in such a way that
the last $l$ terms vanish. The remaining $(3N - l)$ terms then contain
only $(3N - l)$ displacements, i.e. as many as there are independent
quantities $\delta x_i$ present. However, since these are independent of
each other, their coefficients must vanish separately, in order that
the sum be zero. For we may select the purely arbitrary displacements
in such a way that, for example, all the $\delta x_i$ except $\delta x_n$ are
zero, in which case the only remaining member of the sum is

$$ \left(F_n + \sum_k \lambda_k\,\frac{\partial f_k}{\partial x_n}\right)\delta x_n = 0. $$

But since, by hypothesis, $\delta x_n$ is to be different from zero, the
expression in brackets must vanish. We thus obtain the system
of equations

$$ F_i + \sum_{k=1}^{l} \lambda_k\,\frac{\partial f_k}{\partial x_i} = 0 \qquad (i = 1, 2, \ldots, 3N). \qquad (20) $$

Despite the fact that these equations are all of the same form, they
have different origins. $l$ of these equations hold because we purposely
determine the $l$ quantities $\lambda_k$ in such way as to make them true; the
remaining $(3N - l)$ equations are true, however, because the determination
of the $\lambda_k$ in this way makes the $(3N - l)$ displacements
independent of one another. Besides these $3N$ equations, we have
also the $l$ equations of condition, so that the number of equations
is equal to $(3N + l)$, which is the same as the number of unknowns,
viz. the $3N$ co-ordinates $x_i$ of the particles, and the $l$ multipliers $\lambda_k$.
On the other hand, if we take the constraining forces $Z_i$ which embody
the geometric conditions, instead of the conditions themselves,
we then have a free system for which, from above,

$$ F_i + Z_i = 0. \qquad (21) $$

Comparison with equation (20) shows that the constraining forces are
represented by

$$ Z_i = \sum_k \lambda_k\,\frac{\partial f_k}{\partial x_i}. $$

Thus far we have been occupied only with statics, or equilibrium,
but dynamics can be treated in a quite similar way. Denoting the
resultant of the forces applied to the $k$th particle by $\mathbf F_k$, that of the
constraint forces by $\mathbf Z_k$, we have

$$ \mathbf F_k + \mathbf Z_k = m_k\,\frac{d^2\mathbf r_k}{dt^2}. $$

If a virtual displacement is again performed, then the constraints by
themselves can do no work, and we must have $\sum_k \mathbf Z_k\cdot\delta\mathbf r_k = 0$. We
thus obtain the relation known as d’Alembert's Principle:

$$ \sum_k \left(\mathbf F_k - m_k\,\frac{d^2\mathbf r_k}{dt^2}\right)\cdot\delta\mathbf r_k = 0. \qquad (23) $$

Since the equations of condition limit the freedom of the $\delta\mathbf r_k$, we
cannot immediately conclude from (23) that the individual terms of
the sum vanish. We must, in fact, proceed exactly as in statics. Again
we label the co-ordinates consecutively, remembering that each mass
occurs three times, i.e. $m_{3n} = m_{3n-1} = m_{3n-2}$. The relation (23)
then becomes

$$ \sum_i (F_i - m_i\ddot x_i)\,\delta x_i = 0. \qquad (24) $$


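Let the $l$ equations of condition again be finite equations. Differentiation
gives

$$ \sum_i \frac{\partial f_k}{\partial x_i}\,\delta x_i = 0 \qquad (k = 1, 2, \ldots, l). $$

We again multiply each of these equations by a multiplier $\lambda_k$ (for
the present undetermined), and add them to equation (24), whence
we have

$$ \sum_i \left(F_i - m_i\ddot x_i + \lambda_1\frac{\partial f_1}{\partial x_i} + \lambda_2\frac{\partial f_2}{\partial x_i} + \cdots + \lambda_l\frac{\partial f_l}{\partial x_i}\right)\delta x_i = 0. $$

Exactly as in the statical case, we can dispose of the $l$ quantities $\lambda_k$
so that the first $l$ parentheses vanish. The $(3N - l)$ remaining parentheses
contain as many displacements as there are independent variables
present. Since we may choose these $\delta x_i$ at will, the remaining
parentheses must also vanish separately. We then obtain a
system of equations known as Lagrange’s Equations of the First Kind:

$$ \begin{aligned}
m_1\ddot x_1 &= F_1 + \lambda_1\frac{\partial f_1}{\partial x_1} + \lambda_2\frac{\partial f_2}{\partial x_1} + \cdots + \lambda_l\frac{\partial f_l}{\partial x_1}, \\
&\;\;\vdots \\
m_i\ddot x_i &= F_i + \lambda_1\frac{\partial f_1}{\partial x_i} + \lambda_2\frac{\partial f_2}{\partial x_i} + \cdots + \lambda_l\frac{\partial f_l}{\partial x_i}, \\
&\;\;\vdots \\
m_{3N}\ddot x_{3N} &= F_{3N} + \lambda_1\frac{\partial f_1}{\partial x_{3N}} + \lambda_2\frac{\partial f_2}{\partial x_{3N}} + \cdots + \lambda_l\frac{\partial f_l}{\partial x_{3N}}.
\end{aligned} \qquad (26) $$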


In this case also we easily see that the terms added to $F_i$ in the right
member are the components of the constraining forces, which must be
introduced as the forces corresponding to the geometric conditions,
if we wish to treat the particles as free. The number of equations is
the same as the number of unknowns. The Lagrangian equations
contain, besides the $3N$ co-ordinates of the particles, the $l$ multipliers;
on the other hand, we have, in addition to the $3N$ Lagrange equations,
the $l$ equations of condition

$$ f_k(x_1, \ldots, x_{3N}) = 0. $$

If the integration of the entire system of equations (26) (including
the $l$ equations of condition) can be performed, we obtain the components
of the constraining forces, since the $\lambda_k$ are also known. These
forces of constraint are equal and opposite to the pressures or tensions
exerted by the particles on the guiding mechanism which secures the
geometric conditions. Such forces were studied for a single particle in
the case of the pendulum. A pertinent example, where external forces
are absent, is that of a particle whirled about at the end of a string
tied to a fixed point in a horizontal plane, the force of gravity being
balanced by the reaction of the plane. The centripetal force exerted by
the string is the equilibrant of the centrifugal force of rotation of the
particle.
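As an illustration of how the multiplier yields the constraining force, we may apply the equations of the first kind to the plane pendulum. The sketch below rests on assumptions not made in the text: the pivot is taken as origin, $z$ is measured upward, and the condition is written $f = x^2 + z^2 - l^2 = 0$, so that $m\ddot x = 2\lambda x$ and $m\ddot z = -mg + 2\lambda z$. Differentiating $f$ twice with respect to the time gives $x\ddot x + z\ddot z + \dot x^2 + \dot z^2 = 0$, from which $\lambda$ follows. The function names are hypothetical.

```python
import math

def multiplier_and_constraint_force(x, z, vx, vz, m=1.0, g=9.81):
    """Return lambda and the constraining force lambda * grad f for the
    plane pendulum with f = x^2 + z^2 - l^2 (z upward, pivot at origin)."""
    l2 = x * x + z * z
    # from x x'' + z z'' + v^2 = 0 with m x'' = 2 lam x, m z'' = -m g + 2 lam z
    lam = m * (g * z - (vx * vx + vz * vz)) / (2.0 * l2)
    return lam, (2.0 * lam * x, 2.0 * lam * z)

def tension_released_from_rest(phi, alpha, l=1.0, m=1.0, g=9.81):
    """Force toward the pivot at angle phi for a pendulum released from rest
    at alpha (requires |phi| <= alpha); the speed comes from the energy equation."""
    v = math.sqrt(2.0 * g * l * (math.cos(phi) - math.cos(alpha)))
    x, z = l * math.sin(phi), -l * math.cos(phi)
    lam, _ = multiplier_and_constraint_force(
        x, z, v * math.cos(phi), v * math.sin(phi), m, g)
    return -2.0 * lam * l   # positive when the force points toward the pivot
```

The value returned by `tension_released_from_rest` agrees with $mg(3\cos\phi - 2\cos\alpha)$, the expression found for the pendulum in Chapter V; at the lowest point of a swing released from the horizontal it equals $3mg$.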

Another way of looking at the question, convenient in many com- 
putations, is the following: The fundamental equation 



may be considered to express the equilibrium between the applied 
force F and the inertia force —mffir/dt 2 ). In this instance, however, 
Newton’s second law must be formulated to read: 

During the accelerated motion , a force —m(d^r/dt 2 ) acts on the mass m 
and there is equilibrium between applied and inertial force. This view 
formally refers dynamics back to statics, and the considerations of 
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statics may be applied to dynamics, so that we arrive again at (23). 
The difference in the two approaches may be illustrated by the
example of a spring-type weighing machine, standing on which is a
person who produces accelerated motion by flexing his knees. Without
the introduction of inertial forces we have

$$m\,\frac{d^2 z}{dt^2} = -mg + F,$$

where F is the restoring force in the spring. The other view says that
the force F' exerted on the spring must equal the resultant of the applied
forces. The latter are the gravitational force $-mg$ and the inertial
force $-m(d^2 z/dt^2)$, so that

$$F' = -mg - m\,\frac{d^2 z}{dt^2}.$$

According to the third law, F' must equal $-F$, so that both approaches
lead to the same result. While the second formulation often yields the
formal result more quickly, it must be remembered that the first one
usually takes into account the physical situation more satisfactorily.
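
The agreement of the two views can be checked with a few lines of Python. This is only an illustrative sketch: the mass, the acceleration and the function names are assumptions chosen for the example and do not come from the text.

```python
# Reading of a spring-type weighing machine, computed in the two ways
# described above. All numerical values are illustrative assumptions.

G = 9.81  # gravitational acceleration in m/s^2


def spring_force_newton(mass, accel, g=G):
    """First view: m * d2z/dt2 = -m*g + F, solved for the spring force F."""
    return mass * accel + mass * g


def force_on_spring_dalembert(mass, accel, g=G):
    """Second view: F' is the resultant of gravity and the inertial force."""
    gravity = -mass * g
    inertial = -mass * accel
    return gravity + inertial


m, a = 70.0, 2.0  # kg and m/s^2 (upward acceleration), assumed values
F = spring_force_newton(m, a)
F_prime = force_on_spring_dalembert(m, a)
print("F =", F, " F' =", F_prime, " F + F' =", F + F_prime)
```

The sum F + F' vanishes for any choice of mass and acceleration, which is the statement F' = -F of the third law.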


5. The Lagrangian Equations of the Second Kind for Arbitrary Co- 
ordinates (Generalized Co-ordinates). 

The Lagrangian equations of the First Kind refer to rectangular 
co-ordinates. However, it is often more useful to specify the positions 
of the particles by means of other parameters. This almost invariably 
permits us to regard the equations of condition as equivalent to hold- 
ing certain co-ordinates constant, and these co-ordinates then drop 
out of the conditional relationships. Although the introduction of
conditions is also possible with any system of co-ordinates, we consider
here the normal case where, in place of 3N rectangular co-ordinates
and l equations of condition, we have $f = 3N - l$ independent
parameters $q_1, q_2, \ldots, q_f$. For example, let all points of the system be
required to remain on the surface of a sphere. In this case the appropriate
co-ordinates are spherical co-ordinates $r, \phi, \theta$, and the variable
quantities are only the angles $\phi$ and $\theta$. It remains to express the equations
of motion in these arbitrary co-ordinates. For this purpose we
imagine the rectangular co-ordinates to be given as functions of the
independent $q_k$ (called the generalized co-ordinates)

$$x_i = x_i(q_1, q_2, \ldots, q_f). \tag{27}$$

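Further, we seek to replace the $x_i$, $\delta x_i$ and $\ddot{x}_i$ appearing in the
d'Alembert equation

$$\sum_i (F_i - m_i \ddot{x}_i)\,\delta x_i = 0 \tag{24}$$

by the $q_k$ and their time derivatives. Now

$$\delta x_i = \sum_k \frac{\partial x_i}{\partial q_k}\,\delta q_k, \tag{28}$$

and similarly

$$\dot{x}_i = \sum_k \frac{\partial x_i}{\partial q_k}\,\dot{q}_k. \tag{28'}$$

The expressions $\ddot{x}_i\,\partial x_i/\partial q_k$, which result from substitution of (28) in
(24), may be transformed as follows: it is verified at once that

$$\ddot{x}_i \frac{\partial x_i}{\partial q_k} = \frac{d}{dt}\left(\dot{x}_i \frac{\partial x_i}{\partial q_k}\right) - \dot{x}_i \frac{d}{dt}\left(\frac{\partial x_i}{\partial q_k}\right). \tag{29}$$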

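Now we can consider the $\dot{x}_i$ to be functions of the 2f variables $q_k$
and $\dot{q}_k$.*

Equation (28') then gives

$$\frac{\partial \dot{x}_i}{\partial \dot{q}_k} = \frac{\partial x_i}{\partial q_k}. \tag{30}$$

Then the first term in the right member of equation (29) becomes

$$\frac{d}{dt}\left(\dot{x}_i \frac{\partial x_i}{\partial q_k}\right) = \frac{d}{dt}\left(\dot{x}_i \frac{\partial \dot{x}_i}{\partial \dot{q}_k}\right) = \frac{d}{dt}\left(\frac{\partial}{\partial \dot{q}_k}\,\tfrac{1}{2}\dot{x}_i^2\right). \tag{31}$$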


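The second term may also be transformed in a way which leads to
easily interpreted expressions. We have

$$\frac{d}{dt}\left(\frac{\partial x_i}{\partial q_k}\right) = \frac{\partial^2 x_i}{\partial q_1\,\partial q_k}\,\dot{q}_1 + \frac{\partial^2 x_i}{\partial q_2\,\partial q_k}\,\dot{q}_2 + \ldots + \frac{\partial^2 x_i}{\partial q_f\,\partial q_k}\,\dot{q}_f = \frac{\partial \dot{x}_i}{\partial q_k}, \tag{32}$$

from which

$$\dot{x}_i \frac{d}{dt}\left(\frac{\partial x_i}{\partial q_k}\right) = \dot{x}_i \frac{\partial \dot{x}_i}{\partial q_k} = \frac{\partial}{\partial q_k}\,\tfrac{1}{2}\dot{x}_i^2. \tag{33}$$

Then, according to equations (28) and (29),

$$\sum_i m_i \ddot{x}_i\,\delta x_i = \sum_k \left(\frac{d}{dt}\frac{\partial}{\partial \dot{q}_k}\sum_i \tfrac{1}{2} m_i \dot{x}_i^2 - \frac{\partial}{\partial q_k}\sum_i \tfrac{1}{2} m_i \dot{x}_i^2\right)\delta q_k. \tag{33'}$$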

* The introduction of the $\dot{q}_k$ as new variables, in addition to the $q_k$, is apt to be
misunderstood, but consideration of the following example of a similar kind may
help to remove the difficulty. The only independent variable is the time, upon which
the $x_i$ and $\dot{x}_i$ depend, through the medium of the $q_k$. However, if a function $f(y, y')$
is to be differentiated with respect to the independent variable x, where y' stands for
dy/dx, we use the formula

$$\frac{d f(y, y')}{dx} = \frac{\partial f}{\partial y}\frac{dy}{dx} + \frac{\partial f}{\partial y'}\frac{dy'}{dx}.$$
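We further assume that the acting forces have potentials $U_i$, which
we imagine to be given as functions of the $q_k$. Then, with $U = \sum_i U_i$,

$$\sum_i F_i\,\delta x_i = -\sum_k \frac{\partial U}{\partial q_k}\,\delta q_k. \tag{34}$$

If we now substitute expressions (33') and (34) in the statement of
d'Alembert's principle, there results

$$\sum_k \left[\frac{d}{dt}\frac{\partial}{\partial \dot{q}_k}\sum_i \tfrac{1}{2} m_i \dot{x}_i^2 - \frac{\partial}{\partial q_k}\sum_i \tfrac{1}{2} m_i \dot{x}_i^2 + \frac{\partial U}{\partial q_k}\right]\delta q_k = 0. \tag{35}$$

But $\sum_i \tfrac{1}{2} m_i \dot{x}_i^2$ represents the total kinetic energy T of the system. Then,
since the bracketed expressions must vanish individually, on account
of the independence of the displacements $\delta q_k$, and since U depends
only upon the position co-ordinates $q_k$ and not upon the $\dot{q}_k$, we obtain
the following Lagrangian Equations of the Second Kind for
conservative forces:

$$\frac{d}{dt}\frac{\partial (T - U)}{\partial \dot{q}_k} - \frac{\partial (T - U)}{\partial q_k} = 0. \tag{36}$$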



The difference between the kinetic and potential energies, which is a
function of the time (through the position co-ordinates and their time
derivatives), is called the Lagrangian Function L of the system:

$$L = T - U. \tag{37}$$

Using this notation, equation (36) may be written

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k} = 0. \tag{38}$$
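
As a rough numerical check of the Lagrangian equation of the second kind, the sketch below treats a plane pendulum, an example assumed here for illustration, with the single generalized co-ordinate q taken as the angle from the downward vertical. The partial derivatives of L are estimated by central differences, and the equation is solved for the acceleration; the function names, parameter values and step size are all hypothetical choices.

```python
import math


def lagrangian(q, qdot, m=1.0, length=1.0, g=9.81):
    """L = T - U for a plane pendulum; q is the angle from the downward vertical."""
    kinetic = 0.5 * m * (length * qdot) ** 2
    potential = -m * g * length * math.cos(q)
    return kinetic - potential


def acceleration_from_lagrangian(L, q, qdot, h=1e-3):
    """Solve d/dt(dL/dqdot) - dL/dq = 0 for the acceleration of one co-ordinate.

    Expanding the total time derivative gives
        L_vv * qddot + L_vq * qdot - L_q = 0,
    where the subscripts denote partial derivatives (v stands for qdot).
    All derivatives are estimated by central differences of step h.
    """
    L_q = (L(q + h, qdot) - L(q - h, qdot)) / (2 * h)
    L_vv = (L(q, qdot + h) - 2 * L(q, qdot) + L(q, qdot - h)) / h**2
    L_vq = (L(q + h, qdot + h) - L(q + h, qdot - h)
            - L(q - h, qdot + h) + L(q - h, qdot - h)) / (4 * h**2)
    return (L_q - L_vq * qdot) / L_vv


q0, v0 = 0.3, 0.7  # assumed state: angle in rad, angular velocity in rad/s
print("from Lagrange's equation:", acceleration_from_lagrangian(lagrangian, q0, v0))
print("elementary result       :", -9.81 * math.sin(q0))
```

The two printed values agree to within the accuracy of the finite differences, since for this Lagrangian the equation reduces to the familiar pendulum equation.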



If, in addition to the conservative forces $F_i$, non-conservative forces
$F_i'$ are also acting, then the work done by the latter in a virtual
displacement is given by inserting the values of $\delta x_i$ from equation (28)
(p. 119):

$$\sum_i F_i'\,\delta x_i = \sum_k \left(\sum_i F_i' \frac{\partial x_i}{\partial q_k}\right)\delta q_k. \tag{39}$$

The coefficients of the $\delta q_k$ are called the components of the forces
referred to the generalized co-ordinates $q_k$, and are denoted by $F'_{q_k}$.
With the admission of non-conservative forces, the Lagrangian
equations of the second kind thus become

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k} = F'_{q_k}. \tag{40}$$
It is to be remembered that we have treated only the most common case,
i.e. the holonomous-scleronomous system defined on p. 114. For a discussion of
the cases in which the equations expressing the $x_i$ as functions of the $q_k$

(a) contain the time itself explicitly (holonomous-rheonomous system), and

(b) are given in the form of non-integrable differential equations
$\delta x_i = a_1^{(i)}\,\delta q_1 + a_2^{(i)}\,\delta q_2 + \ldots + a_f^{(i)}\,\delta q_f$ (non-holonomous system),

consult the more complete textbooks on mechanics.

Ex. 46. Set up the equations of motion, in spherical co-ordinates, of a particle 
on the surface of the earth. 


6. Generalized Momentum Co-ordinates. Hamilton’s Equations. 

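In the case of holonomous-scleronomous systems, the $x_i$ are given
as functions of the $q_k$ by finite equations which do not contain the
time itself, and the kinetic energy T is a homogeneous quadratic
function of the time derivatives of the $q_k$:

$$T = \sum_i \tfrac{1}{2} m_i \dot{x}_i^2 = \sum_i \tfrac{1}{2} m_i \left(\frac{\partial x_i}{\partial q_1}\dot{q}_1 + \frac{\partial x_i}{\partial q_2}\dot{q}_2 + \ldots + \frac{\partial x_i}{\partial q_f}\dot{q}_f\right)^2 = \tfrac{1}{2}\left(\mu_{11}\dot{q}_1^2 + 2\mu_{12}\dot{q}_1\dot{q}_2 + \ldots + \mu_{ff}\dot{q}_f^2\right), \tag{41}$$

where $\mu_{kl} = \mu_{lk} = \sum_i m_i (\partial x_i/\partial q_k)(\partial x_i/\partial q_l)$.

Now in rectangular co-ordinates, each component of the momentum
is given by

$$p_{x_i} = m_i \dot{x}_i = \frac{\partial T}{\partial \dot{x}_i}. \tag{42}$$

In analogy with this, we call the following quantities the generalized
momentum components:

$$p_k = \frac{\partial T}{\partial \dot{q}_k} = \mu_{k1}\dot{q}_1 + \mu_{k2}\dot{q}_2 + \ldots + \mu_{kf}\dot{q}_f, \tag{43}$$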

although these quantities may or may not have the dimensions of 
momentum, according to the meaning of the generalized co-ordinates. 
Since the instantaneous state of a system is uniquely characterized
by giving the position and velocity (momentum) of every point, we
speak of momentum co-ordinates instead of momentum components,
as the $q_k$ and the $\dot{q}_k$ have already been given equal rank in the
Lagrangian equations of the second kind.

According to equation (43), the momentum co-ordinates are linear
functions of the generalized velocity components $\dot{q}_k$. Since the
determinant of the system of equations does not vanish (as a more
thorough analysis shows) the system may be solved for the $\dot{q}_k$, so that,
reciprocally, the velocity components $\dot{q}_k$ are also linear functions of
the generalized momentum co-ordinates $p_k$. If we express the
equations of motion in terms of the generalized momenta, we obtain a
new form, due to Hamilton. This form furnishes the best starting-point
for many investigations in higher mechanics, especially in celestial
mechanics, and so has come to be called the canonical form of the
equations of motion. Introduction of the momentum co-ordinates
(43) into the Lagrangian equation (36) (p. 120) gives at once for
conservative forces, since U is independent of the $\dot{q}_k$,

$$\frac{dp_k}{dt} = \frac{\partial L(q_k, \dot{q}_k)}{\partial q_k}. \tag{44}$$

This equation is still unsatisfactory in that the Lagrangian function
in the right member contains the $\dot{q}_k$ instead of the $p_k$. We could try
to express the $\dot{q}_k$ in terms of the $p_k$ by means of the transformation (43),
but it is simpler to introduce a new function H of $q_k$ and $p_k$ defined by

$$H(p_k, q_k) = \sum_k p_k \dot{q}_k - L(q_k, \dot{q}_k). \tag{45}$$

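The total differential of this function is

$$dH = \sum_k \left(\frac{\partial H}{\partial p_k}\,dp_k + \frac{\partial H}{\partial q_k}\,dq_k\right) = \sum_k \left(p_k\,d\dot{q}_k + \dot{q}_k\,dp_k - \frac{\partial L}{\partial \dot{q}_k}\,d\dot{q}_k - \frac{\partial L}{\partial q_k}\,dq_k\right). \tag{46}$$

Since the potential energy depends only upon the position co-ordinates,
and not upon those of the velocity, we have

$$\frac{\partial L}{\partial \dot{q}_k} = \frac{\partial T(q_k, \dot{q}_k)}{\partial \dot{q}_k} = p_k. \tag{47}$$

As a result, the first and the third terms in the right member of equation
(46) drop out, and comparison of coefficients gives

$$\dot{q}_k = \frac{\partial H}{\partial p_k} \tag{48}$$

and

$$\frac{\partial L}{\partial q_k} = -\frac{\partial H}{\partial q_k}. \tag{49}$$

Substituting these results in the "mixed" equation (44), we obtain

$$\frac{dp_k}{dt} = -\frac{\partial H}{\partial q_k}. \tag{49'}$$

The canonical form of the equations of motion is thus

$$\frac{dp_k}{dt} = -\frac{\partial H(p_k, q_k)}{\partial q_k}, \qquad \frac{dq_k}{dt} = \frac{\partial H(p_k, q_k)}{\partial p_k}. \tag{50}$$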
For conservative forces, to which we restrict ourselves, the meaning
of H is very simple. If we form the total time derivative

$$\frac{dH}{dt} = \sum_k \frac{\partial H}{\partial q_k}\frac{dq_k}{dt} + \sum_k \frac{\partial H}{\partial p_k}\frac{dp_k}{dt}, \tag{51}$$

this derivative vanishes, on account of (50). H is thus a constant of
the system. Using equation (45) (p. 122), it is readily shown that H
is the same as the total energy E: the kinetic energy T is a
homogeneous quadratic function of the $\dot{q}_k$, according to equation (41) (p. 121).
According to Euler's Theorem on homogeneous functions, we
therefore have

$$\sum_k \dot{q}_k \frac{\partial T}{\partial \dot{q}_k} = 2T = \sum_k \dot{q}_k p_k, \tag{52}$$

as may be verified readily in this instance. It then follows from
equation (45) that

$$H(p_k, q_k) = 2T - (T - U) = T + U = E. \tag{53}$$
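
The constancy of H along a solution of the canonical equations can be illustrated numerically. The following sketch integrates the canonical equations for a linear oscillator with a classical Runge-Kutta step; the oscillator, its parameter values, the step size and the function names are assumptions made for the illustration, and the integrator is a generic numerical method, not something taken from the text.

```python
import math


def hamiltonian(q, p, m=1.0, k=4.0):
    """H = T + U for a linear oscillator (illustrative parameter values)."""
    return p * p / (2.0 * m) + 0.5 * k * q * q


def canonical_rhs(q, p, m=1.0, k=4.0):
    """Right members of the canonical equations: dq/dt = dH/dp, dp/dt = -dH/dq."""
    return p / m, -k * q


def rk4_step(q, p, dt):
    """One classical Runge-Kutta step for the pair (q, p)."""
    k1q, k1p = canonical_rhs(q, p)
    k2q, k2p = canonical_rhs(q + 0.5 * dt * k1q, p + 0.5 * dt * k1p)
    k3q, k3p = canonical_rhs(q + 0.5 * dt * k2q, p + 0.5 * dt * k2p)
    k4q, k4p = canonical_rhs(q + dt * k3q, p + dt * k3p)
    q_new = q + dt * (k1q + 2 * k2q + 2 * k3q + k4q) / 6.0
    p_new = p + dt * (k1p + 2 * k2p + 2 * k3p + k4p) / 6.0
    return q_new, p_new


def integrate(q0, p0, dt=1e-3, steps=5000):
    """Advance the state (q, p) through the given number of steps."""
    q, p = q0, p0
    for _ in range(steps):
        q, p = rk4_step(q, p, dt)
    return q, p


q_end, p_end = integrate(1.0, 0.0)
print("H at start:", hamiltonian(1.0, 0.0))
print("H at end  :", hamiltonian(q_end, p_end))
```

Within the accuracy of the integration, the two printed values of H coincide, and for this system H is the total energy T + U.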


7. Hamilton’s Principle 


Besides the Lagrangian and Hamiltonian forms of the equations 
of motion of a system in arbitrary co-ordinates, there is a third 
form which is of the greatest importance. This third statement gives 
the equations not as differential equations, but in the form of a 
stationary condition. The idea of expressing a law of nature by speci- 
fying that a certain quantity is to have an extreme value in the actual 
process, is as old as scientific thought itself. The advantage lies in 
the simplicity of the formulation and its independence of any assump- 
tions concerning the particular co-ordinate system selected. The 
derivation of such an extremal law of mechanics, from what has pre- 
ceded, is not difficult. Comparison of the Lagrangian equations of the 
second kind with the Euler differential equation of the simplest varia- 
tion problem (p. 77) shows that they are identical. In place of the 
independent variable x we now have the time, and the function 
designated by f on p. 75 is the Lagrangian function L. Thus the 
Lagrangian equations of the second kind correspond to an extreme 
value of the integral

$$W = \int_{t_0}^{t_1} L\,dt.$$

The time integral of twice the kinetic energy is called the action. We
are thus able to express the equations of motion in this manner: The
natural motion of a system (i.e. the actual motion which takes place
according to the laws of mechanics) is characterized by the fact that the time
integral of the Lagrangian function, taken between two configurations of
the system, has an extreme value. This is the celebrated Hamiltonian
Principle. The foregoing derivation offers no information as to whether
the value spoken of is a maximum or a minimum, or merely stationary,
but this question is of no consequence in what follows. In most instances
W is a minimum.


The mathematical expression of the principle is

$$\delta \int_{t_0}^{t_1} (T - U)\,dt = \delta \int_{t_0}^{t_1} (2T - E)\,dt = 0, \tag{54}$$

where E represents the total energy.
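
The stationary property can be made concrete with a small numerical experiment. The sketch below, written under assumed conditions (a particle of unit mass in uniform gravity, thrown upward so that it returns to its starting height after one second), compares the time integral of L along the true path with the same integral along neighbouring paths having the same end-points. The perturbation, the quadrature rule and all names are illustrative choices.

```python
import math


def action(path, velocity, t0, t1, m=1.0, g=9.81, n=2000):
    """Time integral of L = T - U for a particle in uniform gravity (midpoint rule)."""
    dt = (t1 - t0) / n
    total = 0.0
    for i in range(n):
        t = t0 + (i + 0.5) * dt
        kinetic = 0.5 * m * velocity(t) ** 2
        potential = m * g * path(t)
        total += (kinetic - potential) * dt
    return total


def make_paths(eps, v0=4.905, g=9.81, t1=1.0):
    """True free-fall path plus a perturbation that vanishes at both end-points."""
    def z(t):
        return v0 * t - 0.5 * g * t * t + eps * math.sin(math.pi * t / t1)

    def zdot(t):
        return v0 - g * t + eps * (math.pi / t1) * math.cos(math.pi * t / t1)

    return z, zdot


for eps in (-0.1, 0.0, 0.1):
    z, zdot = make_paths(eps)
    print("eps =", eps, " integral of L =", action(z, zdot, 0.0, 1.0))
```

For this simple system the unperturbed path (eps = 0) gives the smallest value, so the extreme value is here a minimum; the excess for the perturbed paths grows as the square of eps, as expected for a stationary point.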

If forces which are not derivable from potentials also act, then, by § 5 (p. 120)
the Lagrange equations become

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k} = F'_{q_k}. \tag{55}$$

In this case the Lagrange equations no longer represent the Euler differential
equations of a variation problem. However, they may be brought into this form
if it is possible to find a function M of the co-ordinates and their derivatives
such that

$$\frac{d}{dt}\frac{\partial M}{\partial \dot{q}_k} - \frac{\partial M}{\partial q_k} = F'_{q_k}. \tag{56}$$

In this case the function L' defined by

$$L' = L - M = T - U - M \tag{57}$$

satisfies the differential equation

$$\frac{d}{dt}\frac{\partial L'}{\partial \dot{q}_k} - \frac{\partial L'}{\partial q_k} = 0, \tag{58}$$

and this is the Euler differential equation corresponding to

$$\delta \int L'\,dt = 0. \tag{59}$$

This case occurs, for example, in the mechanics of an electron in a magnetic field.
The magnetic force (cf. p. 428) is given by $\mathbf{F}_{mag} = -(e/c)[\mathbf{v}\,\mathbf{H}]$. Since this force
is always normal to the path (the tangent to the orbit is given by v) no work is
done. It is therefore not possible to derive this force from a scalar potential.
If, however, we put $\mathbf{H} = \operatorname{curl}\mathbf{A}$, it may be shown that the function $M = (e/c)(\mathbf{A}\,\mathbf{v})$
satisfies (56). Naturally, it is not always possible to find a function M,
i.e. the system of equations (56) may not be integrable.


8.* Canonical Transformations.

* Sections marked with an asterisk may be omitted at a first reading.

It is often necessary to change from one system of variables to
another. We might proceed by introducing new position co-ordinates
by means of equations of the form

$$\bar{q}_k = \bar{q}_k(q_1, q_2, \ldots, q_f),$$

and then, using the time derivatives of these equations, express the
kinetic energy as a function of the new position co-ordinates and their
time derivatives. Finally, with the aid of the equations

$$\bar{p}_k = \frac{\partial T}{\partial \dot{\bar{q}}_k},$$

the new momentum co-ordinates could be determined. Hamilton’s
principle, however, makes it possible to solve the problem much more 
elegantly, and at the same time much more generally. Let us change 
position and momentum co-ordinates simultaneously, and inquire how
the transformation relationships must be constituted in order that the
new variables may again have the character of position and
momentum co-ordinates, i.e. that the canonical equations may still be
satisfied. For this purpose, it is necessary and sufficient that the new
variables again satisfy the Hamiltonian Principle. But, in order that this
be true, it is sufficient that the difference of the integrands of the
action, expressed in the new and in the old variables respectively,
should be the total time derivative of a function of the old and new
co-ordinates; that is, by (45) (p. 122),

$$\sum_k p_k \dot{q}_k - H(p_k, q_k) - \sum_k \bar{p}_k \dot{\bar{q}}_k + \bar{H}(\bar{p}_k, \bar{q}_k) = \frac{d}{dt} f(q_k, \bar{q}_k); \tag{60}$$

for if this equation is integrated between the limits $t_0$ and $t_1$, we obtain

$$\int_{t_0}^{t_1} \Big[\sum_k p_k \dot{q}_k - H(p_k, q_k)\Big]\,dt - \int_{t_0}^{t_1} \Big[\sum_k \bar{p}_k \dot{\bar{q}}_k - \bar{H}(\bar{p}_k, \bar{q}_k)\Big]\,dt = [f(q_k, \bar{q}_k)]_{t_1} - [f(q_k, \bar{q}_k)]_{t_0}.$$

Upon taking the variation of this expression, the right side vanishes,
since the end-points are fixed (cf. p. 78). Since the variation of the
first integral is zero, the variation of the second integral must also
vanish, which means that the new variables are again canonically
conjugate position and momentum co-ordinates. By carrying out the
differentiation in the right member of (60), and comparing coefficients,
the transformation formulae are obtained in the form

$$\begin{aligned} \bar{p}_k &= -\frac{\partial f(q_k, \bar{q}_k)}{\partial \bar{q}_k}, \\ p_k &= \frac{\partial f(q_k, \bar{q}_k)}{\partial q_k}, \\ \bar{H}(\bar{p}_k, \bar{q}_k) &= H(p_k, q_k). \end{aligned} \tag{61}$$

The $\bar{q}_k$ may be computed in terms of the $p_k$ and $q_k$ from the equations
of the second line in (61). If we then substitute these values of the
$\bar{q}_k$ in the equations of the first line, we obtain the $\bar{p}_k$ as functions of
the $p_k$ and $q_k$. These cumbersome calculations are, however, necessary
only in rare cases in this general form. The arbitrary function $f(q_k, \bar{q}_k)$
is called the generator of the transformation.

Instead of starting with a function of the old and new position
co-ordinates, we may start with a function of old position co-ordinates
and new momentum co-ordinates; this leads to relationships of
particular importance. Let us add the expression

$$-\frac{d}{dt}\sum_k \bar{p}_k \bar{q}_k$$

to the right member of equation (60). This may be done, since the
above expression vanishes when we take the variation of the time
integral, just as does the expression df/dt. We then write

$$\sum_k p_k \dot{q}_k - H(p_k, q_k) - \sum_k \bar{p}_k \dot{\bar{q}}_k + \bar{H}(\bar{p}_k, \bar{q}_k) = \frac{d f(q_k, \bar{p}_k)}{dt} - \frac{d}{dt}\sum_k \bar{p}_k \bar{q}_k \tag{62}$$

and obtain, by comparison of coefficients,

$$\begin{aligned} p_k &= \frac{\partial f(q_k, \bar{p}_k)}{\partial q_k}, \\ \bar{q}_k &= \frac{\partial f(q_k, \bar{p}_k)}{\partial \bar{p}_k}, \\ \bar{H}(\bar{p}_k, \bar{q}_k) &= H(p_k, q_k). \end{aligned} \tag{63}$$
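
A very simple canonical transformation may serve as a check on formulae of the type (61). The sketch below assumes, purely for illustration, one degree of freedom, the generator f = q Q of the old and new position co-ordinates, and a linear oscillator as the system; the new variables are written Q and P in the code. With this generator the old momentum becomes the new co-ordinate and the old co-ordinate, with reversed sign, becomes the new momentum. The code verifies that the transformed quantities obey the canonical equations formed with the Hamiltonian expressed in the new variables.

```python
def old_rhs(q, p, m=2.0, k=3.0):
    """Canonical equations for H = p^2/(2m) + k q^2/2 in the old variables."""
    return p / m, -k * q  # (dq/dt, dp/dt)


def transform(q, p):
    """Generator f(q, Q) = q*Q gives p = df/dq = Q and P = -df/dQ = -q."""
    Q = p
    P = -q
    return Q, P


def new_hamiltonian(Q, P, m=2.0, k=3.0):
    """The old Hamiltonian rewritten in the new variables (q = -P, p = Q)."""
    return Q * Q / (2.0 * m) + 0.5 * k * P * P


def new_rhs(Q, P, m=2.0, k=3.0, h=1e-6):
    """dQ/dt = dH/dP and dP/dt = -dH/dQ, estimated by central differences."""
    dH_dP = (new_hamiltonian(Q, P + h, m, k) - new_hamiltonian(Q, P - h, m, k)) / (2 * h)
    dH_dQ = (new_hamiltonian(Q + h, P, m, k) - new_hamiltonian(Q - h, P, m, k)) / (2 * h)
    return dH_dP, -dH_dQ


q, p = 0.4, 1.5                  # an arbitrary assumed state
qdot, pdot = old_rhs(q, p)
Q, P = transform(q, p)
Qdot, Pdot = new_rhs(Q, P)
# Since Q = p and P = -q, we expect dQ/dt = dp/dt and dP/dt = -dq/dt.
print("dQ/dt:", Qdot, " dp/dt:", pdot)
print("dP/dt:", Pdot, " -dq/dt:", -qdot)
```

The rates computed from the new Hamiltonian agree with those obtained by transforming the old rates, which is what is meant by saying that the new variables are again canonically conjugate.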


9.* Cyclic Variables. The Hamilton-Jacobi Differential Equation. 

If the Hamiltonian function (which represents the total energy
if the forces are conservative) does not contain a certain position
co-ordinate $q_i$, then it follows from the first of the Hamiltonian
equations that

$$\frac{dp_i}{dt} = -\frac{\partial H}{\partial q_i} = 0. \tag{64}$$

That is, the conjugate momentum co-ordinate is a constant of the
system. Such a position co-ordinate is called cyclic. The name is due
to the fact that the polar angle $\phi$ in central motion has this property.
If all the position co-ordinates are cyclic, it follows from the second
of Hamilton's equations, viz.

$$\frac{dq_k}{dt} = \frac{\partial H(p_1, p_2, \ldots, p_f)}{\partial p_k} = f_k(p_1, p_2, \ldots, p_f), \tag{65}$$
that the $dq_k/dt$, being given functions of constants of the system, are
themselves constants, and the position co-ordinates increase linearly
with time. The integration of the equations of motion may thus be
performed immediately in the case of cyclic variables. A very
important method for the integration of the equations of motion is based
on this fact. Instead of starting with Lagrange's or Hamilton's
equations, for example, and trying to integrate them, we can begin by
trying to find the generator of a canonical transformation which leads to
only cyclic variables. If we call the momentum co-ordinates, which are
constant in time, $\alpha_k$, this generator must be a function S of the old
position co-ordinates and of the new momentum co-ordinates, such
that

$$H(p_k, q_k) = \bar{H}(\alpha_k). \tag{66}$$

According to equation (63) we must put

$$p_k = \frac{\partial S(q_k, \alpha_k)}{\partial q_k} \quad \text{and} \quad \bar{q}_k \equiv \phi_k = \frac{\partial S(q_k, \alpha_k)}{\partial \alpha_k}. \tag{67}$$

The function S is, then, determined by the following differential
equation: if we substitute $\partial S/\partial q_k$ for $p_k$ in the Hamiltonian function,
we have for conservative systems

$$H\left(\frac{\partial S}{\partial q_k}, q_k\right) = E, \tag{68}$$

which is the Hamilton-Jacobi Differential Equation. Here E is the
energy of the system, and must thus be considered a constant of the
system, since the differential equations themselves do not contain
the value of the energy. We are therefore at liberty to identify E with
one of the constants $\alpha_k$, e.g. $\alpha_1$. The Hamilton-Jacobi differential
equation is of the first order, but second degree, since H is a quadratic
function of the $p_k$; the complete solution, then, contains f constants
of integration (if there are f position co-ordinates), one of which may
be taken to be E. Alternatively, we could introduce f arbitrary constants;
but in this case E must be a function of the $\alpha_1, \alpha_2, \ldots, \alpha_f$, and may
no longer be taken as an arbitrary integration constant. Thus the
solution is either

$$S(q_k, E, \alpha_2, \alpha_3, \ldots, \alpha_f) = 0; \qquad E = \alpha_1, \tag{69}$$

or more generally,

$$S(q_k, \alpha_1, \alpha_2, \alpha_3, \ldots, \alpha_f) = 0; \qquad E = E(\alpha_1, \alpha_2, \ldots, \alpha_f). \tag{70}$$
In the former case, we see by (65) that all of the $d\bar{q}_k/dt$ with the
exception of $d\bar{q}_1/dt$ are zero, since* E = H.

There is an intimate connexion between the function S introduced
here as the generator of a transformation to cyclic variables and the
action function (Hamilton's characteristic function). During the
course of the motion, the $\alpha_k$ are constant, and so their differentials
are zero. This means that the differential of S is given by

$$dS = \sum_k \frac{\partial S}{\partial q_k}\,dq_k = \sum_k p_k\,dq_k = \sum_k p_k \dot{q}_k\,dt.$$

Now by equation (52) (p. 123),

$$\sum_k p_k \dot{q}_k = 2T,$$

i.e.

$$dS = 2T\,dt \tag{72}$$

and

$$S - S_0 = \int_{t_0}^{t} 2T\,dt. \tag{73}$$

On the other hand, the action function is, by § 7 (p. 123),

$$W = \int 2T\,dt - Et. \tag{74}$$

Therefore

$$W = S - Et. \tag{75}$$

Thus S represents the part of the action W which does not contain
the time explicitly. On account of the relationship (75), S is also
known as the contracted action function.


10.* Periodic and Multiply Periodic Systems. Angle Variables; the
Angle Variables of Keplerian Motion.

We consider first a system having a single position co-ordinate x,
e.g. a linear oscillator. The system is said to be periodic with period
T, if the equation

$$x(t + nT) = x(t)$$

is satisfied when n is any whole number. According to § 9, a cyclic
variable $\phi$ increases linearly with the time, i.e.

$$\phi = \gamma t, \qquad t = \frac{\phi}{\gamma}.$$

For this reason, the co-ordinate x is also a periodic function of the
cyclic co-ordinate $\phi$. We then call the cyclic co-ordinate $\phi$ an angle
co-ordinate w if the period is unity, i.e. when

$$x(w + 1) = x(w). \tag{76}$$

* It is impossible to avoid the double designation of the energy by E and by H,
for the following reason: $H(p_k, q_k)$ signifies the Hamiltonian function of the position
and momentum co-ordinates. For conservative systems, this is identical with the total
energy E, which is a constant of the system. On passing to cyclic variables, H will
also depend only on quantities constant in time. In this case we may write E in place
of H.
In the simple case of a point moving uniformly around a circle, the
polar angle $\phi$ is a cyclic variable, but the angle variable is the quantity
$w = \phi/2\pi$, for in the latter quantity the rectangular co-ordinate x
is periodic with period unity. The momentum co-ordinate
conjugate to an angle co-ordinate is called an action variable, and is
designated by J.

The advantage of using angle and action variables lies in the
fact that we obtain the frequency of the system immediately by
differentiation of the Hamiltonian function; for if H is given as
a function of the action variables (H cannot be a function of
the conjugate position co-ordinates, since every angle co-ordinate is
cyclic), then

$$\frac{dw}{dt} = \frac{\partial H}{\partial J} = \gamma, \quad \text{i.e.} \quad w = \gamma t + \delta. \tag{77}$$

On the other hand, every periodic function of frequency $\nu$ is
representable, subject to conditions as to continuity which are by no means
onerous, by a Fourier series of the form

$$x = \sum_k c_k\,e^{2\pi i k \nu t}. \tag{78}$$

The series is a periodic function of the quantity $\nu t$, with period unity.
But w is an angle variable by hypothesis, and so x is also a periodic
function of $\gamma t$, with period 1, i.e. $\gamma$ is identical with $\nu$. What constant
of the system must be introduced as a new momentum co-ordinate in
order to make the conjugate position co-ordinate not only a cyclic
variable but also an angle variable? At any rate, the generator of the
transformation must satisfy the Hamilton-Jacobi differential equation,
and is therefore again denoted by S. If S is known as a function of q
and J, then we have, as in § 9,

$$p = \frac{\partial S(q, J)}{\partial q} \quad \text{and} \quad w = \frac{\partial S(q, J)}{\partial J}.$$

On account of the constancy of J, the increase of w during the course
of the motion is

$$dw = \frac{\partial^2 S(q, J)}{\partial q\,\partial J}\,dq.$$

Our requirement that w be an angle variable may be expressed as
follows: when the system has returned to its former state, w must
have increased by 1. If we integrate the increase of w over a period,
we must have

$$1 = \oint dw = \oint \frac{\partial^2 S}{\partial q\,\partial J}\,dq = \frac{\partial}{\partial J}\oint \frac{\partial S}{\partial q}\,dq = \frac{\partial}{\partial J}\oint p\,dq. \tag{81}$$
This requirement is satisfied if we introduce the so-called phase
integral

$$J = \oint p\,dq \tag{82}$$

as a constant of the system. The symbol $\oint$ denotes integration over
a period of the system, that is, over a complete range of variability
of the co-ordinate q.

According to the above considerations, the process of finding the
angle variables amounts to the following: the Hamiltonian function
(i.e. for conservative systems, the energy) is given in terms of any
arbitrary co-ordinates p and q. Replace p in H(p, q) by dS/dq, and
obtain the Hamilton-Jacobi Differential Equation (68) (p. 127), which
becomes an ordinary differential equation if there is but one variable.
If this equation is solved, or at least reduced to a quadrature, form the
integral

$$J = \oint \frac{\partial S}{\partial q}\,dq \tag{83}$$

over a complete period of q. The integral becomes a function of the
integration constant E. If the equation is solved for E, we obtain,
on the one hand, the Hamiltonian function as a function of the action
variable, and therefore the frequency is given as

$$\nu = \frac{dE}{dJ} = \frac{\partial H}{\partial J}. \tag{84}$$

On the other hand, by substituting J in the integral of the Hamilton-
Jacobi equation, we can complete the transformation to angle variables.

Let us illustrate the process with the simple example of the linear harmonic
oscillator. We have [cf. Chap. V, § 7, p. 93],

$$T = \tfrac{1}{2} m \dot{x}^2, \qquad U = \tfrac{1}{2} k x^2,$$

that is,

$$p = m\dot{x}, \qquad T = \frac{p^2}{2m}.$$

The Hamilton-Jacobi equation thus becomes

$$\frac{1}{2m}\left(\frac{dS}{dx}\right)^2 + \frac{k}{2}\,x^2 = E, \qquad \frac{dS}{dx} = \sqrt{2mE - mkx^2}.$$

Since p = dS/dx must remain real, we can find the limits of x at once: x must
extend from $x_{min} = -\sqrt{2E/k}$ to $x_{max} = +\sqrt{2E/k}$. As we go from $x_{min}$ to $x_{max}$,
x increases, so that $\dot{x}$, and therefore p and dS/dx, are positive. On the return,
x decreases, and dS/dx has the negative sign. The complete integral

$$J = \oint \sqrt{2mE - mkx^2}\,dx = \sqrt{mk}\oint \sqrt{\frac{2E}{k} - x^2}\,dx$$

thus represents $\sqrt{mk}$ times the area of a circle of radius $\sqrt{2E/k}$. That is,

$$J = \frac{2E}{k}\,\pi\sqrt{mk} \quad \text{or} \quad E = \frac{J}{2\pi}\sqrt{\frac{k}{m}}.$$

From this result, the frequency is

$$\nu = \frac{\partial H}{\partial J} = \frac{1}{2\pi}\sqrt{\frac{k}{m}},$$

as already derived on p. 94 by elementary methods. Indefinite integration of
dS/dx, and the replacing of E by

$$\frac{J}{2\pi}\sqrt{\frac{k}{m}}$$

yields the generator S of the transformation to angle variables, and
differentiation with respect to J gives w. The calculation is not troublesome, since the
integral

$$\int \sqrt{2mE - mkx^2}\,dx$$

is easily evaluated as an indefinite integral. The result is

$$w = \frac{1}{2\pi}\sin^{-1}\left(x\sqrt{\frac{\pi\sqrt{mk}}{J}}\right). \tag{85}$$

The Hamilton-Jacobi method appears much more complicated here than the
direct integration, as carried out in Chap. V, § 7. As a matter of fact, the full
value of the Hamilton-Jacobi method first becomes evident in connexion with
complicated systems in Celestial Mechanics, or Atomic Mechanics, where it
proves to be an indispensable aid.
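
The phase integral of the oscillator can also be evaluated by direct numerical quadrature, which gives an independent check on the relation between E, J and the frequency. The following sketch is illustrative only: the values of m and k, the midpoint rule and the function names are assumptions, and the frequency is estimated from a finite difference of J with respect to E.

```python
import math


def phase_integral(E, m=1.0, k=4.0, n=20000):
    """J = closed integral of p dx for the linear oscillator (midpoint rule).

    The outward and return halves contribute equally, hence the factor 2.
    """
    x_max = math.sqrt(2.0 * E / k)
    dx = 2.0 * x_max / n
    half = 0.0
    for i in range(n):
        x = -x_max + (i + 0.5) * dx
        half += math.sqrt(2.0 * m * E - m * k * x * x) * dx
    return 2.0 * half


def frequency(E, m=1.0, k=4.0, dE=1e-3):
    """nu = dE/dJ, estimated from a central difference of J(E)."""
    dJ = phase_integral(E + dE, m, k) - phase_integral(E - dE, m, k)
    return 2.0 * dE / dJ


E = 1.5  # assumed energy
print("J numerical    :", phase_integral(E))
print("J from formula :", 2.0 * math.pi * E * math.sqrt(1.0 / 4.0))
print("nu numerical   :", frequency(E))
print("nu from formula:", math.sqrt(4.0 / 1.0) / (2.0 * math.pi))
```

The numerical values agree with the closed expressions to within the accuracy of the quadrature, and the frequency does not depend on the energy chosen, as is characteristic of the harmonic oscillator.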

The application of these principles to systems of more than one
degree of freedom requires development of the concept of multiply
periodic (or conditionally periodic) systems, which occur when
there are several degrees of freedom. Let the rectangular co-ordinates
$x_k$ be periodic functions of the angle variables $w_k$ with period 1, i.e.

$$x_k(w_1 + 1, w_2 + 1, w_3 + 1, \ldots) = x_k(w_1, w_2, w_3, \ldots), \tag{86}$$

and generally, for whole numbers $n_k$,

$$x_k(w_1 + n_1, w_2 + n_2, \ldots) = x_k(w_1, w_2, \ldots). \tag{87}$$

Just as Fourier's series is the most general representation of periodic
functions of one variable, so the general analytical expression for
functions of several variables, having as many periods as there are
variables, is the multiple Fourier series of the form

$$x_k = \sum_l \sum_m \sum_p \ldots C^k_{lmp\ldots}\,e^{2\pi i (n_l w_1 + n_m w_2 + n_p w_3 + \ldots)}. \tag{88}$$

Since the $w_k$ increase linearly with the time, we may substitute the
expressions $\nu_k t + \delta_k$ for $w_k$ in the Fourier series. Taking the factors
$e^{2\pi i n_k \delta_k}$ originating in the phase constants into the coefficients, we
then have

$$x_k = \sum_l \sum_m \sum_p \ldots D^k_{lmp\ldots}\,e^{2\pi i (n_l \nu_1 + n_m \nu_2 + n_p \nu_3 + \ldots)t}. \tag{89}$$

This multiple Fourier series no longer represents a periodic function
of the time. It is true that any one factor, e.g. $e^{2\pi i n_l \nu_1 t}$, returns to its
initial value after $T_1 = 1/\nu_1$ sec; but this term is multiplied by other
factors $e^{2\pi i n_m \nu_2 t}$ which certainly do not return to their old values
since the separate $\nu_k$ are taken to be independent of each other. A
true periodicity in time can be obtained only by imposing additional
conditions on the $\nu_k$, e.g. the condition $\nu_1 = \nu_2 = \nu_3 = \nu_k$.

For this reason, multiply periodic systems are also called con- 
ditionally periodic. A simple example of a conditionally periodic system 
is a particle performing harmonic vibrations along the x- and y- axes, 
the two vibrations having incommensurable frequencies. As shown 
on p. 60, the resulting orbit is a so-called Lissajous figure which never 
closes, if the ratio of the two frequencies is not a rational number, but 
which, in the course of time, covers all points within a given rectangle. 
All orbits of conditionally periodic motions are of the type of the 
Lissajous figure; a certain portion of space is densely filled, yet the 
orbit is not closed, and thus a true periodicity in time does not exist. 
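
The condition for a true periodicity can be illustrated for two vibrations whose frequencies stand in a rational ratio. The sketch below is an assumption-laden illustration: it represents the two frequencies exactly as fractions, which is only possible in the commensurable case, and computes the shortest time after which both vibrations have completed a whole number of cycles. The function name is hypothetical. For an irrational ratio no such time exists, and the figure never closes.

```python
from fractions import Fraction
from math import gcd


def common_period(nu_x, nu_y):
    """Shortest time after which both vibrations repeat, for rational frequencies.

    The frequencies are given as integers or Fractions (an illustrative
    restriction). The result is the least common multiple of the two
    periods 1/nu_x and 1/nu_y.
    """
    tx, ty = 1 / Fraction(nu_x), 1 / Fraction(nu_y)
    # For fractions a/b and c/d in lowest terms, lcm = lcm(a, c) / gcd(b, d).
    num = tx.numerator * ty.numerator // gcd(tx.numerator, ty.numerator)
    den = gcd(tx.denominator, ty.denominator)
    return Fraction(num, den)


print(common_period(2, 3))                            # frequencies 2 and 3 per second
print(common_period(Fraction(3, 2), Fraction(5, 4)))  # frequencies 3/2 and 5/4 per second
```

In the second case the figure closes only after the first vibration has completed six cycles and the second five; as the ratio of the frequencies is approximated by fractions with larger and larger terms, the closing time grows without limit.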

Conditionally periodic motions occur in general if the Hamilton-Jacobi
equation (68) (p. 127) can be solved by "separation of the
variables", the only feasible practical method, assuming that the
system has properties of periodicity at all. By separation of the variables
we mean the solution of the equation by putting, in those cases where
it is possible,

$$S(q_1, q_2, q_3, \ldots, q_f) = S_1(q_1) + S_2(q_2) + S_3(q_3) + \ldots + S_f(q_f), \tag{90}$$

so that

$$p_k = \frac{\partial S}{\partial q_k} = \frac{dS_k}{dq_k}, \tag{91}$$

and the equation may be separated into a number of ordinary
differential equations of the form

$$\left(\frac{dS_k}{dq_k}\right)^2 + f(q_k) = \alpha_k, \tag{92}$$

the energy E becoming a function of the constants of integration,

$$E = E(\alpha_1, \alpha_2, \alpha_3, \ldots, \alpha_f). \tag{93}$$


The search for the angle and action variables proceeds just as in
the case of one degree of freedom treated above: let the angle and
action variables which are sought be denoted by $w_k$ and $J_k$
respectively, and let S be the generator of the transformation, as a function
of the old position co-ordinates $q_k$ and the new momentum co-ordinates
$J_k$.

Imagine all the position co-ordinates $q_k$ except $q_h$ held constant.
Since the action variables are constants, the change in the angle
variables corresponding to a change in $q_h$ is

$$d_h w_k = \frac{\partial w_k}{\partial q_h}\,dq_h.$$

Now, since

$$w_k = \frac{\partial S(q_k, J_k)}{\partial J_k},$$

the above expression becomes

$$d_h w_k = \frac{\partial^2 S}{\partial q_h\,\partial J_k}\,dq_h.$$

We now postulate the separability of the Hamilton-Jacobi equation
in the co-ordinates $q_k$, which we therefore call the separation variables.
Thus

$$d_h w_k = \frac{\partial}{\partial J_k}\left(\frac{\partial S_h}{\partial q_h}\right) dq_h.$$

The change in $w_k$ corresponding to passage through the complete range
of variation of $q_h$ is

$$\Delta_h w_k = \frac{\partial}{\partial J_k}\oint \frac{\partial S_h}{\partial q_h}\,dq_h. \tag{97}$$

If we now introduce the phase integrals

$$J_h = \oint \frac{\partial S_h}{\partial q_h}\,dq_h = \oint p_h\,dq_h \tag{98}$$

as action variables, then

$$\Delta_h w_k = \begin{cases} 1 & \text{for } h = k, \\ 0 & \text{for } h \neq k. \end{cases}$$

But this means that the variable of separation $q_k$ is a periodic function
of $w_k$, with period unity. While it is true that each of the variables $q_k$
is simply periodic, the periods of the separate $q_k$ are different, and so
the system is multiply periodic. Hence the rectangular co-ordinates,
which may depend on the $q_k$ in many ways, become multiply periodic
functions of the $w_k$. The variables $w_k$ and $J_k$, introduced in this manner,
thus fulfil all the conditions imposed above on angle and action
variables.

If the energy is expressed in terms of the phase integrals rather
than the constants of integration $\alpha_k$, the frequencies of the system
follow by differentiation since, according to (65) (p. 126), and the above
discussion,

$$\nu_k = \frac{dw_k}{dt} = \frac{\partial H}{\partial J_k}.$$

Let us illustrate the method with the example of Keplerian motion treated 
on p. 90. Since the force is a function only of the distance r from the centre of 
attraction, three-dimensional polar co-ordinates are suggested. The line element is 
given by 

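$$ds^2 = dr^2 + r^2\,d\theta^2 + r^2 \sin^2\theta\,d\varphi^2. \tag{100}$$

Then

$$T = \tfrac{1}{2} m \left(\frac{ds}{dt}\right)^2 = \tfrac{1}{2} m \left(\dot r^2 + r^2 \dot\theta^2 + r^2 \sin^2\theta\,\dot\varphi^2\right), \tag{101}$$

and the momentum co-ordinates become

$$p_r = m \dot r, \qquad p_\theta = m r^2 \dot\theta, \qquad p_\varphi = m r^2 \sin^2\theta\,\dot\varphi. \tag{102}$$

Expressed in terms of these co-ordinates, the kinetic energy becomes

$$T = \frac{1}{2m} \left( p_r^2 + \frac{p_\theta^2}{r^2} + \frac{p_\varphi^2}{r^2 \sin^2\theta} \right). \tag{103}$$

The potential energy for the gravitational force between $m$ and the central mass
$M$ is

$$V = -\frac{G m M}{r} = -\frac{b}{r}. \tag{104}$$

Replacing the $p_k$ by $\partial S/\partial q_k$ in the Hamilton-Jacobi differential equation, we
have

$$\frac{1}{2m} \left[ \left(\frac{\partial S}{\partial r}\right)^2 + \frac{1}{r^2} \left(\frac{\partial S}{\partial \theta}\right)^2 + \frac{1}{r^2 \sin^2\theta} \left(\frac{\partial S}{\partial \varphi}\right)^2 \right] - \frac{b}{r} = E. \tag{105}$$

We put

$$S = S_r(r) + S_\theta(\theta) + S_\varphi(\varphi), \tag{106}$$

and obtain

$$\frac{1}{2m} \left[ \left(\frac{dS_r}{dr}\right)^2 + \frac{1}{r^2} \left(\frac{dS_\theta}{d\theta}\right)^2 + \frac{1}{r^2 \sin^2\theta} \left(\frac{dS_\varphi}{d\varphi}\right)^2 \right] - \frac{b}{r} = E. \tag{107}$$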

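We can now separate the equation into three equations with $\alpha_\varphi$ and $\alpha_\theta$ as
constants of integration:

$$\frac{dS_\varphi}{d\varphi} = \alpha_\varphi, \tag{108}$$

$$\left(\frac{dS_\theta}{d\theta}\right)^2 + \frac{\alpha_\varphi^2}{\sin^2\theta} = \alpha_\theta^2, \tag{109}$$

$$\left(\frac{dS_r}{dr}\right)^2 + \frac{\alpha_\theta^2}{r^2} = 2m \left(E + \frac{b}{r}\right), \tag{110}$$

or

$$p_\varphi = \alpha_\varphi, \tag{108'}$$

$$p_\theta = \pm\sqrt{\alpha_\theta^2 - \frac{\alpha_\varphi^2}{\sin^2\theta}}, \tag{109'}$$

$$p_r = \pm\sqrt{2mE + \frac{2mb}{r} - \frac{\alpha_\theta^2}{r^2}}. \tag{110'}$$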


These equations are somewhat more general than (92) and (93), since (109) and
(110) contain the integration constants of (108) and (109) respectively. The
subsequent procedure, however, is the same as above. By the second form, the
integration is reduced to quadratures. In order to calculate the phase integrals
$J$, the limits of the variables must be determined; the second form of the
equations is adequate for this purpose. Since $\alpha_\varphi$ is constant, $p_\varphi$, and therefore $\dot\varphi$,
always have the same algebraic sign. If the system is conditionally periodic at
all, then a complete period of $\varphi$ is from $0$ to $2\pi$. Then

$$J_\varphi = \int_0^{2\pi} \alpha_\varphi\,d\varphi = 2\pi \alpha_\varphi. \tag{111}$$


The co-latitude $\theta$ behaves differently. Since $p_\theta$ must remain real and finite, the
limits are given by

$$\sin\theta_{\max} = \sin(\pi - \theta_{\min}) = \frac{\alpha_\varphi}{\alpha_\theta}. \tag{112}$$

With increasing $\theta$, the positive sign of the radical is to be taken; for decreasing
$\theta$, the negative sign, as explained in detail for the case of the linear oscillator
(p. 130). The value of the integral taken from $\theta_{\min}$ to $\theta_{\max}$ and back to $\theta_{\min}$ is

$$J_\theta = \oint \sqrt{\alpha_\theta^2 - \frac{\alpha_\varphi^2}{\sin^2\theta}}\,d\theta = 2\pi (\alpha_\theta - \alpha_\varphi), \tag{113}$$


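as may be calculated readily. Then

$$\alpha_\theta = \frac{J_\theta + J_\varphi}{2\pi}.$$

If we substitute this value of $\alpha_\theta$ in the expression for $dS_r/dr$, we obtain

$$J_r = \oint \sqrt{2mE + \frac{2mb}{r} - \frac{(J_\theta + J_\varphi)^2}{4\pi^2 r^2}}\,dr. \tag{114}$$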

If the motion is periodic or conditionally periodic, then, here too, the zeros of
the expression under the sign of the square root, or radicand, determine a minimum
and a maximum of $r$, and these quantities must be positive, since the radius vector
is always positive. The solution of the quadratic in $r$, obtained by setting the
radicand equal to zero, gives two positive values of $r$ only if $E$ is negative. This
is in harmony with the result of Chap. V, § 6, p. 92, that the orbit is hyperbolic
for $E$ positive (in which case there is no finite extreme value of $r$), while
negative total energy leads to the periodic elliptic orbit. With the assumption
that $E$ is negative, the integral is to be extended from $r_{\min}$ to $r_{\max}$ and back to
$r_{\min}$ with the opposite sign for the radical. The evaluation of the integral by
elementary means is somewhat cumbersome, but possible; however, the result
is obtained much more readily by complex integration (cf. p. 73), and is

$$J_r = -J_\theta - J_\varphi + \frac{2\pi m b}{\sqrt{-2mE}}. \tag{115}$$
From this we have

$$E = -\frac{2\pi^2 m b^2}{(J_r + J_\theta + J_\varphi)^2}, \tag{116}$$

which gives the energy $E$ in terms of the action variables:

$$E = H(J_r, J_\theta, J_\varphi).$$

The frequencies may be determined at once by differentiation,

$$\nu_r = \frac{\partial H}{\partial J_r}, \qquad \nu_\theta = \frac{\partial H}{\partial J_\theta}, \qquad \nu_\varphi = \frac{\partial H}{\partial J_\varphi}. \tag{117}$$

Here we have the special case where

$$\nu_r = \nu_\theta = \nu_\varphi, \tag{118}$$

since the energy depends only upon the sum $J_r + J_\theta + J_\varphi$. This is in keeping
with the fact that the elliptic orbit is closed, and hence that we are dealing with
a true time-periodicity.
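
The phase integrals of the Kepler problem can be checked numerically. The following is a minimal sketch, not part of the original text: the function names, the sample values of $m$, $b$, $E$, $\alpha_\theta$, $\alpha_\varphi$ and the cosine substitution used for the quadrature are all illustrative assumptions. It evaluates $J_\theta$ and $J_r$ by direct integration between the turning points and compares them with the closed forms quoted above.

```python
import math

def phase_integral(radicand, q_min, q_max, n=20000):
    """Integral of sqrt(radicand(q)) dq from q_min to q_max and back.

    The substitution q = c - d*cos(t) smooths the integrand at the turning
    points, so a plain midpoint rule in t is sufficient (an assumed choice).
    """
    c = 0.5 * (q_min + q_max)
    d = 0.5 * (q_max - q_min)
    h = math.pi / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * h
        q = c - d * math.cos(t)
        total += math.sqrt(max(radicand(q), 0.0)) * d * math.sin(t)
    return 2.0 * total * h  # factor 2: outward and return path

def kepler_actions(m, b, E, a_theta, a_phi):
    """Return (J_r, J_theta, J_phi) for V = -b/r with E < 0."""
    # phi runs once from 0 to 2*pi with constant momentum a_phi.
    J_phi = 2.0 * math.pi * a_phi
    # theta oscillates between theta_min and pi - theta_min.
    theta_min = math.asin(a_phi / a_theta)
    J_theta = phase_integral(
        lambda th: a_theta**2 - a_phi**2 / math.sin(th)**2,
        theta_min, math.pi - theta_min)
    # Turning points of r: roots of 2mE r^2 + 2mb r - a_theta^2 = 0.
    A, B, C = 2 * m * E, 2 * m * b, -a_theta**2
    root = math.sqrt(B * B - 4 * A * C)
    r_min, r_max = sorted(((-B + root) / (2 * A), (-B - root) / (2 * A)))
    J_r = phase_integral(
        lambda r: 2 * m * E + 2 * m * b / r - a_theta**2 / r**2,
        r_min, r_max)
    return J_r, J_theta, J_phi

def energy_from_actions(m, b, J_r, J_theta, J_phi):
    """Energy as a function of the sum of the three phase integrals."""
    return -2.0 * math.pi**2 * m * b**2 / (J_r + J_theta + J_phi)**2

# Illustrative values only (arbitrary units).
m, b, E = 1.0, 1.0, -0.3
a_theta, a_phi = 0.8, 0.5
J_r, J_theta, J_phi = kepler_actions(m, b, E, a_theta, a_phi)
print(J_theta, 2 * math.pi * (a_theta - a_phi))
print(J_r, 2 * math.pi * (m * b / math.sqrt(-2 * m * E) - a_theta))
print(energy_from_actions(m, b, J_r, J_theta, J_phi), E)
```

Each printed pair should agree to within the quadrature error, which illustrates that the energy depends on the three phase integrals only through their sum.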



Fig. 1


Since we may set up as many phase integrals as there are co-ordinates, and 
since the energy, in general, depends upon these integrals individually, one 
should expect as many independent frequencies as there are degrees of freedom 
— in the present case, three. If the number of independent frequencies is smaller, 
the system is said to be degenerate. The difference between the number of degrees 
of freedom and the number of frequencies is called the degree of degeneracy. The 
Keplerian motion is thus doubly degenerate. In this very frequently occurring 
case of degeneracy, the quantities $J_r$, $J_\theta$, $J_\varphi$ are not yet the correct action
variables. If we remove the degeneracy by introducing additional small forces
(“perturbations”), we must insist that some of the angle variables of the
resulting non-degenerate system pass over into constants of the degenerate system
when the perturbing forces are reduced to zero. This is not the case, however,
for the angle variables which correspond to $J_r$, $J_\theta$, $J_\varphi$. The true angle variables
may be obtained by starting with a non-degenerate system and allowing the
perturbing forces to vanish. These variables, which are known in astronomy as
the Delaunay elements, have come to be of great importance. We quote them
here without verification: the first action variable is the sum

$$J_1 = J_r + J_\theta + J_\varphi.$$
The associated angle variable is the mean anomaly $M$, which is related to the
eccentric anomaly $u$, occurring in the parametric representation of the ellipse
$x = a \cos u$, $y = b \sin u$, by the relationship

$$M = (u - e \sin u),$$

where $e$ is the eccentricity. The second action variable is $2\pi$ times the total
angular momentum: $J_2 = 2\pi P$. The associated position co-ordinate is given
by $1/2\pi$ times the angle between perihelion and the line of nodes, the latter being
the line of intersection of the orbital plane and the $xy$-plane of a fixed co-ordinate
system. This co-ordinate is constant for an unperturbed Kepler ellipse. The
third momentum co-ordinate $J_3$ is $2\pi$ times the $z$-component of the angular
momentum, i.e. $J_3 = 2\pi P_z$. The associated position co-ordinate $w_3$ is $1/2\pi$
times the angle between the line of nodes and the $x$-axis; this also remains fixed
for an unperturbed Kepler ellipse (fig. 1).



CHAPTER VII 


The Mechanics of Rigid Bodies 

1. Selected Topics in the Kinematics of Rigid Bodies. 

(a) The number of degrees of freedom of a rigid body 

A rigid body is understood to be a system of particles, the indi- 
viduals of which are rigidly inter-connected, i.e. always maintain the 
same distance from each other. We therefore leave out of consideration 
deformations which occur in all actual physical bodies. 

The position of a rigid body with respect to an arbitrary co-or- 
dinate system fixed in space is determined if the position of the origin, 
and the orientation, of a co-ordinate system fixed in the body are 
given. In order to fix the origin of the latter co-ordinate system we 
need three co-ordinates; to specify the orientation of this system, we 
may use the direction cosines of the axes referred to the system fixed 


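in space. If we denote these cosines by $\alpha_i$, $\beta_i$, $\gamma_i$, then

$$\begin{aligned} \mathbf{i}' &= \alpha_1 \mathbf{i} + \beta_1 \mathbf{j} + \gamma_1 \mathbf{k}, \\ \mathbf{j}' &= \alpha_2 \mathbf{i} + \beta_2 \mathbf{j} + \gamma_2 \mathbf{k}, \\ \mathbf{k}' &= \alpha_3 \mathbf{i} + \beta_3 \mathbf{j} + \gamma_3 \mathbf{k}. \end{aligned} \tag{1}$$

But there are six equations connecting these nine coefficients, viz.:

$$\begin{aligned} 1 &= \mathbf{i}'^2 = \alpha_1^2 + \beta_1^2 + \gamma_1^2, \\ 1 &= \mathbf{j}'^2 = \alpha_2^2 + \beta_2^2 + \gamma_2^2, \\ 1 &= \mathbf{k}'^2 = \alpha_3^2 + \beta_3^2 + \gamma_3^2, \end{aligned} \tag{2}$$

$$\begin{aligned} 0 &= \mathbf{i}'\mathbf{j}' = \alpha_1 \alpha_2 + \beta_1 \beta_2 + \gamma_1 \gamma_2, \\ 0 &= \mathbf{j}'\mathbf{k}' = \alpha_2 \alpha_3 + \beta_2 \beta_3 + \gamma_2 \gamma_3, \\ 0 &= \mathbf{k}'\mathbf{i}' = \alpha_3 \alpha_1 + \beta_3 \beta_1 + \gamma_3 \gamma_1, \end{aligned} \tag{3}$$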


i.e. there are but three additional free variables. The position of a 
rigid body is thus completely determined by six numbers (co-ordinates); 
the number of degrees of freedom is therefore six. 

It often proves useful to introduce three independent variables, 
in place of the nine direction cosines conditioned by six equations. 
For this purpose the Euler angles $\theta$, $\psi$, $\phi$ are suitable. These angles
are defined as follows (cf. fig. 1): $\theta$ is the angle between the $z'$-axis
and the $z$-axis. The $xy$-plane cuts the $x'y'$-plane in a straight line,
which we again call the line of nodes, and which we define by the unit
vector $\mathbf{n}$. We draw a straight line and corresponding unit vector $\mathbf{m}'$
perpendicular to $\mathbf{n}$ and in the $x'y'$-plane. The angle $\psi$ is the longitude
of the node, i.e. the angle between the line of nodes and the
$x$-axis. $\phi$ denotes the angle, in the $x'y'$-plane, between the $x'$-axis
and the line of nodes. If the Euler angles are given, it is not difficult
to draw the axes $\mathbf{i}'$, $\mathbf{j}'$, $\mathbf{k}'$. Using $\psi$, one can first draw the line of
nodes in the $xy$-plane. Through this line pass a plane making an angle
$\theta$ with the $xy$-plane; this is the $x'y'$-plane. The normal to it is the
$z'$-axis. In the $x'y'$-plane, the position of the $x'$-axis is then determined
by $\phi$.

Fig. 1. The axes $O_{x'}$, $O_{y'}$, $O_{z'}$ are fixed in the body, and $O_x$, $O_y$, $O_z$ are
fixed in space.
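
The construction of the body axes from the Euler angles can be followed step by step in a short numerical sketch. It is an illustration only: the function name `body_axes`, the sample angles and the assumed sense of each rotation (counter-clockwise about $z$, about the line of nodes, and about $z'$) are not taken from the text. The sketch builds $\mathbf{n}$, $\mathbf{m}'$ and the three primed axes, and then checks the six relations (2) and (3).

```python
import math

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def body_axes(theta, psi, phi):
    """Return (i', j', k') as direction cosines in the space-fixed frame.

    Assumed convention: psi is measured from the x-axis to the line of
    nodes, theta tilts z into z' about the line of nodes, and phi is
    measured from the line of nodes to the x'-axis.
    """
    # Line of nodes in the xy-plane.
    n = (math.cos(psi), math.sin(psi), 0.0)
    # z'-axis: the z-axis turned through theta about n.
    k_p = (math.sin(theta) * math.sin(psi),
           -math.sin(theta) * math.cos(psi),
           math.cos(theta))
    # m' lies in the x'y'-plane and is perpendicular to n.
    m_p = cross(k_p, n)
    # x'- and y'-axes follow from n and m' by a turn through phi.
    i_p = tuple(math.cos(phi) * a + math.sin(phi) * b for a, b in zip(n, m_p))
    j_p = tuple(-math.sin(phi) * a + math.cos(phi) * b for a, b in zip(n, m_p))
    return i_p, j_p, k_p

i_p, j_p, k_p = body_axes(theta=0.7, psi=1.1, phi=-0.4)
norms = [dot(v, v) for v in (i_p, j_p, k_p)]             # relations (2)
mixed = [dot(i_p, j_p), dot(j_p, k_p), dot(k_p, i_p)]    # relations (3)
print(norms, mixed)
```

The nine direction cosines are the components of the three returned tuples; only the three angles were chosen freely, in line with the count of three rotational degrees of freedom.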


The number of degrees of freedom is diminished if separate points
of the rigid body are held fixed. If, for example, one point is fixed,
we place the origin of co-ordinates $O$ at that point; the position of the
body is then determined by the three Euler angles $\theta$, $\psi$, $\phi$, and so the
number of degrees of freedom is reduced to three. If we hold two
points fixed, the only possible motion is rotation about the axis formed
by these points. The position is then fully determined by giving the
angle of rotation, i.e. the angle between a line in the body normal to
the axis and a plane fixed in space, through that axis. A rigid body
of which two points are fixed has then but one degree of freedom.
This last degree of freedom is also removed if the position of another
point, not on the axis, is prescribed. We shall consider the motions
possible for a rigid body under these various limitations, and finally
touch upon the general motion of a free rigid body.
(b) Rotation of a rigid body about a fixed axis

The angular velocity and the direction of the axis together may
be characterized by a vector in the following manner: we assign a
vector $\boldsymbol{\omega}$ in the direction of the axis, such that the rotation is
clockwise when looking in the direction of $\boldsymbol{\omega}$, and such that the magnitude
of $\boldsymbol{\omega}$ is equal to the angular speed $\omega$. This representation has a
meaning only if we can show that the angular velocities so specified follow
the laws of vector analysis, i.e. that two angular velocities around
intersecting axes combine to give a third angular velocity whose
axis and magnitude are determined by vector addition. This is actually
the case. If $\boldsymbol{\omega}_1$ is the first angular velocity, the linear velocity $\mathbf{v}_1$ of a
point whose radius vector, referred to a point $O$ on the axis, is $\mathbf{r}$ is
given in direction and magnitude by the vector product $[\boldsymbol{\omega}_1 \mathbf{r}]$; for the
linear speed (see fig. 2) is equal to the product of $\omega_1$ by the
perpendicular distance $p$ of the point $P$ from the axis, i.e. equal to
$\omega_1 |\mathbf{r}| \sin(\boldsymbol{\omega}_1, \mathbf{r})$. Moreover, $\mathbf{v}_1$ is normal to the axis, which is also
true of $[\boldsymbol{\omega}_1 \mathbf{r}]$. If now a second axis through $O$ is given, then the
linear velocity corresponding to the rotational velocity $\boldsymbol{\omega}_2$ is given
by $\mathbf{v}_2 = [\boldsymbol{\omega}_2 \mathbf{r}]$. The resulting linear velocity is thus

$$\mathbf{v} = \mathbf{v}_1 + \mathbf{v}_2 = [\boldsymbol{\omega}_1 \mathbf{r}] + [\boldsymbol{\omega}_2 \mathbf{r}],$$

which, by the distributive property of the vector product, is

$$\mathbf{v} = [(\boldsymbol{\omega}_1 + \boldsymbol{\omega}_2)\,\mathbf{r}] = [\boldsymbol{\omega}\,\mathbf{r}]. \tag{4}$$

That is, we may replace the angular velocities $\boldsymbol{\omega}_1$ and $\boldsymbol{\omega}_2$ by the
angular velocity $\boldsymbol{\omega}$ which is obtained from $\boldsymbol{\omega}_1$ and $\boldsymbol{\omega}_2$ by vector addition.
If we multiply both members of equation (4) by $dt$, we have the same
law of combination for infinitesimal angular displacements (rotations).

However, this important rule for combining angular velocities and
infinitesimal rotations must not be understood to hold for the
analogous case of finite rotations about two intersecting axes. True, one
may assign to every finite rotation a directed line-segment whose
length is proportional to the angle turned, but, as may be seen at
once by simple examples, two such rotations combine in a much
more complicated manner than two vectors. A finite rotation
signifies a transformation of axes, i.e. a tensor; and while two successive
transformations may be replaced by a single one, the replacement
cannot be made as simply as for angular velocities.
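
The contrast between angular velocities and finite rotations can be made concrete with a few lines of arithmetic. The sketch below is illustrative only: the helper names and sample vectors are assumptions, and the two quarter turns about the $x$- and $y$-axes are simply one convenient example. It first confirms the distributive rule behind equation (4), then applies the two finite rotations in both orders to the same point.

```python
import math

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def rotate_x(v, angle):
    """Finite rotation of the point v about the x-axis."""
    c, s = math.cos(angle), math.sin(angle)
    x, y, z = v
    return (x, c * y - s * z, s * y + c * z)

def rotate_y(v, angle):
    """Finite rotation of the point v about the y-axis."""
    c, s = math.cos(angle), math.sin(angle)
    x, y, z = v
    return (c * x + s * z, y, -s * x + c * z)

# Angular velocities about two intersecting axes add like vectors.
w1 = (0.3, 0.0, 0.0)
w2 = (0.0, 0.5, 0.0)
r = (1.0, 2.0, 3.0)
v_sum = add(cross(w1, r), cross(w2, r))   # v1 + v2
v_single = cross(add(w1, w2), r)          # [(w1 + w2) r]
print(v_sum, v_single)

# Finite rotations do not: the order of two quarter turns matters.
quarter = math.pi / 2
p = (0.0, 0.0, 1.0)
x_then_y = rotate_y(rotate_x(p, quarter), quarter)
y_then_x = rotate_x(rotate_y(p, quarter), quarter)
print(x_then_y, y_then_x)
```

The two velocity tuples coincide, while the two final positions of the point differ, so no vector sum of the two quarter turns could describe both sequences.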
(c) Plane motion of a rigid body

Next to rotation about a fixed axis, the simplest case would be the
motion of a rigid body about a fixed point, according to the above
enumeration of degrees of freedom. However, it is advisable before
taking up this case to consider a motion which also has three degrees
of freedom: plane motion, in which all points of the body move
parallel to a given fixed plane. All points on a line normal to this
plane then describe congruent paths, and so it is sufficient to study
the motion of one plane moving over another fixed plane. One may
picture this motion as that of a sheet of paper moving about on a
plane table-top. It is evident that this motion has three degrees of
freedom, for if we fix one point of the moving plane by giving its two
co-ordinates, we can still turn this plane about the fixed point; the
position of the moving plane is completely fixed by further specifying
the angle of turn referred to a fixed axis. It may be shown that the
general motion at any instant may be looked upon as a rotation about
an instantaneous pivot point, the instantaneous centre; for if we
select two arbitrary points $P_1$ and $P_2$, the motion of every other point
is determined by the motion of these. After a time $\Delta t$ the two points
are at $P_1'$ and $P_2'$ respectively (fig. 3). Draw the perpendicular
bisectors of $P_1P_1'$ and $P_2P_2'$; let these intersect at $P_0$. The triangles
$P_0P_1P_2$ and $P_0P_1'P_2'$ are congruent, since the construction makes
$P_0P_1 = P_0P_1'$ and $P_0P_2 = P_0P_2'$, and also $P_1P_2 = P_1'P_2'$ on account of the
rigidity of the body. As a result, the triangle $P_0P_1'P_2'$ can be brought
into superposition with triangle $P_0P_1P_2$ by rotation about $P_0$. Since
the position of the body is determined, for plane motion, by the two
points $P_1$ and $P_2$, which were chosen at random, the same construction
with any other pair of points must also lead to the point $P_0$. If $\Delta t$ is
allowed to approach zero, the segments $P_1P_1'$ and $P_2P_2'$ become the
tangents of the paths, and have the directions of the velocity vectors,
and are normal to $P_1P_0$ and $P_2P_0$ respectively. The instantaneous state
of motion may therefore be looked upon as a rotation about $P_0$. This
point is the instantaneous centre.
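
The perpendicular-bisector construction translates directly into two linear equations. The following sketch is an illustration under stated assumptions: the function names, the sample centre, angle and points are invented for the example, and the displacement is generated by a known rotation so that the recovered pole can be compared with it.

```python
import math

def rotate_about(p, centre, angle):
    """Turn the point p through angle about centre (plane motion)."""
    c, s = math.cos(angle), math.sin(angle)
    dx, dy = p[0] - centre[0], p[1] - centre[1]
    return (centre[0] + c * dx - s * dy, centre[1] + s * dx + c * dy)

def pole_from_two_points(p1, p1_new, p2, p2_new):
    """Intersection of the perpendicular bisectors of P1P1' and P2P2'.

    A point X on the bisector of PP' is equidistant from P and P', i.e.
    (P' - P) . X = (|P'|^2 - |P|^2) / 2, one linear equation per point.
    """
    a11, a12 = p1_new[0] - p1[0], p1_new[1] - p1[1]
    a21, a22 = p2_new[0] - p2[0], p2_new[1] - p2[1]
    b1 = 0.5 * (p1_new[0]**2 + p1_new[1]**2 - p1[0]**2 - p1[1]**2)
    b2 = 0.5 * (p2_new[0]**2 + p2_new[1]**2 - p2[0]**2 - p2[1]**2)
    det = a11 * a22 - a12 * a21
    if abs(det) < 1e-14:
        raise ValueError("bisectors are parallel; choose another pair of points")
    # Cramer's rule for the 2x2 system.
    return ((b1 * a22 - b2 * a12) / det, (a11 * b2 - a21 * b1) / det)

# Illustrative displacement: a turn of 0.3 rad about the point (2, -1).
centre, angle = (2.0, -1.0), 0.3
P1, P2 = (0.0, 0.0), (3.0, 4.0)
P1_new = rotate_about(P1, centre, angle)
P2_new = rotate_about(P2, centre, angle)
pole = pole_from_two_points(P1, P1_new, P2, P2_new)
print(pole)
```

Repeating the call with any other pair of body points (not collinear with the centre) returns the same pole, as the congruence argument requires.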

If the position of the instantaneous centre is marked in the fixed 
plane, as well as in the moving plane, we obtain (except in the trivial
case of simple rotation about a fixed point) a curve in each plane. 
The curve in the fixed plane is called the herpolhode; that in the moving 
system, the polhode. According to their definition, both curves must 
have the instantaneous centre in common at any instant. An infinitesi- 
mal rotation about this point must bring the neighbouring points of 
both curves into coincidence. This is possible only if the two curves 
touch at the pole and roll upon each other without slipping. This is 
immediately evident from a diagram in which the curves are pro- 
visionally replaced by broken lines (fig. 4, p. 141 ). 


(d) Motion of a rigid body about a fixed point 

The motion of a rigid system about a fixed point bears the same 
relation to plane motion as spherical geometry bears to plane geometry. 
Just as the points on a normal to the fixed plane describe congruent 
orbits in the case of plane motion of a rigid body, so here the points 
on a radius through the fixed centre describe paths which are similar 
spherical curves, similarly placed with respect to the fixed point. It 
is therefore sufficient to examine the motion on any spherical surface 
having the fixed point as centre. We may thus picture the situation 
as the sliding of a spherical shell upon an inner fixed sphere. We can 
apply the same construction as we used for plane motion, and so 
obtain an instantaneous centre on the sphere. If we trace the path 
of the instantaneous centre, we obtain two spherical curves, the her- 
polhode and the polhode. By rolling the latter curve upon the former, 
the motion is determined exactly as in plane motion. In reality, we 
are not dealing with two spherical curves, one rolling on the other, but 
with two cones, if we consider all the concentric spheres. In the same 
way, in the case of plane motion, we really have two cylinders, whose 
generators are normal to the plane, rolling one upon the other. 


(e) Arbitrary motion of a rigid body 

Since the position of a rigid body is determined by the position 
of one point, and by the orientation of a set of axes fixed in the body, 
a change to a neighbouring position can be effected by a translatory 
motion which brings that point to its new position, followed by an 
infinitesimal rotation of the axes to their new position. We can always
arrange matters so that the axis of instantaneous rotation lies in the 
direction of the translational motion, and a rigid body may therefore 
always be moved to a neighbouring position by means of an infinitesi- 
mal screw motion. We give this result without proof, since no further 
use will be made of it. If we again trace these instantaneous screw 
axes in both the moving and fixed systems, we obtain two ruled sur- 
faces. In the course of the motion, these surfaces not only roll upon 
one another, as in plane motion or spherical motion, but there is a
simultaneous slipping along the instantaneous common generator. 

2. General Statics and Dynamics of Rigid Bodies. Equivalence of 
Systems of Forces acting upon Rigid Bodies. 

According to Chap. V (p. 85), the necessary and sufficient con- 
dition for the equilibrium of a single particle (i.e. for the absence of 
all accelerations) is the vanishing of the resultant of all applied forces. 
In the general mechanics of systems of particles, the vanishing of the 
resultants of all force vectors means that there is no acceleration of 
the centre of gravity of the system: i.e. if the centre of gravity was 
at rest before the system of forces having zero resultant was applied, 
it remains at rest when the set of forces is applied. This still does not 
say that the system of particles remains at rest, for rotational accelera- 
tions about the centre of gravity may still take place. For a rigid 
body, the vanishing of the resultant moment

$$\mathbf{M} = \sum [\mathbf{r}_i \mathbf{F}_i]$$

is the necessary and sufficient condition for the absence of this kind
of acceleration also. The reason is that the rigid connexions of the
separate parts ensure that the changes in angular momenta of the
parts are all of the same sign, so that it is impossible for these changes
to compensate each other, as may occur in an arbitrary system of
particles. As a result, when $\mathbf{M} = 0$, equation (7) (p. 110) becomes

$$\frac{d\mathbf{P}}{dt} = 0.$$

As we have just seen, the absence of a change in the total angular
momentum $\mathbf{P}$ implies that there is no change in the angular momentum
of the separate parts. If then $\mathbf{F}$ also vanishes, and no motion existed
at first, the system remains at rest. The necessary and sufficient
conditions for the equilibrium of a system of forces applied to a rigid body
are therefore the vanishing of the resultant force and of the resultant
moment of the forces:

$$\sum \mathbf{F}_i = \mathbf{F} = 0 \quad \text{and} \quad \sum [\mathbf{r}_i \mathbf{F}_i] = \mathbf{M} = 0. \tag{5}$$

The effect of a system of forces on a rigid body depends solely
upon these two resultants. This is evident for statics, but in kinetics,
too, only these two vectors occur, as we shall see in the following
sections. All systems of forces which yield the same resultants $\mathbf{F}$ and $\mathbf{M}$
are entirely equivalent in effect.

While the vector quantities with which we have dealt thus far
could be moved about parallel to themselves at will, without changing
their significance, this is no longer true of the forces acting upon a
rigid body. It is evident that if we displace parallel to itself a force
vector acting at a point $P$, we change the point of application in such
manner that the moment of the force is altered. The moment is
unchanged only if the displacement is along the vector. A force acting
on a rigid body is a vector localized in a line and may be moved only
along its line of action.

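The value of the resultant moment depends, in general, upon the
position of the reference point $O$. There is an important special case
where this is not so, viz. if the resultant $\mathbf{F}$ vanishes. If, for example,
we shift the reference point to $O'$ by a vector $\mathbf{d}$, and denote the vectors
radiating from $O'$ by primes, then

$$\mathbf{r}_i' = \mathbf{r}_i - \mathbf{d}, \quad \text{i.e.} \quad \sum [\mathbf{r}_i' \mathbf{F}_i] = \sum [\mathbf{r}_i \mathbf{F}_i] - \sum [\mathbf{d}\,\mathbf{F}_i] = \sum [\mathbf{r}_i \mathbf{F}_i] - [\mathbf{d}\,\mathbf{F}]. \tag{6}$$

The expression $\sum [\mathbf{d}\,\mathbf{F}_i]$ vanishes, no matter what the direction of $\mathbf{d}$,
if $\sum \mathbf{F}_i = 0$, so that the centre of gravity remains at rest or moves
uniformly.

The scalar product $\mathbf{F}\mathbf{M}$ is independent of the point of reference in
any case, for if we multiply both members of equation (6) scalarly by
$\mathbf{F}$, the term $\mathbf{F}[\mathbf{d}\,\mathbf{F}]$ vanishes, and there remains

$$\mathbf{F} \sum [\mathbf{r}_i' \mathbf{F}_i] = \mathbf{F} \sum [\mathbf{r}_i \mathbf{F}_i]. \tag{7}$$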


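It may be shown also that there is always an axis such that if any point
thereon be chosen as reference point, $\mathbf{M}$ will have the direction of $\mathbf{F}$. This axis
is called the central axis of the system of forces. If we denote the moment referred
to $O'$ by $\mathbf{M}'$, the vector $\mathbf{d}$ should be so determined that

$$\mathbf{M}' = \mathbf{M} - [\mathbf{d}\,\mathbf{F}] = \gamma \mathbf{F}, \tag{8}$$

where $\gamma$ is a scalar which is yet undetermined. Scalar multiplication by $\mathbf{F}$ yields

$$\mathbf{M}'\mathbf{F} = \mathbf{M}\mathbf{F} = \gamma F^2,$$

that is,

$$\gamma = \frac{\mathbf{M}\mathbf{F}}{F^2}$$

and

$$[\mathbf{d}\,\mathbf{F}] = \mathbf{M} - \gamma \mathbf{F} = \mathbf{M} - \frac{\mathbf{F}(\mathbf{M}\mathbf{F})}{F^2} = \frac{\mathbf{M} F^2 - \mathbf{F}(\mathbf{M}\mathbf{F})}{F^2} = \frac{[\mathbf{F}[\mathbf{M}\mathbf{F}]]}{F^2}. \tag{9}$$

From (9) we see that

$$\mathbf{d} = \frac{[\mathbf{F}\mathbf{M}]}{F^2} + \lambda \mathbf{F}, \tag{10}$$

where $\lambda$ is an arbitrary variable scalar quantity. This means, however, that the
terminus of $\mathbf{d}$ is on a straight line parallel to $\mathbf{F}$. If, in particular, $\mathbf{M}$ is parallel
to $\mathbf{F}$, the central axis passes through $O$ and is parallel to $\mathbf{F}$.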
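
A small numerical sketch may help in following the reduction to the central axis. It is illustrative only: the helper names and the three sample forces are assumptions, and the point $\mathbf{d} = [\mathbf{F}\mathbf{M}]/F^2$ used below is one particular point of the central axis, chosen because it satisfies the condition $[\mathbf{d}\,\mathbf{F}] = [\mathbf{F}[\mathbf{M}\mathbf{F}]]/F^2$ derived above.

```python
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def resultants(points, forces):
    """Resultant force F and resultant moment M about the origin O."""
    F = (0.0, 0.0, 0.0)
    M = (0.0, 0.0, 0.0)
    for r, f in zip(points, forces):
        F = add(F, f)
        M = add(M, cross(r, f))
    return F, M

def moment_about_shifted_origin(M, F, d):
    """Moment referred to O' = O + d, following equation (6): M' = M - [d F]."""
    return sub(M, cross(d, F))

def central_axis_point(F, M):
    """One point of the central axis (assumed choice: no component along F)."""
    F2 = dot(F, F)
    return tuple(x / F2 for x in cross(F, M))

# Illustrative system of three forces.
points = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.0, 0.0, 3.0)]
forces = [(0.0, 1.0, 0.0), (0.0, 0.0, 2.0), (1.0, 0.0, 1.0)]
F, M = resultants(points, forces)
d = central_axis_point(F, M)
M_new = moment_about_shifted_origin(M, F, d)
print(F, M, d, M_new)
print(dot(F, M), dot(F, M_new))   # the scalar product F.M is unchanged
print(cross(M_new, F))            # vanishes: M' is parallel to F
```

For these sample forces the shifted moment comes out as a multiple of $\mathbf{F}$, with the factor $\gamma = \mathbf{M}\mathbf{F}/F^2$.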


The simplest system of forces having a vanishing resultant $\mathbf{F}$ but
a non-vanishing $\mathbf{M}$ consists of two equal and oppositely directed forces,
$\mathbf{F}$ applied at the point $\mathbf{r}_1$ and $-\mathbf{F}$ at $\mathbf{r}_2$. This system is called a couple.
The resultant moment $\mathbf{M} = [(\mathbf{r}_1 - \mathbf{r}_2)\,\mathbf{F}]$ is a vector perpendicular
to the plane of the two parallel forces and equal in magnitude to $F$
times the perpendicular distance between the forces. Since we saw
that the effect of a set of forces applied to a rigid body depends only
upon $\mathbf{F}$ and $\mathbf{M}$, we see that any system of forces may be replaced by
a single force $\mathbf{F}$ and a couple $\mathbf{M}$ whose moment is equal to the resultant
moment of the set of forces.

If all the forces acting on a rigid body are coplanar, a frequent occurrence
in practice, then the magnitude and line of action of their resultant may be
found in a vivid way by means of two graphical constructions known as the
polygon of forces and the funicular polygon. In fig. 5a the four applied forces $\mathbf{F}_1$,
$\mathbf{F}_2$, $\mathbf{F}_3$, $\mathbf{F}_4$ are shown each in its correct line of action. In fig. 5b the resultant $\mathbf{F}$
is determined by the familiar force polygon construction. In order to find the
correct line of action of the resultant, we select an arbitrary point $O$ and draw
lines from it to the various corners of the polygon. These lines may be thought of
as auxiliary force vectors $\mathbf{F}_{0,1}$, $\mathbf{F}_{1,2}$, $\mathbf{F}_{2,3}$, $\mathbf{F}_{3,4}$ and $\mathbf{F}_{4,0}$. From the triangles thus
formed it appears that each pair of auxiliary forces is in equilibrium with one of
the given forces, provided that we apply both of the former at a single point on
the line of action of the given force. Except for $\mathbf{F}_{0,1}$ and $\mathbf{F}_{4,0}$, however, all
auxiliary forces enter twice, each time with opposite signs; and since the two
members of a pair act in the same line they cannot change anything connected
with the equilibrium conditions. Inasmuch as equilibrium obtains after the
introduction of the auxiliary forces, $\mathbf{F}_{0,1}$ and $\mathbf{F}_{4,0}$ must be in equilibrium with
the resultant $\mathbf{F}$, and the line of action of the latter must pass through the
intersection of those of the former two forces. Hence in diagram (a), through any
point of $\mathbf{F}_1$ we draw a line parallel to $\mathbf{F}_{0,1}$ and another parallel to $\mathbf{F}_{1,2}$. Through
the intersection of the latter with $\mathbf{F}_2$ we draw another line parallel to $\mathbf{F}_{2,3}$, and
so on. The intersection of the lines parallel to $\mathbf{F}_{0,1}$ and $\mathbf{F}_{4,0}$ is a point on the
resultant, whose direction is that of the closing side $\mathbf{F}$ of the polygon in (b).

Fig. 5

The system of lines supplied to the diagram of original forces is called the
funicular (pertaining to a rope) polygon because a rope would assume this
shape under the action of the given forces. In the event that the force polygon
has no resultant, i.e. closes, the directions of $\mathbf{F}_{0,1}$ and $\mathbf{F}_{4,0}$ will be parallel and the
resultant will be a couple. If the two parallel lines are coincident, in which case
the funicular polygon will be closed also, the given forces will be in equilibrium.

A beam is supported at its ends and a concentrated load of [...] is applied
[...] from one end. By reversing the construction just
described, find the magnitude of the downward forces on the supports. How
does one proceed when several loads are applied?
3. Rotation of a Rigid Body about a Fixed Axis. Moment of Inertia
and its Calculation.

We begin the subject of the dynamics of a rigid body with the
simplest case, “one-dimensional” motion about a fixed axis. This
motion is determined by a single co-ordinate, the angle of turn $\phi$.
Thinking of the axis as a material axle, we may consider it fixed by
holding two of its points, $O$ and $O'$, in bearings. Let the resultant of
the external forces be $\mathbf{F}$, and the resultant moment, $\mathbf{M}$. If we apply
the general equations of the mechanics of systems of particles, thus
treating the body as free, we must also take into account the reactions
at the bearings. That is to say, if the bearings were not present, the
axis of rotation would, in general, tend to move in some direction.
This is prevented by the bearings, and so the axle exerts a pressure
on the bearings. We may consider the bearings replaced by forces
which are equal to and opposite to the pressure exerted by the axle
at any instant. Call these forces, acting at $O$ and $O'$, $\mathbf{F}_r$ and $\mathbf{F}_r'$
respectively. If we put the origin of co-ordinates at $O$, the point of
application of one of these forces, then $[\mathbf{r}\mathbf{F}_r]$ vanishes, and the equation
for the rate of change of the total angular momentum of the system
becomes

$$\frac{d\mathbf{P}}{dt} = [\mathbf{r}' \mathbf{F}_r'] + \mathbf{M}. \tag{11}$$

We now put the $z$-axis of our co-ordinate system along the axis of
rotation. Since the vector product $[\mathbf{r}' \mathbf{F}_r']$ has no $z$-component, on
account of $\mathbf{r}' = r'\mathbf{k}$, the reaction force for the $z$-component of equation
(11) drops out, and we obtain the simpler equation

$$\frac{dP_z}{dt} = M_z, \tag{12}$$

which we will now treat further. If the angular velocity vector is

$$\boldsymbol{\omega} = \omega \mathbf{k}, \tag{13}$$

then

$$\begin{aligned} \mathbf{P} &= \sum m_i [\mathbf{r}_i [\boldsymbol{\omega}\,\mathbf{r}_i]] \\ &= \boldsymbol{\omega} \sum m_i r_i^2 - \sum m_i \mathbf{r}_i (\boldsymbol{\omega}\,\mathbf{r}_i) \\ &= \omega \mathbf{k} \sum m_i r_i^2 - \omega \sum m_i \mathbf{r}_i (\mathbf{r}_i \mathbf{k}), \end{aligned} \tag{14}$$

$$\begin{aligned} P_z &= \omega \sum m_i r_i^2 - \omega \sum m_i (\mathbf{r}_i \mathbf{k})^2 = \omega \sum m_i [\mathbf{r}_i \mathbf{k}]^2 \\ &= \omega \sum m_i (x_i^2 + y_i^2). \end{aligned} \tag{15}$$
The quantity

$$\sum m_i [\mathbf{r}_i \mathbf{k}]^2 = \sum m_i (x_i^2 + y_i^2) = \sum m_i h_i^2 \tag{16}$$

is called the moment of inertia of the body about the $z$-axis, and
is denoted by $I_{zz}$, for a reason explained below. With this notation,
(12) gives the equation of motion

$$I_{zz} \frac{d\omega}{dt} = I_{zz} \frac{d^2\phi}{dt^2} = M_z. \tag{17}$$

Since the position of the rotating body is determined by the angle $\phi$,
no other equations are needed if we wish to study only the course of
the motion. We must bring in the remaining equations of motion,
however, if we wish to find the pressures at the bearings. Comparing
(17) with the statement of Newton’s Second Law for the case of one
degree of freedom of a particle, viz.

$$m \frac{d^2x}{dt^2} = F_x,$$

we see that the two equations are mathematically identical, and that
the following paired quantities correspond to each other:

Moment of inertia $I_{zz}$.                     Mass $m$.

Angular acceleration $d^2\phi/dt^2$.            Linear acceleration $d^2x/dt^2$.

Component, along the axis of                 Component of force in the
rotation, of the moment of                   direction of motion.
the external forces.

Moreover, the kinetic energy of rotation becomes, in complete analogy
with the case of linear motion,

$$T = \tfrac{1}{2} I_{zz} \left(\frac{d\phi}{dt}\right)^2, \tag{18}$$

as may be verified readily.

In the mechanics of a particle, the vibrational motion taking place
under the influence of a quasi-elastic force $F_x = -kx$ is of importance.
In the rotational motion of rigid bodies, also, there often arise
restoring torques which are proportional to the angle of turn, and whose
representative vectors are in the direction of the axis. Such a turning
moment is of the form $M = -\tau\phi$, and the equation of motion becomes

$$I \frac{d^2\phi}{dt^2} + \tau\phi = 0, \tag{19}$$

where in place of $I_{zz}$ we write $I$ for brevity, it being understood that
we mean the moment of inertia about the axis of rotation. The factor
of proportionality $\tau$ is called the moment of torsion; it has the dimensions
of force × distance, i.e. work. A torque of this kind arises, for
example, when a rigid body, suspended by a wire clamped at the upper
end, is slightly twisted about the wire as axis. Since equation (19) is
identical with equation (24) (p. 93), apart from notation, we can
write down at once the frequency of free vibration of a rigid body
under the influence of a restoring torque proportional to the angle of twist:

$$\nu = \frac{1}{2\pi} \sqrt{\frac{\tau}{I}} = \frac{1}{2\pi} \sqrt{\frac{\text{moment of torsion}}{\text{moment of inertia}}}. \tag{20}$$
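
As a worked illustration of the frequency formula, the sketch below evaluates it for sample numbers and inverts it to obtain the moment of torsion from a measured period. The function names and the numerical values are assumptions chosen for the example, not data from the text.

```python
import math

def torsional_frequency(moment_of_torsion, moment_of_inertia):
    """Frequency of free torsional vibration: (1 / 2 pi) * sqrt(tau / I)."""
    if moment_of_torsion <= 0 or moment_of_inertia <= 0:
        raise ValueError("both quantities must be positive")
    return math.sqrt(moment_of_torsion / moment_of_inertia) / (2.0 * math.pi)

def moment_of_torsion_from_period(period, moment_of_inertia):
    """Invert the frequency formula: tau = I * (2 pi / T)^2."""
    return moment_of_inertia * (2.0 * math.pi / period) ** 2

# Illustrative values: I in kg m^2, tau in N m per radian.
I = 2.0e-4
tau = 5.0e-3
nu = torsional_frequency(tau, I)
print(nu)                                        # vibrations per second
print(moment_of_torsion_from_period(1.0 / nu, I))  # recovers tau
```

The second function reflects a common use of the torsion pendulum: with $I$ known, timing the oscillation gives the moment of torsion of the suspension.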



In order to determine $I$, we must calculate the quantity $\sum m_i h_i^2$,
where $h_i$ is the perpendicular distance of the $i$th particle from the
axis. For a continuous distribution of matter of density $\rho$, the sum is
to be replaced by the integral $\iiint \rho h^2\,dx\,dy\,dz$. We shall find, below, that
the moment of inertia about any axis may be found without another
integration, provided that this quantity is known for a particular set
of three mutually perpendicular axes of the body intersecting at the
centre of gravity. We show in the first place that the moment of
inertia about any axis not passing through the centre of gravity may
be calculated from the value of the moment of inertia about a parallel
axis through the centre of gravity. If $\mathbf{r}$ is the vector to the centre of
gravity from an origin on any selected axis, and if we denote by primes
the radius vectors from the centre of gravity to the various particles,
we have

$$\mathbf{r}_i = \mathbf{r} + \mathbf{r}_i'. \tag{21}$$

If the axial direction is denoted by the unit vector $\mathbf{c}$,

$$I = \sum m_i [\mathbf{r}_i \mathbf{c}]^2 = \sum m_i [\mathbf{r}\,\mathbf{c}]^2 + \sum m_i [\mathbf{r}_i' \mathbf{c}]^2, \tag{22}$$

since $\sum m_i \mathbf{r}_i' = 0$ (cf. p. 107), i.e. the term

$$2 \sum m_i [\mathbf{r}\,\mathbf{c}] \cdot [\mathbf{r}_i' \mathbf{c}] = 2\,[\mathbf{r}\,\mathbf{c}] \cdot \Big[ \sum m_i \mathbf{r}_i',\ \mathbf{c} \Big]$$

vanishes.

The second term of (22) represents the moment of inertia referred 
to the axis passing through the centre of gravity; the first term re- 
presents the moment of inertia of the body about the new axis, con- 
sidering the entire mass concentrated at the centre of gravity. This is 
Steiner's Theorem: 

The moment of inertia of a rigid body referred to an arbitrary axis is
equal to the moment of inertia referred to a parallel axis through the centre
of gravity, plus the moment of inertia of the entire mass about the given
axis, assuming it to be concentrated at the centre of gravity.*
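
Steiner's theorem is easily checked for a handful of point masses. The sketch below is an illustration only: the helper names, masses, positions and the chosen axis are assumptions. It computes $\sum m_i h_i^2$ directly about an arbitrary axis and compares it with the sum of the moment about the parallel axis through the centre of gravity and the moment of the total mass placed at the centre of gravity.

```python
import math

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def moment_of_inertia(masses, points, axis_point, axis_dir):
    """Sum of m_i h_i^2 about the axis through axis_point along axis_dir."""
    norm = math.sqrt(dot(axis_dir, axis_dir))
    c = tuple(x / norm for x in axis_dir)      # unit vector along the axis
    total = 0.0
    for m, p in zip(masses, points):
        perp = cross(sub(p, axis_point), c)    # |[r c]| is the distance h
        total += m * dot(perp, perp)
    return total

def centre_of_gravity(masses, points):
    total_mass = sum(masses)
    return tuple(sum(m * p[k] for m, p in zip(masses, points)) / total_mass
                 for k in range(3))

# Illustrative body of three particles and an arbitrary axis.
masses = [1.0, 2.0, 3.0]
points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 1.0)]
axis_point, axis_dir = (1.0, 1.0, 1.0), (0.0, 0.0, 1.0)

cg = centre_of_gravity(masses, points)
I_axis = moment_of_inertia(masses, points, axis_point, axis_dir)
I_cg = moment_of_inertia(masses, points, cg, axis_dir)
I_mass_at_cg = moment_of_inertia([sum(masses)], [cg], axis_point, axis_dir)
print(I_axis, I_cg + I_mass_at_cg)
```

The two printed numbers agree, which is the content of the theorem for this particular body and axis.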


• It is sometimes convenient to introduce the so-called radius of gyration k defined by 

k x = = J_ . 


m 


w 


i-e. k is the distance from the axis at which the entire mass may be considered oon- 
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We now calculate the moment of inertia about an arbitrary axis 
through the reference point 0. If the direction cosines of the axis, 
referred to an arbitrary fixed system of co-ordinates, are cos a, cos p, 

cosy, then 

I = 'Lm i [r i cf = Xm { { (y t cosy — z< cos £) 2 + (z< cos a — x { cosy) 2 
+ (®< cos/3— 2/ t - cos a) 2 } 

= Sm,(y,*+ z< 2 ) cos 2 a + 2 m t (z, 2 + a;,- 2 ) cos 2 j8 + S m t (s, 2 + </< 2 ) cos 2 y 

— 22 ntitftZi cos y cos ft — 22m,z < x, cos a cosy 

— 2Xm i x i y i cos a cos ft (23) 

The moment of inertia is thus a quadratic function of the direction 
cosines of the axis. The coefficients of this quadratic function are of 
two kinds; those of the squares of the direction cosines are the moments 
of inertia about the three co-ordinate axes: 

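$$\sum m_i(y_i^2+z_i^2) = I_{xx},\qquad \sum m_i(z_i^2+x_i^2) = I_{yy},\qquad \sum m_i(x_i^2+y_i^2) = I_{zz}. \tag{24}$$

The coefficients associated with the mixed products of the direction
cosines are called the products of inertia:

$$\sum m_i x_i y_i = I_{xy},\qquad \sum m_i y_i z_i = I_{yz},\qquad \sum m_i z_i x_i = I_{zx}. \tag{25}$$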



The moments of inertia, as functions of the directions of the axes, 
can be represented graphically by means of a surface, as follows. 
Draw the radius vector R from the origin O in the direction of the
axis about which the moment of inertia is taken. Take the length of
this radius vector so that

$$R = \frac{1}{\sqrt{I}}. \tag{26}$$

Referring the resulting surface to a system of co-ordinates with origin
at O, we have

$$\cos\alpha = \frac{x}{R} = x\sqrt{I},\qquad \cos\beta = y\sqrt{I},\qquad \cos\gamma = z\sqrt{I}. \tag{27}$$


Then, according to (23), the equation of the surface which is the locus 
of the end points of R is 

$$I_{xx}x^2 + I_{yy}y^2 + I_{zz}z^2 - 2I_{xy}xy - 2I_{yz}yz - 2I_{zx}zx = 1. \tag{28}$$
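As a rough numerical illustration of (23) to (28), the sketch below evaluates the moment of inertia about an axis both directly and through the six coefficients, and then confirms that the end point of R lies on the surface (28). The point masses and the function names are assumptions made for the example only.

```python
import math

def inertia_coefficients(masses, points):
    """Moments (24) and products (25) of inertia for point masses."""
    Ixx = sum(m * (y * y + z * z) for m, (x, y, z) in zip(masses, points))
    Iyy = sum(m * (z * z + x * x) for m, (x, y, z) in zip(masses, points))
    Izz = sum(m * (x * x + y * y) for m, (x, y, z) in zip(masses, points))
    Ixy = sum(m * x * y for m, (x, y, z) in zip(masses, points))
    Iyz = sum(m * y * z for m, (x, y, z) in zip(masses, points))
    Izx = sum(m * z * x for m, (x, y, z) in zip(masses, points))
    return Ixx, Iyy, Izz, Ixy, Iyz, Izx

def quadratic_form(coeffs, v):
    """Left-hand side of (28); with v = direction cosines it is I of (23)."""
    Ixx, Iyy, Izz, Ixy, Iyz, Izx = coeffs
    x, y, z = v
    return (Ixx * x * x + Iyy * y * y + Izz * z * z
            - 2 * Ixy * x * y - 2 * Iyz * y * z - 2 * Izx * z * x)

def moment_direct(masses, points, c):
    """I = sum m_i |[r_i c]|^2 evaluated without the coefficients."""
    total = 0.0
    for m, (x, y, z) in zip(masses, points):
        cx = y * c[2] - z * c[1]
        cy = z * c[0] - x * c[2]
        cz = x * c[1] - y * c[0]
        total += m * (cx * cx + cy * cy + cz * cz)
    return total

masses = [1.0, 2.0, 1.5]                                   # illustrative values
points = [(1.0, 0.5, -0.2), (-0.3, 1.2, 0.8), (0.4, -1.0, 1.1)]
c = (1 / 3, 2 / 3, 2 / 3)                                  # direction cosines (unit length)

coeffs = inertia_coefficients(masses, points)
I_axis = quadratic_form(coeffs, c)                         # equation (23)
R = 1.0 / math.sqrt(I_axis)                                # equation (26)
end_point = tuple(R * ck for ck in c)                      # equation (27) solved for x, y, z

print(I_axis, moment_direct(masses, points, c))
print(quadratic_form(coeffs, end_point))                   # should be 1, equation (28)
```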

centrated without altering the moment of inertia. The theorem may then be stated:
“The square of the radius of gyration about any axis is equal to the square of the
radius of gyration about a parallel axis through the centre of gravity plus the square
of the distance between the two axes.” The other conclusions of this section also
may be expressed in terms of the radius of gyration.

This equation, being of the second degree, represents a conicoid, or
quadric surface. It is an ellipsoid (in special cases, an ellipsoid of
rotation, or a sphere), since the moment of inertia of a physical body,
referred to any axis, cannot vanish, and hence no radius vector
$R = 1/\sqrt{I}$ can become infinite. Since the quantities $I_{xx}$, &c., depend on
the choice of reference point, there will be a different ellipsoid of
inertia for every point of reference O. Now, according to analytical
geometry, for every surface of the second degree there is a particular 
system of co-ordinates, that of the principal axes, in which the equation 
of the surface has the simple form 


$$I_1x'^2 + I_2y'^2 + I_3z'^2 = 1. \tag{29}$$

The three quantities $I_1$, $I_2$, $I_3$ are called the principal moments of
inertia. If the axes of the principal moments and the magnitudes are
known, the moment of inertia for any other axis through O may be
evaluated graphically or analytically as the inverse square of the
radius vector drawn in the direction of the axis in question. If O is
the centre of gravity, the moment of inertia for any other axis may be
found by applying Steiner’s theorem. A second-degree surface has 
six independent coefficients. In general, then, one must measure six 
moments of inertia, in so far as the positions of the principal axes of 
inertia (which correspond to the axes of symmetry) cannot be deter- 
mined in advance on grounds of symmetry. The six quantities deter- 
mining the ellipsoid of inertia are to be interpreted as follows — three 
for the principal moments of inertia and three additional ones for the 
orientation of the system of principal axes within the body. 

The expression for the kinetic energy of a rotating rigid body is 
specially simple in the co-ordinate system of the principal axes of 
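inertia. In general we have

$$T = \tfrac12\sum m_i v_i^2 = \tfrac12\sum m_i[\boldsymbol{\omega}\,\mathbf{r}_i]^2,$$

and since

$$\boldsymbol{\omega} = \omega\,\mathbf{c},$$

this expression becomes

$$T = \frac{\omega^2}{2}\sum m_i[\mathbf{c}\,\mathbf{r}_i]^2 = \tfrac12\omega^2 I.$$

But we have already calculated the expression for $\sum m_i[\mathbf{c}\,\mathbf{r}_i]^2$ in the
co-ordinate system of the principal axes. Since $I_{kl} = 0$ for $k \neq l$, we
have from equation (23),

$$T = \tfrac12\omega^2\,(I_1\cos^2\alpha + I_2\cos^2\beta + I_3\cos^2\gamma), \tag{30}$$

and since

$$\cos\alpha = \frac{\omega_{x'}}{\omega},\qquad \cos\beta = \frac{\omega_{y'}}{\omega},\qquad \cos\gamma = \frac{\omega_{z'}}{\omega}, \tag{31}$$

resolution of ω along the principal axes of inertia fixed in the body
makes the expression for the kinetic energy take the form

$$T = \tfrac12\,(I_1\omega_{x'}^2 + I_2\omega_{y'}^2 + I_3\omega_{z'}^2). \tag{32}$$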
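Equation (32) can be compared with the direct sum over particles. In the sketch below the masses are placed in pairs on the co-ordinate axes, an arrangement chosen here (as an assumption for the example) so that the products of inertia vanish and the co-ordinate axes are principal axes.

```python
# Kinetic energy of a rotating body: direct sum versus equation (32).
# Masses sit in symmetric pairs on the axes, so the products of inertia vanish.

def cross(a, b):
    """Vector product [a b]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

particles = [  # (mass, position); illustrative values only
    (2.0, (1.0, 0.0, 0.0)), (2.0, (-1.0, 0.0, 0.0)),
    (1.0, (0.0, 2.0, 0.0)), (1.0, (0.0, -2.0, 0.0)),
    (3.0, (0.0, 0.0, 0.5)), (3.0, (0.0, 0.0, -0.5)),
]

I1 = sum(m * (y * y + z * z) for m, (x, y, z) in particles)
I2 = sum(m * (z * z + x * x) for m, (x, y, z) in particles)
I3 = sum(m * (x * x + y * y) for m, (x, y, z) in particles)

omega = (0.3, -1.2, 2.0)   # angular velocity resolved along the principal axes

# T = 1/2 sum m_i |[omega r_i]|^2
T_direct = 0.5 * sum(m * sum(v * v for v in cross(omega, r)) for m, r in particles)

# T = 1/2 (I1 wx^2 + I2 wy^2 + I3 wz^2), equation (32)
T_principal = 0.5 * (I1 * omega[0] ** 2 + I2 * omega[1] ** 2 + I3 * omega[2] ** 2)

print(T_direct, T_principal)
```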

Ex. 48. Calculate the moment of inertia of a solid, homogeneous cube about 
one of its diagonals. 

Ex. 49. Theory of Kater's Pendulum. If a rigid pendulum has the same
period when oscillating about two parallel axes at unequal distances s and s'
from the centre of gravity, then the length of the simple pendulum having the
same period is given by s + s'. Prove this.

Ex. 50. What must be the distance of the point of suspension of a pendulum
from its centre of gravity in order that a small change in this distance may have
a minimum effect on the period?

Ex. 51. For many purposes, a diatomic molecule may be represented by a 
rigid “dumb-bell” model, in which the masses $M_1$ and $M_2$ are a fixed distance
a apart. This system rotates as it flies through space. Represent the kinetic 
energy as the sum of translational and rotational energy. 


4. Motion of a Rigid Body about a Fixed Point. Elements of the Theory 
of the Top. 


(a) Relation between angular momentum and angular velocity

For the investigation of the motion of a rigid body about a fixed 
axis we needed only the component of total angular momentum in the 
direction of the axis, and not the forces holding the axis in place, so 
long as we were interested only in the progress of the motion, and not 
in the forces at the bearings in which the imaginary axle was con- 
sidered to turn. If, now, one of the bearings is removed, and the axle 
is constrained only at one point O, the axis will not, in general, remain
stationary, but will turn about O. If we wish to apply the theorem
concerning the total moment of momentum, we must consider again
the relation between the angular momentum vector P and the angular
velocity vector ω, given (p. 146) by

$$\mathbf{P} = \sum m_i r_i^2\,\boldsymbol{\omega} - \sum m_i\mathbf{r}_i\,(\mathbf{r}_i\cdot\boldsymbol{\omega}).$$

Writing the equation in components,

$$P_x = \sum m_i(x_i^2+y_i^2+z_i^2)\,\omega_x - \sum m_i x_i\,(x_i\omega_x + y_i\omega_y + z_i\omega_z),$$

or

$$P_x = I_{xx}\omega_x - I_{xy}\omega_y - I_{xz}\omega_z,$$

and similarly,

$$\begin{aligned}
P_y &= -I_{yx}\omega_x + I_{yy}\omega_y - I_{yz}\omega_z,\\
P_z &= -I_{zx}\omega_x - I_{zy}\omega_y + I_{zz}\omega_z.
\end{aligned} \tag{33}$$


That is, the components of P are linear, homogeneous functions of the
components of ω. The matrix of coefficients is symmetric, since the
terms symmetric with respect to the diagonal are equal. Starting with
a given vector ω, the correspondence between the vectors P and ω,
determined by six coefficients, must, of course, always lead to the same
vector P, in whatever way the vectors are resolved into components.
But, according to p. 36, this is the criterion that the six coefficients
should be the components of a symmetric tensor, the inertial tensor.
According to the explanation on pp. 36 et seq., one can find the vector
P corresponding to a given vector ω by using the tensor ellipsoid:

$$I_{xx}x^2 + I_{yy}y^2 + I_{zz}z^2 - 2I_{xy}xy - 2I_{yz}yz - 2I_{zx}zx = 1. \tag{34}$$

This ellipsoid is identical with the ellipsoid of inertia discussed in the 
preceding section. To find P, draw a radius vector r parallel to ω.
Then P has the direction of the normal to the ellipsoid at the terminus 
of r. The magnitude of P is determined as on p. 37 by the equation 
upon which the construction of the ellipsoid is based. The normal 
and the radius vector have, in general, different directions, so that 
the vector of the resultant angular momentum does not, in general, 
have the same direction as the axis of rotation. The two are parallel 
only for the directions of the three principal axes of inertia. If we denote 
by accents the components referred to this co-ordinate system, which is 
determined by the form of the body and the distribution of mass, then 

$$P_{x'} = I_1\omega_{x'},\qquad P_{y'} = I_2\omega_{y'},\qquad P_{z'} = I_3\omega_{z'}. \tag{35}$$
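The statement that P and ω are in general not parallel can be made concrete with a small calculation. The sketch below, using arbitrary point masses and hypothetical helper names, builds P once from the component equations (33) and once from the defining sum, and then forms [P ω], which vanishes only when the two vectors are parallel.

```python
# Angular momentum from the inertia coefficients, equation (33),
# compared with the direct sum P = sum m_i [r_i [omega r_i]].

def cross(a, b):
    """Vector product [a b]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

masses = [1.0, 2.0, 1.5]                                   # illustrative values
points = [(1.0, 0.5, -0.2), (-0.3, 1.2, 0.8), (0.4, -1.0, 1.1)]
omega = (0.0, 0.0, 1.0)                                    # rotation about the z-axis

Ixx = sum(m * (y * y + z * z) for m, (x, y, z) in zip(masses, points))
Iyy = sum(m * (z * z + x * x) for m, (x, y, z) in zip(masses, points))
Izz = sum(m * (x * x + y * y) for m, (x, y, z) in zip(masses, points))
Ixy = sum(m * x * y for m, (x, y, z) in zip(masses, points))
Iyz = sum(m * y * z for m, (x, y, z) in zip(masses, points))
Izx = sum(m * z * x for m, (x, y, z) in zip(masses, points))

wx, wy, wz = omega
P_tensor = (Ixx * wx - Ixy * wy - Izx * wz,
            -Ixy * wx + Iyy * wy - Iyz * wz,
            -Izx * wx - Iyz * wy + Izz * wz)

P_direct = [0.0, 0.0, 0.0]
for m, r in zip(masses, points):
    term = cross(r, cross(omega, r))
    for k in range(3):
        P_direct[k] += m * term[k]

# [P omega] is the zero vector only if P is parallel to omega.
misalignment = cross(P_tensor, omega)
print(P_tensor, tuple(P_direct), misalignment)
```

For this mass distribution the z-axis is not a principal axis, so the last vector printed is not zero.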


This co-ordinate system — in which the axes of co-ordinates are the 
axes of the ellipsoid of inertia — is fixed in the body, and thus changes 
direction when the body turns. On the other hand, all the theorems 
already proved, such as the law of conservation of angular momentum 
in the absence of external torques, are based on a system of co-or- 
dinates fixed in space. For example, if in this case of P fixed in space, 
we imagine the vector P to be watched by an observer on the rotating 
body, he will observe an apparent motion of this vector. 

In mechanics, any rapidly rotating rigid body is called a top. The 
easiest case to treat is that of a top having three equal moments of 
inertia. It is called a spherical top. Naturally, the body itself need 
not be a sphere; it is only necessary that the mass be so distributed 
as to make the ellipsoid of inertia a sphere. By (35), P and ω will
have the same direction for a spherical top. For other reasons, the 
tops or gyroscopes encountered in practice are rotationally symmetric 
bodies having two of their principal moments of inertia equal, with 
the consequence that the ellipsoid of inertia is an ellipsoid of rotation. 
Tops having three different moments are very difficult to handle 
mathematically and almost never occur in engineering practice. They 
have their place, nevertheless, as models for unsymmetric polyatomic
molecules. We shall restrict our attention to the symmetric top.

Let us choose the k'-axis as the axis of symmetry of the rotor,
also called the axis of figure. This will also be the axis of figure of the
inertial ellipsoid. Call I the moment of inertia about the figure axis,
and J that about an axis normal to the figure axis. Call the ratio
I/J = θ. The numerical value of θ must be between 0 and 2, as shown
by the following reasoning: on account of the rotational symmetry,
$\int x'^2\,dm = \int y'^2\,dm$, and we have

$$I = \int (x'^2+y'^2)\,dm = 2\int x'^2\,dm, \tag{36}$$

$$J = \frac{I}{\theta} = \int (x'^2+z'^2)\,dm = \tfrac12 I + \int z'^2\,dm. \tag{37}$$

Since I/2 occurs in (37) with a positive quantity added to it, we must
have 1/2 < 1/θ < ∞, or 2 > θ > 0.
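To get a feeling for the range of θ, one may take a homogeneous right circular cylinder as a stand-in for the rotor. This is an assumption made only for illustration (the text does not specify a shape), with both axes taken through the centre of the cylinder and the standard moments ½MR² and M(R²/4 + h²/12).

```python
# theta = I/J for a homogeneous cylinder of radius R and height h,
# both axes passing through its centre (illustrative assumption).

def theta_cylinder(radius, height):
    I = 0.5 * radius ** 2                          # about the figure axis, per unit mass
    J = radius ** 2 / 4.0 + height ** 2 / 12.0     # about a transverse axis, per unit mass
    return I / J

heights = [0.01, 1.0, 10.0, 100.0]
thetas = [theta_cylinder(1.0, h) for h in heights]
for h, th in zip(heights, thetas):
    print(h, th)

# A cylinder with h = R*sqrt(3) has I = J, i.e. it behaves as a spherical top.
theta_spherical = theta_cylinder(1.0, 3.0 ** 0.5)
```

A very flat disc approaches θ = 2 and a long thin rod approaches θ = 0, in agreement with the bounds derived above.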

For a symmetrical top, k', the moment of momentum P, and the
instantaneous angular velocity vector ω all lie in one plane; for the
construction of P by means of ω, using the inertial ellipsoid, is carried
out in a meridian plane. The same result follows analytically provided
that ω can be written as the sum of a P component and a k'
component. First, if we write ω and P in component form, referred to
the set of axes fixed in the body, and if we combine the components
of ω that are normal to k' to form a vector $\boldsymbol{\omega}_n'$, we obtain

$$\boldsymbol{\omega} = \omega_{x'}\mathbf{i}' + \omega_{y'}\mathbf{j}' + \omega_{z'}\mathbf{k}' = \boldsymbol{\omega}_n' + \omega_{z'}\mathbf{k}', \tag{38}$$

$$\mathbf{P} = \frac{I}{\theta}\,\boldsymbol{\omega}_n' + I\omega_{z'}\mathbf{k}'. \tag{39}$$

Eliminating $\boldsymbol{\omega}_n'$,

$$\boldsymbol{\omega} = \frac{\theta}{I}\,\mathbf{P} - (\theta - 1)\,\omega_{z'}\mathbf{k}'. \tag{40}$$


Fig. 6 represents graphically the content of equations (38) to (40). 
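The algebra of (38) to (40) is easy to verify with numbers. In the sketch below the values of I, θ and ω are arbitrary assumptions; P is built from (39), ω is recovered from (40), and the triple product k'·[P ω] is formed to confirm that the three vectors are coplanar.

```python
# Check of equations (38)-(40) for a symmetrical top (illustrative numbers).

def cross(a, b):
    """Vector product [a b]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

I_fig, theta = 2.0, 1.25          # moment about the figure axis and the ratio I/J
k = (0.0, 0.0, 1.0)               # figure axis k'
omega = (0.4, -0.3, 5.0)          # instantaneous angular velocity

omega_z = dot(omega, k)
omega_n = tuple(w - omega_z * kk for w, kk in zip(omega, k))             # part normal to k', (38)
P = tuple((I_fig / theta) * wn + I_fig * omega_z * kk
          for wn, kk in zip(omega_n, k))                                 # equation (39)
omega_back = tuple((theta / I_fig) * p - (theta - 1.0) * omega_z * kk
                   for p, kk in zip(P, k))                               # equation (40)

triple = dot(k, cross(P, omega))   # vanishes when k', P and omega are coplanar
print(omega_back, triple)
```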


(b) Symmetrical top subject to no forces

Consider now the behaviour of a symmetrical top on which no force
moment acts. In this instance the vector P will be constant, and so
differentiation of (40) gives

$$\frac{d\boldsymbol{\omega}}{dt} = (1-\theta)\,\frac{d\omega_{z'}}{dt}\,\mathbf{k}' + (1-\theta)\,\omega_{z'}\frac{d\mathbf{k}'}{dt}.$$

But according to p. 140 the rate of change dr/dt of a radius vector r
due to rotation is given by [ω r]; and since any vector, including the
axial unit vector k', may be represented as the difference of two radius
vectors, we have

$$\frac{d\mathbf{k}'}{dt} = [\boldsymbol{\omega}\,\mathbf{k}']. \tag{41}$$

Hence

$$\frac{d\boldsymbol{\omega}}{dt} = (1-\theta)\,\frac{d\omega_{z'}}{dt}\,\mathbf{k}' + (1-\theta)\,\omega_{z'}\,[\boldsymbol{\omega}\,\mathbf{k}']. \tag{42}$$

It follows from the last relation that $\omega_{z'}$ is constant; for if we form
the scalar product of this equation with k', it follows from (41),
with $\mathbf{k}'\cdot\boldsymbol{\omega} = \omega_{z'}$, that $d\omega_{z'}/dt = 0$. However, not only $\omega_{z'}$ but also the
magnitude ω of the angular velocity is constant; for if we form the scalar product of (42)
with ω, and use the facts that $\omega_{z'}$ = const. and $\boldsymbol{\omega}\cdot[\boldsymbol{\omega}\,\mathbf{k}'] = 0$, we get

$$\boldsymbol{\omega}\cdot\frac{d\boldsymbol{\omega}}{dt} = \frac{d(\omega^2/2)}{dt} = 0. \tag{43}$$

Fig. 6

Fig. 7

But since $\omega_{z'} = \boldsymbol{\omega}\cdot\mathbf{k}'$, the angle between ω and k' must also be
constant. According to fig. 6, which is valid at any instant, the angle
formed by P and k' is also constant, and the only possibility of a
change in the vectors is for the plane of the figure to rotate. The axis
for such a rotation must be the invariable line P. The general motion
of a force-free symmetrical top is called nutation. In it, the
instantaneous axis ω describes a cone about the axis P, which is fixed in
space, while the top itself rotates about ω. On the other hand, any
motion of a rigid body about a fixed point may be represented according
to § 1 (c) by the rolling of the polhode on the herpolhode, the
instantaneous angular velocity being that of the generator common to
both cones. Thus in the present instance the herpolhode is the cone
described about the axis P by ω, while the polhode is described by the
motion of ω around k' (fig. 7). Both are right circular cones. Since
an actual gyroscope always has high speed about the figure axis, and
since the nutation arises from disturbances, as will be explained below,
the angles between k', P and ω are small because P has very nearly
the same direction as k'. However, the angular speed of rotation of
k' around the fixed axis P is comparable with ω and so is large. This
follows from the fact that the component in question is, by (40),
(θ/I)P. Taking as an approximation $|\mathbf{P}| = I\omega_{z'}$, the magnitude of
this component becomes $\theta\omega_{z'}$. In a practical case, θ does not differ
much from unity.

Example: The wandering of the earth's pole. The earth is a top whose angular
speed is small but whose moment of momentum is large, due to its tremendous
rotational inertia. The polar flattening makes θ = 301/300. Displacements of
matter cause slight irregular wandering of the position of the pole over the earth's
surface within a distance of about four metres. According to the preceding
development, the angular speed around P will amount to 2π(301/300) day⁻¹. This is
the speed with which an observer fixed in space would see the axis of the earth
rotating about P. Referring to a terrestrial observer, we must subtract the
normal daily rotation rate of 2π day⁻¹, leaving 2π/300 day⁻¹, or a period of 300
days. The observed period is 420 days, the departure being due to the
deformation of the rotating earth.
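The arithmetic of this example can be written out in a few lines. The sketch uses only the value θ = 301/300 quoted above and the rigid-earth approximation of the text; the variable names are chosen here for illustration.

```python
import math

theta = 301.0 / 300.0              # ratio I/J for the earth, as quoted in the text
spin = 2.0 * math.pi               # daily rotation, in radians per day

rate_in_space = theta * spin       # angular speed of the figure axis about P
rate_on_earth = rate_in_space - spin   # as judged by a terrestrial observer
period_days = 2.0 * math.pi / rate_on_earth

print(period_days)                 # about 300 days for a rigid earth
```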

Let us examine one more special case of the motion of a force-free
top. Suppose ω to be constant in direction as well as in magnitude.
In this case equation (42) can be satisfied only if ω and k' are parallel,
in which case (40) shows that P must be parallel to k' also. Thus, if
the axis of rotation coincides with the axis of figure, the top can con- 
tinue to rotate about this axis indefinitely. This is the origin of the 
term “ moments of deviation ” used in German works to denote the 
products of inertia; for any axis other than the figure axis these 
mixed terms do not vanish, and rotation about such an axis is accom- 
panied by “ wandering ” of the axis of figure. 


(c) Symmetrical top under the influence of external moments 


Imagine the top K to be contained in a housing G, fig. 8, which 
itself is free to move in all directions — for instance, by being allowed 
to float in a liquid. The centre of gravity of the top itself is assumed 
to coincide with the centre of the indicated sphere. Let the axis of the 
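top be again the k' axis. First of all, the top must be set into rotation
by a force moment having the direction of k'. Once the desired rate
of rotation is attained, this moment may be gradually decreased. Then,
if friction is negligible, the top will continue to run indefinitely at this
speed. Thus it is assumed that any moments thereafter applied to the
top shall not have k' components. Under such conditions $\omega_{z'}$ will be
constant whether or not external moments exist. To see this, set the
value of P from (40) into the fundamental equation (7), p. 110, obtaining

$$\mathbf{M} = \frac{d\mathbf{P}}{dt} = \frac{I}{\theta}\frac{d\boldsymbol{\omega}}{dt} + \frac{I}{\theta}(\theta-1)\frac{d\omega_{z'}}{dt}\,\mathbf{k}' + \frac{I}{\theta}(\theta-1)\,\omega_{z'}\frac{d\mathbf{k}'}{dt}. \tag{44}$$

Scalar multiplication by k' yields

$$\mathbf{k}'\cdot\mathbf{M} = \frac{I}{\theta}\,\mathbf{k}'\cdot\frac{d\boldsymbol{\omega}}{dt} + \frac{I}{\theta}(\theta-1)\frac{d\omega_{z'}}{dt} = I\,\frac{d\omega_{z'}}{dt} = 0,$$

or

$$\frac{d\omega_{z'}}{dt} = 0, \tag{45}$$

since $\mathbf{k}'\cdot d\mathbf{k}'/dt = 0$ and $\boldsymbol{\omega}\cdot d\mathbf{k}'/dt = 0$.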

dt dt 

We assume now that the housing rotates with angular velocity Ω,
and inquire as to the moment needed to produce this rotation. Since
ω is referred to a fixed co-ordinate system rather than to one connected
with the housing (and hence rotating), one would assume off-hand that
ω would be the vector sum of Ω and $\omega_{z'}\mathbf{k}'$. However, since the absence
of friction in the bearings prevents the k' component of Ω from
affecting the top, we must put

$$\boldsymbol{\omega} = \boldsymbol{\Omega} + (\omega_{z'} - \Omega_{z'})\,\mathbf{k}' = \boldsymbol{\Omega}_n' + \omega_{z'}\mathbf{k}', \tag{46}$$

where $\boldsymbol{\Omega}_n'$ denotes the component of Ω that is normal to k'. Only by
assuming a relation of this form can we ensure that $\mathbf{k}'\cdot\boldsymbol{\omega} = \omega_{z'}$.

Fig. 8

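Now, let the total change in angular momentum be resolved into
two components, one of which represents a uniform rotation of the
housing. Remembering (45) and substituting into (44) the expression
for ω given by (46), we get

$$\mathbf{M} = \frac{I}{\theta}\frac{d\boldsymbol{\Omega}_n'}{dt} + \frac{I}{\theta}\,\omega_{z'}\frac{d\mathbf{k}'}{dt} + \frac{I}{\theta}(\theta-1)\,\omega_{z'}\frac{d\mathbf{k}'}{dt} = \frac{I}{\theta}\frac{d\boldsymbol{\Omega}_n'}{dt} + I\omega_{z'}\frac{d\mathbf{k}'}{dt}. \tag{47}$$

On account of

$$\frac{d\mathbf{k}'}{dt} = [\boldsymbol{\Omega}\,\mathbf{k}'] = [\boldsymbol{\Omega}_n'\,\mathbf{k}'],$$

we obtain these important forms for the fundamental equation:

$$\mathbf{M} = \frac{I}{\theta}\frac{d\boldsymbol{\Omega}_n'}{dt} + I\omega_{z'}\frac{d\mathbf{k}'}{dt} = \frac{I}{\theta}\frac{d\boldsymbol{\Omega}_n'}{dt} + I\omega_{z'}\,[\boldsymbol{\Omega}_n'\,\mathbf{k}']. \tag{48}$$

For any practical gyroscope, $I\omega_{z'}$ represents the main part of the
angular momentum; let this quantity be denoted by P.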

Consider the following special cases: 


(1) Uniform rotation of the housing about a fixed axis. 

This type of motion of the housing, and hence of the axis of the 
top, is much more important than the nutation described above. It 
is called precession, and is the most obvious of all gyroscopic phenomena.
Assume a co-ordinate system i'', j'', k'' connected with the
housing, and let the k''-axis coincide with the axis k' of the rotor
(fig. 8). Since the component of Ω parallel to k' does not contribute to
the vector product [Ω k'], equations (48) become for this case

$$\mathbf{M} = P\,\frac{d\mathbf{k}'}{dt} = P\,[\boldsymbol{\Omega}\,\mathbf{k}']. \tag{48'}$$

The change dk'/dt in the unit vector k' must be normal to k'. Further,
k' rotates as time passes, so that the applied moment M must have
this same rotation. This will be true, in the case of a force having a
constant direction in space, if its line of action does not pass exactly
through the centre of gravity of the top. For example, as in fig. 8, let
a mass m be suspended from the housing at one end of the j''-axis, the
axis of the rotor being in an inclined position. Denoting the unit vector
of gravity by $\mathbf{g}_0$ and writing δ for the angle between $\mathbf{g}_0$ and j'' (which
is the same as the angle of inclination of the rotor axis), we have

$$\mathbf{M} = mga\,[\mathbf{j}''\,\mathbf{g}_0] = -mga\,(\sin\delta)\,\mathbf{i}''; \tag{49}$$

i.e. the moment always remains perpendicular to k'' (or to k').

For a given moment, equation (48') in itself yields only the magnitude
of the component of the angular speed that is normal to k'. The
direction of the axis about which the housing continuously turns is
obtained as follows:

Since dk'/dt lies in the direction of M, then according to (48') and
(49), dk'/dt must be normal to $\mathbf{g}_0$; hence $(d\mathbf{k}'/dt)\cdot\mathbf{g}_0 = 0$. Integration
yields

$$\mathbf{k}'\cdot\mathbf{g}_0 = \cos(\pi/2 - \delta) = \text{const.} \tag{50}$$

Thus the axis of the rotor must turn in such a way as to maintain a
constant angle with the direction of gravity, i.e. the axis must describe
a cone about the vertical. The magnitude of the precessional
angular velocity is, by (48') and (49),

$$\Omega = \frac{\Omega_n'}{\cos\delta} = \frac{M/P}{\cos\delta} = \frac{mga\,\tan\delta}{P}. \tag{51}$$

A gyroscope, then, does not yield to gravity as does a non-rotating
body; it “turns aside at a right angle”. While section (b) showed
that the aperture of the cone of nutation remains small for small
disturbances, the aperture of the cone of precession is determined a priori.
Even when the applied moment is small, an easily perceived change in
direction of the axis soon results. It is only the speed of precession
that becomes small as the moment is reduced.
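A short numerical sketch may help to fix the orders of magnitude in (48') to (51). The mass, lever arm, moment of inertia, spin and inclination below are hypothetical values chosen for illustration, and the co-ordinate choice (z vertical, rotor axis in the x-z plane) is likewise an assumption of the example.

```python
import math

def cross(a, b):
    """Vector product [a b]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

# Hypothetical gyroscope data (SI units).
m, g, a = 0.05, 9.81, 0.10        # suspended mass, gravity, lever arm
I_fig = 2.0e-3                    # moment of inertia about the figure axis
omega_z = 300.0                   # spin about the figure axis, rad/s
delta = math.radians(20.0)        # inclination of the rotor axis

P = I_fig * omega_z                       # main part of the angular momentum
M = m * g * a * math.sin(delta)           # magnitude of the moment, equation (49)
Omega_n = M / P                           # component of Omega normal to k', from (48')
Omega = Omega_n / math.cos(delta)         # precessional angular velocity, equation (51)

# Vector check of (48'): Omega along the vertical, k' inclined by delta
# so that k'.g0 = sin(delta) with g0 = (0, 0, -1), as in equation (50).
k_axis = (math.cos(delta), 0.0, -math.sin(delta))
Omega_vec = (0.0, 0.0, Omega)
torque = tuple(P * c for c in cross(Omega_vec, k_axis))
torque_magnitude = math.sqrt(sum(c * c for c in torque))

print(Omega, torque_magnitude, M)
```

Only magnitudes are compared here; the sense of the precession depends on the sense of the spin and is not addressed by the sketch.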


(2) Nutation of a force-free top produced by impulses. 

Let a sudden moment M be applied at time t = 0 to an otherwise
force-free top rotating about its axis of figure, and let this moment
continue to act for Δt seconds. Denoting the time integral of the
moment by N, integration of equation (48) with respect to the time
gives

$$\mathbf{N} = \int_0^{\Delta t}\mathbf{M}\,dt = \frac{I}{\theta}\,\boldsymbol{\Omega}_n' + I\omega_{z'}\int_0^{\Delta t}\frac{d\mathbf{k}'}{dt}\,dt = \frac{I}{\theta}\,\boldsymbol{\Omega}_n' + I\omega_{z'}\,\Delta\mathbf{k}'. \tag{52}$$

For any impulse, the change in velocity in a short time Δt is considerable,
while the change in position is small to a higher order. For this reason,
Δk' may be neglected. Hence, after impact, a new component of rotation
normal to the figure axis and of amount $\Omega_n' = N\theta/I$ is present.
This gives rise to the case of nutation of a force-free top discussed in
(b), the quantity $\boldsymbol{\Omega}_n'$ here corresponding to $\boldsymbol{\omega}_n'$ there. Apart from
external impacts, internal rearrangements of matter can also act to
deflect the axis of P from the ω direction [cf. (b) above].

Precession and nutation may exist at the same time, as when a 
precessing top is subjected to external impulses, or when shifting of 
part of the mass takes place. An impulse also must be considered to 
operate at the instant when the moment responsible for precession 
is suddenly applied, for a short-lived force may be represented by 
sudden application of a steady force, followed immediately by the 
application of the reaction force. Since the cone of nutation always 
has a small aperture, the nutation has only the effect of producing a 
corrugation of the surface of the cone of precession. This is usually 
negligible in technical applications. 

(3) Example of the precession of the axis of the earth. 

As a result of the polar flattening of the earth and the inclination of its axis
(“obliquity of the ecliptic”) the attraction of the sun gives rise to a turning
moment when the earth is near either end of its orbit (fig. 9a). This moment,
which seeks to bring the equator closer to the plane of the orbit, vanishes at the
equinoctial positions. However, since the angular momentum of the earth is so
large, we know that the precession will be very slow and an average moment may
be assumed in computing the actual precession. To be added to this is the average
moment due to the moon's attraction. Because of the smaller distance, the latter
moment is of the same order of magnitude as that of the sun. The averaging is
best done by assuming the masses of sun and moon distributed uniformly along
the orbits they appear to have as seen from the earth. For this purpose it is
sufficient to assume circular orbits and to take their planes to be coincident.
As seen from fig. 9b, the gravitational moment seeks to bring the earth's axis
into coincidence with the normal to the orbit plane, so that the vector representing
this moment must be perpendicular to the plane containing the earth's axis
and the normal. Hence this vector turns in the orbit plane with the precessional
motion, just as we saw above that the gravity moment vector acting on the
gyroscope turned in the horizontal plane. As in that instance, the earth's axis
describes a cone about the normal to the orbit plane. The numerical value of the
period of precession turns out to be about 27,000 years.

Due to the fluctuating value of the moment, there is a resulting astronomical
nutation. This has no connexion with the wandering of the pole, which was
explained above in terms of internal shifting of terrestrial matter.

(4) Example of the gyrocompass.

Suppose a gyroscope to be mounted in a frame capable of turning only about
a vertical axis a, fig. 10. If the earth did not rotate, the gyroscope would maintain
the direction of its axis of rotation, for then it would not be subject to any force
moment. On account of the earth's rotation around k with angular speed ω, the
mounting, assumed for the present to be fixed to the earth, will be turned,
and according to the third law, the rotor will exert a moment M' on the mounting.
This moment is opposite to that which the turning of the axis would produce
and so by (48') it will amount to

$$\mathbf{M}' = -P\omega\,[\mathbf{k}\,\mathbf{k}']. \tag{53}$$

For an arbitrary position of the axis in a horizontal plane, this moment will have
a vertical component as well as a horizontal one. The former, according to
§ 3, produces rotation about a, while the latter will simply be borne by the bearing
L, because rotation about a horizontal axis is not possible. However, if the
axis of the top lies in the meridian plane determined by the vertical and the
earth's axis (position k of the latter), then the vector M' will be horizontal. Since
it has no vertical component, the frame will not turn about a. In other words,
the north-south position is one of equilibrium. The stable one of these two
directions is that having the smaller angle between k and k'. Because of the smallness
of ω, the acting moment is so small that it is able to overcome friction in the
bearings only with great difficulty. The actual set-up resembles fig. 8. To
compensate the moment exerted by the rotor on the housing, one need only attach to
the latter a weight m at a point in the plane normal to the rotor axis. In order
that this weight may exert a moment, the rotor axis must not be exactly horizontal,
but must make a small angle δ with this direction. Then, using the notation of
fig. 10,

$$mga\,[\mathbf{j}''\,\mathbf{g}_0] + \mathbf{M}' = mga\,[\mathbf{j}''\,\mathbf{g}_0] - P\omega\,[\mathbf{k}\,\mathbf{k}'] = 0. \tag{54}$$

Since a product vector is normal to the plane of its two factors, equality of the
products signifies that the vectors k, k', j'' and $\mathbf{g}_0$ are coplanar, i.e. the rotor axis
must again lie in the meridian plane when equilibrium is attained.

Fig. 10

Ex. 52. In the arrangement just described, compute the inclination of the
axis of the top as a function of the geographical latitude φ and the mass m of the
counterweight. Remember that for any given location the altitude of the pole
is equal to the latitude.

Ex. 53. A projectile fired from a rifled gun is given a spin about its axis; 
the projection of the trajectory upon a horizontal plane deviates from a straight 
line. How may this be explained, qualitatively, and what is the direction of 
deviation for a shell having right-handed spin, i.e. clockwise rotation when look- 
ing along the tangent to the orbit? Air resistance tends to raise the nose of the 
projectile. 


CHAPTER VIII 

Elasticity: The Mechanics of Deformable Solids 

1. The Geometry of Small Displacements. 

The ideal rigid body discussed in the previous chapter represents 
an abstraction, as is well known. Actually, the parts of a physical 
solid body, e.g. a bar of iron, are not absolutely rigidly interconnected, 
but suffer small displacements relative to each other under the in- 
fluence of forces. These deformations are, in general, so small that they 
can be neglected in a large number of cases. On the other hand, the 
phenomena governed by elastic properties, e.g. the vibrations of a 
spring, are often of great importance, so that elastic processes offer 
a wide field for discussion in mathematical physics, a field which has 
been extensively cultivated, for practical reasons. We can take up 
here only a few of the chief problems and their solutions. 

In the light of our present concepts regarding the structure of 
matter, we can understand, in many cases, that considerable forces 
are necessary to deform a solid body. In the absence of external forces, 
the solid is in a state of internal equilibrium, in which its fundamental 
constituents (atoms or molecules) are at such distances apart that the
forces between them are in equilibrium. Every change of these dis- 
tances, caused by external forces, immediately causes the attractive 
or repulsive forces to preponderate; their net result, which is either 
an internal pressure or tension, balances the external forces. We may 
construct a crude model by imagining the elementary particles to be 
connected by spiral springs. This model shows us how a local de- 
formation must be transmitted to all parts of the body. The mathe- 
matical computation of the deformation resulting from given forces 
is a problem in Elastostatics, while Elastodynamics deals with the
changes of elastic disturbances in time. For example, if, in the above 
model, we displace several “ atoms ” from their rest positions and 
then release them, they will move beyond those points in returning; 
and this vibration will spread throughout the whole body, on account 
of the connexions between the atoms. However, for purposes of cal- 
culation we shall not make use of the atomic picture, but shall treat 
the phenomena from a macroscopic standpoint, assuming that the space 
is continuously filled with matter. This means that the portions of
space considered, and also their volume increments, are so large that
they contain a great number of atoms.* 

We now wish to investigate the geometric-kinematic relationships
for infinitely small deformations or strains. For this purpose we consider
a region around a point $P_0$, whose dimensions are so small that
the squares of the distances of all its points P from $P_0$ may be neglected.
On account of the strain, let each point experience a displacement
s = u i + v j + w k, which we likewise consider to be small. Moreover,
let the displacement vary so little from place to place that we
are justified in dropping products of the derivatives ∂u/∂x, ∂u/∂y,
∂u/∂z, ∂v/∂x, .... If the point $P_0$ experiences a displacement $\mathbf{s}_0$, then
another point P, whose position relative to $P_0$ is given by the radius
vector r drawn from $P_0$, undergoes a displacement

$$\mathbf{s} = \mathbf{s}_0 + \mathbf{r}\operatorname{grad}\cdot\,\mathbf{s}. \tag{1}$$

If we denote the components of s by u, v, w, the above equation may
be written

$$\begin{aligned}
u &= u_0 + \mathbf{r}\operatorname{grad}u = u_0 + \Bigl(\frac{\partial u}{\partial x}\Bigr)_0 x + \Bigl(\frac{\partial u}{\partial y}\Bigr)_0 y + \Bigl(\frac{\partial u}{\partial z}\Bigr)_0 z,\\
v &= v_0 + \mathbf{r}\operatorname{grad}v = v_0 + \Bigl(\frac{\partial v}{\partial x}\Bigr)_0 x + \Bigl(\frac{\partial v}{\partial y}\Bigr)_0 y + \Bigl(\frac{\partial v}{\partial z}\Bigr)_0 z,\\
w &= w_0 + \mathbf{r}\operatorname{grad}w = w_0 + \Bigl(\frac{\partial w}{\partial x}\Bigr)_0 x + \Bigl(\frac{\partial w}{\partial y}\Bigr)_0 y + \Bigl(\frac{\partial w}{\partial z}\Bigr)_0 z.
\end{aligned} \tag{2}$$

It is intended that the symbol $(\;)_0$
should express the fact that, on account of the general hypothesis
concerning the order of magnitude of the displacement components and
of the volume under consideration, we may use the values of the
derivatives at $P_0$ in carrying out the computation. Naturally, we rule
out discontinuities, such as fractures or cracks in the body.

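* See pp. 2, 3.

Our equations give a linear relationship between the components
of s and those of r. More important, however, is a related quantity:
the connexion between an initial vector $\mathbf{r} = \overrightarrow{P_0P}$ and the corresponding
vector $\mathbf{r}' = \overrightarrow{P_0'P'}$ after displacement. This connexion may
be looked upon as a mathematical transformation of the space
about $P_0$, determined by the deformation. From fig. 1 it follows that

$$\mathbf{r}' = \mathbf{r} + (\mathbf{s} - \mathbf{s}_0) = \mathbf{r} + \mathbf{r}\operatorname{grad}\cdot\,\mathbf{s}\quad\text{or}\quad \mathbf{r}' - \mathbf{r} = \mathbf{r}\operatorname{grad}\cdot\,\mathbf{s}, \tag{3}$$

or, in components,*

$$\begin{aligned}
x' &= x\Bigl(1 + \frac{\partial u}{\partial x}\Bigr) + y\,\frac{\partial u}{\partial y} + z\,\frac{\partial u}{\partial z},\\
y' &= x\,\frac{\partial v}{\partial x} + y\Bigl(1 + \frac{\partial v}{\partial y}\Bigr) + z\,\frac{\partial v}{\partial z},\\
z' &= x\,\frac{\partial w}{\partial x} + y\,\frac{\partial w}{\partial y} + z\Bigl(1 + \frac{\partial w}{\partial z}\Bigr).
\end{aligned} \tag{4}$$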

Thus this transformation, also, is linear. Since the conversion of r
to r' is independent of the co-ordinate system we select, the
coefficients of (4) represent the nine components of a tensor.

The correspondence between the space about $P_0$ and that about $P_0'$
is that of an affine transformation of space, where, by definition,
straight lines are transformed into straight lines, and thus planes
pass over into planes, and parallelism is retained. This is immediately
evident in the vector form: A set of parallel lines in the direction of t
may be represented by means of a variable parameter λ in the form

$$\mathbf{r} = \mathbf{r}_i + \lambda\mathbf{t}, \tag{5}$$

so that one straight line of the family corresponds to each $\mathbf{r}_i$. If this
relationship is substituted in (3), we obtain

$$\mathbf{r}' = \mathbf{r}_i + \lambda\mathbf{t} + (\mathbf{r}_i + \lambda\mathbf{t})\operatorname{grad}\cdot\,\mathbf{s} = \mathbf{r}_i + \mathbf{r}_i\operatorname{grad}\cdot\,\mathbf{s} + \lambda\,(\mathbf{t} + \mathbf{t}\operatorname{grad}\cdot\,\mathbf{s}). \tag{6}$$

This, however, is again the equation of a family of parallel lines with

$$\mathbf{r}_i' = \mathbf{r}_i + \mathbf{r}_i\operatorname{grad}\cdot\,\mathbf{s},\qquad \mathbf{t}' = \mathbf{t} + \mathbf{t}\operatorname{grad}\cdot\,\mathbf{s}. \tag{7}$$
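The preservation of parallelism expressed by (5) to (7) can be illustrated with a constant matrix of displacement derivatives. The matrix G, the starting points and the direction t in this sketch are arbitrary assumed values; G[i][j] stands for the derivative of the i-th displacement component with respect to the j-th co-ordinate.

```python
# Two parallel lines r = r_i + lambda t are mapped by (4) into two lines
# that are again parallel, with common direction t' = t + t grad . s.

def cross(a, b):
    """Vector product [a b]."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def apply_gradient(G, r):
    """r' = r + r grad . s, i.e. equation (4) with constant derivatives."""
    return tuple(r[i] + sum(G[i][j] * r[j] for j in range(3)) for i in range(3))

G = [[0.010, 0.002, -0.003],     # small illustrative displacement derivatives
     [0.004, -0.020, 0.001],
     [0.000, 0.005, 0.015]]

t = (1.0, 2.0, -1.0)                           # common direction of the family
starts = [(0.0, 0.0, 0.0), (0.3, -0.1, 0.2)]   # two lines of the family
lam = 1.5

directions = []
for r_i in starts:
    p0 = apply_gradient(G, r_i)
    p1 = apply_gradient(G, tuple(r_i[k] + lam * t[k] for k in range(3)))
    directions.append(tuple((p1[k] - p0[k]) / lam for k in range(3)))

t_new = apply_gradient(G, t)                          # t' of equation (7)
misalignment = cross(directions[0], directions[1])    # zero vector if still parallel
print(directions, t_new, misalignment)
```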

* For simplicity, the subscripts of the derivatives are now omitted.

The transformation (4) may include still another part, which represents the rotation of the neighbouring space as a rigid entity. In this case, the vector $\mathbf{r}$ is altered, as before, but no internal elastic forces arise, since there is no actual deformation of the volume under consideration. If we wish to find the forces attributable to changes in the distances between particles, we must deduct from (4) the part concerned with pure rotation. This may be done in the following manner:

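On p. 140 it was shown that the velocity of any point of a rigid body rotating with angular velocity $\boldsymbol{\omega}$ is given by

$$\mathbf{v} = [\boldsymbol{\omega}\,\mathbf{r}]. \tag{8}$$

Forming the curl, we get

$$\operatorname{curl}\mathbf{v} = [\nabla[\boldsymbol{\omega}\,\mathbf{r}]] = \boldsymbol{\omega}\,(\nabla\cdot\mathbf{r}) - (\boldsymbol{\omega}\cdot\nabla)\,\mathbf{r} = \boldsymbol{\omega}\operatorname{div}\mathbf{r} - (\boldsymbol{\omega}\cdot\operatorname{grad})\,\mathbf{r} = 3\boldsymbol{\omega} - \boldsymbol{\omega} = 2\boldsymbol{\omega}. \tag{9}$$

Thus, for a given velocity field, $\operatorname{curl}\mathbf{v}$ represents a vector equal to twice the angular velocity vector.*

If a rigid body is thought of as rotating for a short time $dt$, it is found that the change in the radius vector $\mathbf{r}$† caused by this rotation is

$$d_{\mathrm{rot}}\mathbf{r} = [\boldsymbol{\omega}\,\mathbf{r}]\,dt = \tfrac{1}{2}[\operatorname{curl}\mathbf{v}\,dt,\ \mathbf{r}] = \tfrac{1}{2}[\operatorname{curl}\mathbf{s},\ \mathbf{r}]. \tag{10}$$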


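* This is still true if the whole body is subject, in addition, to a translation $\mathbf{v}_t$, for since $\mathbf{v}_t$ is the same for all particles, it drops out in forming the curl.

† Actually, one should write $\operatorname{curl} d\mathbf{s}$, but according to the original assumptions all displacements are assumed to be small quantities.

The difference between the vector $\mathbf{r} + d_{\mathrm{rot}}\mathbf{r}$ and the vector $\mathbf{r}'$ as specified by (3) represents the change in $\mathbf{r}$ due to pure deformation, which we denote by $d_{\mathrm{def}}\mathbf{r}$:

$$d_{\mathrm{def}}\mathbf{r} = \mathbf{r}' - \mathbf{r} - d_{\mathrm{rot}}\mathbf{r}. \tag{11}$$

From the deformation alone we then have the relation

$$\mathbf{r}'' = \mathbf{r} + d_{\mathrm{def}}\mathbf{r} = \mathbf{r} + (\mathbf{r}\cdot\operatorname{grad})\,\mathbf{s} - \tfrac{1}{2}[\operatorname{curl}\mathbf{s},\ \mathbf{r}], \tag{12}$$

or, in Cartesian form,

$$\begin{aligned}
x'' &= \left(1 + \frac{\partial u}{\partial x}\right)x + \frac{1}{2}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right)y + \frac{1}{2}\left(\frac{\partial u}{\partial z} + \frac{\partial w}{\partial x}\right)z,\\
y'' &= \frac{1}{2}\left(\frac{\partial v}{\partial x} + \frac{\partial u}{\partial y}\right)x + \left(1 + \frac{\partial v}{\partial y}\right)y + \frac{1}{2}\left(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\right)z,\\
z'' &= \frac{1}{2}\left(\frac{\partial w}{\partial x} + \frac{\partial u}{\partial z}\right)x + \frac{1}{2}\left(\frac{\partial w}{\partial y} + \frac{\partial v}{\partial z}\right)y + \left(1 + \frac{\partial w}{\partial z}\right)z.
\end{aligned} \tag{12a}$$

The coefficients in (12a) now form a symmetric tensor, which may be visualized by means of an ellipsoid, as at p. 37. This ellipsoid, which has the equation

$$\left(1 + \frac{\partial u}{\partial x}\right)x^2 + \left(1 + \frac{\partial v}{\partial y}\right)y^2 + \left(1 + \frac{\partial w}{\partial z}\right)z^2 + \left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right)xy + \left(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\right)yz + \left(\frac{\partial w}{\partial x} + \frac{\partial u}{\partial z}\right)zx = 1, \tag{13}$$

may be called the ellipsoid of dilatation, or ellipsoid of strain. Its principal axes are called the principal axes of strain. As shown at p. 37, the direction of $\mathbf{r}''$ is that of the normal to the ellipsoid drawn through the end point of $\mathbf{r}$. The two vectors have the same direction for the ends of the principal axes. This means that only a segment having this direction maintains its original direction. Only those segments lying in the directions of the axes of the strain ellipsoid experience pure extension without change of direction.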
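The splitting of the transformation into a rigid rotation and a pure deformation, equations (10) to (12a), can be checked numerically. The following is a minimal sketch, not part of the original text: the displacement-gradient values in `G` and the vector `r` are arbitrary illustrative numbers, and the function names are hypothetical.

```python
def split_displacement_gradient(G):
    """G[i][j] = d s_i / d x_j. Returns (strain, rotation) with G = strain + rotation."""
    strain = [[0.5 * (G[i][j] + G[j][i]) for j in range(3)] for i in range(3)]
    rotation = [[0.5 * (G[i][j] - G[j][i]) for j in range(3)] for i in range(3)]
    return strain, rotation

def matvec(M, r):
    return [sum(M[i][j] * r[j] for j in range(3)) for i in range(3)]

def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def curl_from_gradient(G):
    # curl s = (dw/dy - dv/dz, du/dz - dw/dx, dv/dx - du/dy)
    return [G[2][1] - G[1][2], G[0][2] - G[2][0], G[1][0] - G[0][1]]

# Illustrative small displacement gradient (assumed values) and a radius vector.
G = [[1e-3, 4e-4, -2e-4],
     [-1e-4, 2e-3, 3e-4],
     [5e-4, -3e-4, -1e-3]]
r = [0.3, -0.2, 0.5]

strain, rotation = split_displacement_gradient(G)
d_rot = [0.5 * c for c in cross(curl_from_gradient(G), r)]  # equation (10)
d_def = matvec(strain, r)                                   # r'' - r from (12a)
total = matvec(G, r)                                        # r' - r from (3)
```

The antisymmetric part of the gradient reproduces ½[curl s, r], and the symmetric part gives the coefficients of (12a) less the unit terms.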

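The coefficients

$$T_{11} = 1 + \frac{\partial u}{\partial x},\quad T_{22} = 1 + \frac{\partial v}{\partial y},\quad T_{33} = 1 + \frac{\partial w}{\partial z},$$
$$T_{12} = T_{21} = \frac{1}{2}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right),\quad T_{23} = T_{32} = \frac{1}{2}\left(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\right),\quad T_{31} = T_{13} = \frac{1}{2}\left(\frac{\partial w}{\partial x} + \frac{\partial u}{\partial z}\right)$$

have a definite physical meaning: if we consider the termini of the unit vectors along the axes, for the undeformed body, then, corresponding to the vector $\mathbf{i}$, we have, by equations (12a),

$$\mathbf{r}_i'' = \left(1 + \frac{\partial u}{\partial x}\right)\mathbf{i} + \frac{1}{2}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right)\mathbf{j} + \frac{1}{2}\left(\frac{\partial u}{\partial z} + \frac{\partial w}{\partial x}\right)\mathbf{k}, \tag{14}$$

and for the vector $\mathbf{j}$,

$$\mathbf{r}_j'' = \frac{1}{2}\left(\frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}\right)\mathbf{i} + \left(1 + \frac{\partial v}{\partial y}\right)\mathbf{j} + \frac{1}{2}\left(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\right)\mathbf{k}. \tag{15}$$

Neglecting all terms which may be considered small according to the assumptions made above, we have

$$\frac{|\mathbf{r}_i''| - |\mathbf{i}|}{|\mathbf{i}|} = \sqrt{\mathbf{r}_i''^{\,2}} - 1 = \frac{\partial u}{\partial x}, \qquad \frac{|\mathbf{r}_j''| - |\mathbf{j}|}{|\mathbf{j}|} = \frac{\partial v}{\partial y}. \tag{16}$$

This means that the quantities

$$\frac{\partial u}{\partial x},\quad \frac{\partial v}{\partial y},\quad \frac{\partial w}{\partial z} \tag{17}$$

give the relative changes in length of segments lying in the x-, y-, and z-directions respectively.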

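In the undeformed state of the body, the vectors $\mathbf{i}$ and $\mathbf{j}$ are perpendicular to each other; after deformation, they form an angle which we may designate by $\pi/2 - \delta_{xy}$, since it differs but little from $\pi/2$. Then,

$$\cos\left(\frac{\pi}{2} - \delta_{xy}\right) = \frac{(\mathbf{r}_i''\,\mathbf{r}_j'')}{|\mathbf{r}_i''|\,|\mathbf{r}_j''|}. \tag{18}$$

If we use the values of $\mathbf{r}_i''$ and $\mathbf{r}_j''$ from equations (14) and (15), and if we neglect all products of the derivatives, we obtain

$$\sin\delta_{xy} \approx \delta_{xy} = \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}. \tag{19}$$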

Thus the tensor component $2T_{12}$ signifies the change in angle, between two segments having the directions of the x- and y-axes, resulting from the deformation. Similar results hold for the other components.

If the axes have the directions of the principal axes of the tensor ellipsoid, then the terms $T_{12}$, $T_{23}$, $T_{31}$ are absent from the co-ordinate representation. This means that the axes of the strain ellipsoid remain perpendicular to each other, which we already inferred above from the fact that segments in these directions suffer pure contraction or expansion.

To calculate the change in volume per unit volume (the so-called cubical dilatation), we need only find the difference in the volume of the parallelepiped formed by the vectors $\mathbf{i}$, $\mathbf{j}$, $\mathbf{k}$, before and after deformation, i.e. the difference $[\mathbf{r}_i''\,\mathbf{r}_j''\,\mathbf{r}_k''] - 1$. On writing out the determinant we see immediately that this expression, to a first approximation as before, gives

$$\frac{V'' - V}{V} = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} + \frac{\partial w}{\partial z} = \operatorname{div}\mathbf{s}. \tag{20}$$
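The first-order character of (20) can be illustrated numerically. The sketch below, with arbitrary assumed derivative values and hypothetical function names, compares the exact determinant formed from the coefficients of (12a) with the divergence of the displacement.

```python
def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))

def dilatation_exact(G):
    """[r_i'' r_j'' r_k''] - 1, using the symmetric coefficients of (12a)."""
    T = [[(1.0 if i == j else 0.0) + 0.5 * (G[i][j] + G[j][i]) for j in range(3)]
         for i in range(3)]
    return det3(T) - 1.0

def dilatation_linear(G):
    """div s = du/dx + dv/dy + dw/dz, equation (20)."""
    return G[0][0] + G[1][1] + G[2][2]

# Assumed small displacement gradient, G[i][j] = d s_i / d x_j.
G = [[1e-3, 4e-4, -2e-4],
     [-1e-4, 2e-3, 3e-4],
     [5e-4, -3e-4, -1e-3]]

exact = dilatation_exact(G)
linear = dilatation_linear(G)
```

For derivatives of order 10⁻³ the two values differ only in terms of order 10⁻⁶, i.e. in the neglected products of derivatives.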


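It is more convenient, for what follows, to consider the strains themselves, i.e. the components of the vector $\mathbf{r}'' - \mathbf{r}$. These are obtained at once from equations (12a) (p. 164) by bringing the components of $\mathbf{r}$ over to the left side. Introduce the following notation for the derivatives and their combinations, the physical meaning of which was given above:

$$\begin{aligned}
e_{11} &= \frac{\partial u}{\partial x}, & e_{22} &= \frac{\partial v}{\partial y}, & e_{33} &= \frac{\partial w}{\partial z},\\
e_{12} = e_{21} &= \frac{\partial u}{\partial y} + \frac{\partial v}{\partial x}, & e_{23} = e_{32} &= \frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}, & e_{31} = e_{13} &= \frac{\partial w}{\partial x} + \frac{\partial u}{\partial z}.
\end{aligned} \tag{21}$$

We then may write, by (12a),

$$\begin{aligned}
x'' - x &= e_{11}x + \tfrac{1}{2}e_{12}y + \tfrac{1}{2}e_{13}z,\\
y'' - y &= \tfrac{1}{2}e_{21}x + e_{22}y + \tfrac{1}{2}e_{23}z,\\
z'' - z &= \tfrac{1}{2}e_{31}x + \tfrac{1}{2}e_{32}y + e_{33}z.
\end{aligned} \tag{22}$$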


The tensor ellipsoid corresponding to this symmetric vector function has the same axial directions as the ellipsoid of strain (13) (p. 165). This may be shown readily by calculation, but also can be seen at once in this way: if the vector $\mathbf{r}''$ is to have the direction of $\mathbf{r}$ for the axis, then this must also be true for the difference vector $\mathbf{r}'' - \mathbf{r}$. The mixed tensor components vanish when referred to the principal axes. Denoting by accents the co-ordinates referred to this system, we have remaining,

$$(x'' - x)' = e_1 x', \qquad (y'' - y)' = e_2 y', \qquad (z'' - z)' = e_3 z'. \tag{23}$$

The quantities $e_1$, $e_2$, $e_3$ are called the principal extensions.

Ex. 54. Compute the relative change in length of a segment whose direction cosines, referred to the axes of the ellipsoid of dilatation, are $\alpha_1$, $\alpha_2$, $\alpha_3$.


2. State of Stress of a Body under Strain. 

If we consider an element of volume $d\tau$, of mass $dm$, in the interior of a strained elastic body, we recognize that two kinds of force are operating. First, there are forces like that of gravity, applied to all the individual constituents, whether they be in the interior of the body or on the bounding surfaces. If the body is being accelerated, the "inertial resistance"

$$-\,dm\,\frac{d^2\mathbf{s}}{dt^2}$$

is also a force of this type. But, in addition to forces of this kind, there are forces operating on the surface particles which are due to the presence of the contiguous particles. These correspond to the springs in the model described on p. 161. Since these forces operate between adjacent particles on either side of the bounding surface, they are not proportional to the volume of the element, but to the area of its surface. The value of the force acting upon an element of surface, referred to unit area, is called the unit stress, or simply the stress, $\mathbf{P}$. The direction of this force need not by any means be normal to the surface; in general it is composed of a normal stress (pressure or tension) and a tangential (or shearing) stress. We can show, however, that the stress $\mathbf{P}$ on any surface element may be found in terms of the normal direction, provided that the stresses are known at this point for three mutually perpendicular directions, which we choose as axes of a rectangular co-ordinate system.

Thus, we take the volume element to be a small tetrahedron, of which three faces, $dS_x$, $dS_y$, $dS_z$, lie in the co-ordinate planes, while the fourth, $dS$, has the outward directed normal $\mathbf{n}$ (fig. 2). Let the stress on $dS$ be $\mathbf{P}$. In addition, call the stress acting on the surface whose normal is along the x-axis $\mathbf{P}_1$, and the stresses on the faces perpendicular to the y- and z-axes $\mathbf{P}_2$ and $\mathbf{P}_3$ respectively. The outward normals belonging to the faces which lie in the co-ordinate planes have the directions of the negative x-, y- and z-axes. The stresses $\mathbf{P}_1$, $\mathbf{P}_2$, $\mathbf{P}_3$ therefore denote the stresses exerted by the faces of the tetrahedron on the adjacent surface elements, whose normals are along the positive axes. According to Newton's Third Law, the stresses exerted by the neighbouring surfaces on the tetrahedron are then $-\mathbf{P}_1$, $-\mathbf{P}_2$, $-\mathbf{P}_3$. If we now formulate the conditions of equilibrium for our infinitely small tetrahedron, we need consider only the forces proportional to the areas, since the forces proportional to the volume vanish to a higher order as we diminish the dimensions of the element. Then the first condition of equilibrium, the vanishing of the resultant force, is

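$$\mathbf{P}\,dS - \mathbf{P}_1\,dS_x - \mathbf{P}_2\,dS_y - \mathbf{P}_3\,dS_z = 0. \tag{24}$$

Now

$$\mathbf{n}\,dS = \mathbf{i}\,dS_x + \mathbf{j}\,dS_y + \mathbf{k}\,dS_z, \tag{25}$$

that is,

$$dS_x = dS\,(\mathbf{n}\,\mathbf{i}) = dS\cos\alpha; \qquad dS_y = dS\cos\beta; \qquad dS_z = dS\cos\gamma, \tag{26}$$

whence

$$\mathbf{P} = \mathbf{P}_1\cos\alpha + \mathbf{P}_2\cos\beta + \mathbf{P}_3\cos\gamma. \tag{27}$$

Denoting the x-component of $\mathbf{P}_1$ by $P_{11}$, that of $\mathbf{P}_2$ by $P_{21}$, the y-component of $\mathbf{P}_1$ by $P_{12}$, &c., we obtain the component representation

$$\begin{aligned}
P_x &= P_{11}\cos\alpha + P_{21}\cos\beta + P_{31}\cos\gamma,\\
P_y &= P_{12}\cos\alpha + P_{22}\cos\beta + P_{32}\cos\gamma,\\
P_z &= P_{13}\cos\alpha + P_{23}\cos\beta + P_{33}\cos\gamma.
\end{aligned} \tag{28}$$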


This equation gives every normal direction $\mathbf{n}$ a corresponding stress vector $\mathbf{P}$.

The meaning of the coefficients $P_{ik}$ may be seen by considering a cube of edge unity, whose faces are parallel to the co-ordinate planes. Then, for example, for one of the faces parallel to the yz-plane,

$$\cos\alpha = 1, \qquad \cos\beta = \cos\gamma = 0,$$

so that the normal component $P_x$ is represented by $P_{11}$, and the two components lying in the surface are represented by $P_{12}$ and $P_{13}$. Here, then, the coefficients having two equal subscripts give the normal pressures or tensions, while those with mixed subscripts represent the tangential stresses.
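Equation (28) is easily put into computational form. The following sketch assumes a stress state given as a 3 × 3 array `P[i][k]` in the index convention of the text (first index: face normal; second index: component); the numerical values are invented for illustration, and the function names are hypothetical.

```python
import math

def stress_vector(P, n):
    """Equation (28): P_k = sum_i P[i][k] * n[i], with n the unit normal (cos a, cos b, cos g)."""
    return [sum(P[i][k] * n[i] for i in range(3)) for k in range(3)]

def normal_and_shear(P, n):
    """Split the stress vector into its normal part and the magnitude of its tangential part."""
    t = stress_vector(P, n)
    normal = sum(t[k] * n[k] for k in range(3))
    shear = math.sqrt(max(sum(c * c for c in t) - normal * normal, 0.0))
    return normal, shear

# Assumed symmetric stress components (arbitrary units).
P = [[50.0, 10.0, 0.0],
     [10.0, 20.0, 0.0],
     [0.0, 0.0, 0.0]]

# Face parallel to the yz-plane: cos(alpha) = 1, cos(beta) = cos(gamma) = 0.
t_x = stress_vector(P, [1.0, 0.0, 0.0])
normal_x, shear_x = normal_and_shear(P, [1.0, 0.0, 0.0])
```

For the face normal to the x-axis the stress vector reduces to (P₁₁, P₁₂, P₁₃), in agreement with the interpretation of the coefficients given above.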

The coefficients again represent the components of a tensor, since the connexion between $\mathbf{P}$ and $\mathbf{n}$ must, of course, be independent of the particular co-ordinate system in which the vectors are resolved, and must always lead from a given vector $\mathbf{n}$ to the same vector $\mathbf{P}$. The tensor defined by the quantities $P_{ik}$ is symmetric, as will be shown in the following section. It may therefore be represented again by an ellipsoid, called the ellipsoid of stress. If the co-ordinate axes are taken along the principal axes of the ellipsoid, the mixed tensor components drop out, and we have only the three coefficients $P_1$, $P_2$, $P_3$, corresponding to the diagonal terms of the matrix of coefficients. These three quantities are known as the principal stresses. The relationship between the components of the stress acting on a surface element normal to $\mathbf{n}$ and the principal stresses is very simple when expressed in this system of co-ordinates. Denoting the quantities referred to this system by primes, we have

$$P_x' = P_1\cos\alpha', \qquad P_y' = P_2\cos\beta', \qquad P_z' = P_3\cos\gamma'. \tag{29}$$


3. The Conditions of Equilibrium of an Elastic Body. 

Consider the equilibrium of the forces applied to a finite volume of an elastic body. If we denote the resultant of the forces proportional to the volume (the body forces) acting on unit volume by $\mathbf{g}$, and apply the fundamental equations of the mechanics of systems of particles, we obtain

$$\int \mathbf{g}\,d\tau + \oint \mathbf{P}\,dS = 0, \tag{30}$$

$$\int [\mathbf{r}\,\mathbf{g}]\,d\tau + \oint [\mathbf{r}\,\mathbf{P}]\,dS = 0. \tag{31}$$

The surface integral is to be taken over the bounding surface of the volume under consideration. In adding the stresses acting upon the several volume elements, the internal stresses on the boundary between two elements cancel out, since these stresses are applied in opposite directions on the two sides of such a surface of separation.

If we take the x-component of equation (30) we have on the left

$$\int g_x\,d\tau + \oint P_x\,dS = \int g_x\,d\tau + \oint (P_{11}\cos\alpha + P_{21}\cos\beta + P_{31}\cos\gamma)\,dS. \tag{32}$$

The surface integral may be brought into the form appearing in Gauss's Theorem and then transformed into a volume integral if we introduce another vector $\bar{\mathbf{P}}_1$:

$$\bar{\mathbf{P}}_1 = P_{11}\mathbf{i} + P_{21}\mathbf{j} + P_{31}\mathbf{k}. \tag{33}$$

We shall show, below, that the tensor $P_{ik}$ is symmetric, so that $\bar{\mathbf{P}}_1$ is identical with $\mathbf{P}_1$. Using this new vector,

$$\oint P_x\,dS = \oint (\bar{\mathbf{P}}_1\,\mathbf{n})\,dS = \oint (\bar{\mathbf{P}}_1\,d\mathbf{S}) = \int \operatorname{div}\bar{\mathbf{P}}_1\,d\tau. \tag{34}$$


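Since the condition of equilibrium must hold for every volume, we have, writing the analogous equations for the y- and z-components:

$$\begin{aligned}
g_x + \operatorname{div}\bar{\mathbf{P}}_1 &= g_x + \frac{\partial P_{11}}{\partial x} + \frac{\partial P_{21}}{\partial y} + \frac{\partial P_{31}}{\partial z} = 0,\\
g_y + \operatorname{div}\bar{\mathbf{P}}_2 &= g_y + \frac{\partial P_{12}}{\partial x} + \frac{\partial P_{22}}{\partial y} + \frac{\partial P_{32}}{\partial z} = 0,\\
g_z + \operatorname{div}\bar{\mathbf{P}}_3 &= g_z + \frac{\partial P_{13}}{\partial x} + \frac{\partial P_{23}}{\partial y} + \frac{\partial P_{33}}{\partial z} = 0.
\end{aligned} \tag{35}$$
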
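Again, the x-component of the equation of moments becomes

$$\int (y g_z - z g_y)\,d\tau + \oint (y P_z - z P_y)\,dS = 0. \tag{36}$$

Just as before, we may write

$$\oint (y P_z - z P_y)\,dS = \oint (y\bar{\mathbf{P}}_3 - z\bar{\mathbf{P}}_2)\,d\mathbf{S} = \int (\operatorname{div} y\bar{\mathbf{P}}_3 - \operatorname{div} z\bar{\mathbf{P}}_2)\,d\tau. \tag{37}$$

Now

$$\operatorname{div} y\bar{\mathbf{P}}_3 = y\operatorname{div}\bar{\mathbf{P}}_3 + \bar{\mathbf{P}}_3\,(\operatorname{grad} y) = y\operatorname{div}\bar{\mathbf{P}}_3 + \bar{\mathbf{P}}_3\,\mathbf{j} = y\operatorname{div}\bar{\mathbf{P}}_3 + P_{23}, \tag{38}$$

so that the x-component of the equation of moments becomes

$$y(g_z + \operatorname{div}\bar{\mathbf{P}}_3) - z(g_y + \operatorname{div}\bar{\mathbf{P}}_2) - P_{32} + P_{23} = 0. \tag{39}$$

But, according to the first condition of equilibrium, the bracketed expressions vanish, so that there remains the symmetry of the stress tensor:

$$P_{32} = P_{23}. \tag{40}$$

Similarly, from the other two component equations of (31),

$$P_{13} = P_{31}, \qquad P_{21} = P_{12}.$$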


In addition to the conditions for equilibrium of internal forces we have those for the surface forces. If the stresses $\mathbf{P}^e$ on the surface elements, caused by the external forces, are known, then equations (28) (p. 168) hold again, and we have

$$\begin{aligned}
P_x^e &= P_{11}\cos\alpha + P_{21}\cos\beta + P_{31}\cos\gamma,\\
P_y^e &= P_{12}\cos\alpha + P_{22}\cos\beta + P_{32}\cos\gamma,\\
P_z^e &= P_{13}\cos\alpha + P_{23}\cos\beta + P_{33}\cos\gamma.
\end{aligned} \tag{41}$$

4. Relations between the Strain Tensor and the Stress Tensor.


The conditions of equilibrium (35) are not yet sufficient for the answer to the question "What equilibrium state of strain results when given external forces are applied?" We still lack the principal thing: the connexion between the stress and the strain. Since each consists of six components, we must expect, in the most general case, that each component of the strain tensor is a function of all the components of the stress tensor. Assuming a linear relationship, this leads to a system of equations with thirty-six coefficients. This number reduces very considerably, according to the symmetry properties of the substance under consideration. If we consider only isotropic substances (i.e. not crystals), then, as we shall see, two constants are sufficient. For an isotropic substance, the axes of the stress ellipsoid must coincide with those of the strain ellipsoid, for the former give the directions for which the stresses are normal to the surface; and for an isotropic body, only contractions or extensions, but not lateral displacements, can occur in these directions. Now, according to § 1 (p. 165), this is the essential property of the axes of the strain ellipsoid. We shall therefore express the connexion between the stress and strain tensors in this uniquely indicated system of co-ordinates.

The foundation of the so-called classical theory of elasticity is Hooke's Law, which states that the normal stress is proportional to the elongation per unit length ("ut tensio, sic vis"). It is understood that we limit ourselves to small strains, for, according to experience, Hooke's Law is no longer valid for larger deformations. If we write this law for the axis 1 in the form $P_1 = Ee_1$, then $E$ is the linear modulus of elasticity, or Young's modulus, for the material in question. We know from experience that a body undergoes a contraction of its transverse sections when stretched in the direction of its length. The ratio of the transverse contraction (i.e. the change in length per unit length of a transverse segment) to the change in length per unit length of the body is, according to experiment, constant within the limits where Hooke's Law holds. This ratio is called Poisson's ratio $\sigma$. That is, if stresses $P_1$, $P_2$, $P_3$ act in the directions of the axes, then the tension $P_2$ causes a contraction along the axis 1. This relative contraction is given by $\sigma e_2$, and since $P_2 = Ee_2$, it amounts to $(\sigma/E)P_2$. Similarly, the tension $P_3$ causes a contraction of amount $(\sigma/E)P_3$. The total relative change in length is thus given by

$$\begin{aligned}
e_1 &= \frac{1}{E}\{P_1 - \sigma(P_2 + P_3)\},\\
e_2 &= \frac{1}{E}\{P_2 - \sigma(P_3 + P_1)\},\\
e_3 &= \frac{1}{E}\{P_3 - \sigma(P_1 + P_2)\}.
\end{aligned} \tag{42}$$

It remains only to transform these equations, derived in this way for the special co-ordinate system of the principal axes of the ellipsoids, to an arbitrary co-ordinate system, since, in general, the principal axes for a strained body change in direction from place to place, and thus are not suited to describe what occurs. Let the axis 1 form an angle $(1, x)$ with the x-axis, an angle $(1, y)$ with the y-axis, and an angle $(1, z)$ with the z-axis, and let the angles between the axes 2 and 3 and the new axes be denoted in similar manner. Further, let us characterize the principal axes 1, 2, 3 by the unit vectors $\mathbf{c}_1$, $\mathbf{c}_2$, $\mathbf{c}_3$, the new axes by the unit vectors $\mathbf{i}$, $\mathbf{j}$, $\mathbf{k}$. Then

$$\begin{aligned}
\mathbf{i} &= \mathbf{c}_1\cos(1, x) + \mathbf{c}_2\cos(2, x) + \mathbf{c}_3\cos(3, x),\\
\mathbf{j} &= \mathbf{c}_1\cos(1, y) + \mathbf{c}_2\cos(2, y) + \mathbf{c}_3\cos(3, y),\\
\mathbf{k} &= \mathbf{c}_1\cos(1, z) + \mathbf{c}_2\cos(2, z) + \mathbf{c}_3\cos(3, z).
\end{aligned} \tag{43}$$

By the rule for transformation of tensor components proved on p. 34 we obtain

$$\begin{aligned}
e_{11} &= e_1\cos^2(1, x) + e_2\cos^2(2, x) + e_3\cos^2(3, x),\\
e_{22} &= e_1\cos^2(1, y) + e_2\cos^2(2, y) + e_3\cos^2(3, y),\\
e_{33} &= e_1\cos^2(1, z) + e_2\cos^2(2, z) + e_3\cos^2(3, z),\\
\tfrac{1}{2}e_{12} = \tfrac{1}{2}e_{21} &= e_1\cos(1, x)\cos(1, y) + e_2\cos(2, x)\cos(2, y) + e_3\cos(3, x)\cos(3, y),\\
\tfrac{1}{2}e_{23} = \tfrac{1}{2}e_{32} &= e_1\cos(1, y)\cos(1, z) + e_2\cos(2, y)\cos(2, z) + e_3\cos(3, y)\cos(3, z),\\
\tfrac{1}{2}e_{31} = \tfrac{1}{2}e_{13} &= e_1\cos(1, z)\cos(1, x) + e_2\cos(2, z)\cos(2, x) + e_3\cos(3, z)\cos(3, x).
\end{aligned} \tag{44}$$

It is evident from these expressions that the sum

$$e_{11} + e_{22} + e_{33} = \operatorname{div}\mathbf{s}$$

is equal to $e_1 + e_2 + e_3$, independent of the co-ordinate system. Such quantities are called invariants of a tensor. A second invariant which will be useful later (cf. p. 176) is the expression

$$e_{11}e_{22} + e_{22}e_{33} + e_{33}e_{11} - \tfrac{1}{4}(e_{12}^2 + e_{23}^2 + e_{31}^2) = e_1e_2 + e_2e_3 + e_3e_1. \tag{45}$$
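The invariance expressed by (45) may be verified numerically from the transformation formulae (44). The sketch below is illustrative only: the principal extensions and the two rotation angles are assumed values, and the table of direction cosines is built, as one convenient choice, from two successive rotations.

```python
import math

def direction_cosines(alpha, beta):
    """C[a][j] = cos(a, x_j): an orthogonal table built from a rotation about z, then about x."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    rz = [[ca, -sa, 0.0], [sa, ca, 0.0], [0.0, 0.0, 1.0]]
    rx = [[1.0, 0.0, 0.0], [0.0, cb, -sb], [0.0, sb, cb]]
    return [[sum(rz[a][m] * rx[m][j] for m in range(3)) for j in range(3)]
            for a in range(3)]

def strain_components(principal, C):
    """Equations (44). Off-diagonal entries hold e_jk itself, i.e. twice the tensor component."""
    e = [[0.0] * 3 for _ in range(3)]
    for j in range(3):
        for k in range(3):
            t = sum(principal[a] * C[a][j] * C[a][k] for a in range(3))
            e[j][k] = t if j == k else 2.0 * t
    return e

def first_invariant(e):
    return e[0][0] + e[1][1] + e[2][2]

def second_invariant(e):
    """Left-hand side of equation (45)."""
    return (e[0][0] * e[1][1] + e[1][1] * e[2][2] + e[2][2] * e[0][0]
            - 0.25 * (e[0][1] ** 2 + e[1][2] ** 2 + e[2][0] ** 2))

principal = [1.0e-3, -2.0e-4, 5.0e-4]   # assumed principal extensions e_1, e_2, e_3
e = strain_components(principal, direction_cosines(0.4, 1.1))
```

Whatever angles are chosen, `first_invariant(e)` returns e₁ + e₂ + e₃ and `second_invariant(e)` returns e₁e₂ + e₂e₃ + e₃e₁, up to rounding.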


The same transformation formulae hold for the components of the stress tensor $P_{ik}$. Thus, expressing $e_1$, $e_2$, $e_3$ in (44) in terms of $P_1$, $P_2$, $P_3$ using equations (42), and also the relationships

$$P_1 + P_2 + P_3 = P_{11} + P_{22} + P_{33},$$
$$\cos^2(1, x) + \cos^2(2, x) + \cos^2(3, x) = 1,$$

and

$$\cos(1, x)\cos(1, y) + \cos(2, x)\cos(2, y) + \cos(3, x)\cos(3, y) = 0, \ \text{&c.},$$

we obtain

$$\begin{aligned}
e_{11} &= \frac{1}{E}\{P_{11} - \sigma(P_{22} + P_{33})\}, & e_{12} &= \frac{2(1 + \sigma)}{E}P_{12},\\
e_{22} &= \frac{1}{E}\{P_{22} - \sigma(P_{33} + P_{11})\}, & e_{23} &= \frac{2(1 + \sigma)}{E}P_{23},\\
e_{33} &= \frac{1}{E}\{P_{33} - \sigma(P_{11} + P_{22})\}, & e_{31} &= \frac{2(1 + \sigma)}{E}P_{31}.
\end{aligned} \tag{46}$$

The second triad of equations, which is absent if the co-ordinate system is that of the principal axes, has the following meaning: Consider again a cube whose edge is unity, and whose faces are parallel to the co-ordinate planes. $\mathbf{P}_1$ is the force acting on one of the faces normal to the x-axis; the opposite face is assumed to be held fixed. Then $P_{12}$ is the tangential component, which is in the direction of the y-axis. The change in the angle between the x-axis and the y-axis is represented by $e_{12}$ (cf. fig. 3). According to the first of the equations under discussion, these two quantities are proportional to one another. The factor of proportionality

$$\frac{E}{2(1 + \sigma)} \tag{47}$$

is called the shear modulus, and is usually denoted by $\mu$ or $n$. This quantity, however, is not a new and independent constant of the material, but is determined by the values of $E$ and of $\sigma$, as in (47).

Similarly, the modulus of compression, or bulk modulus, $k$, or its reciprocal, the compressibility $\kappa$, are already determined if $E$ and $\sigma$ are known. We define $\kappa$ as the change in volume per unit volume under the influence of unit normal pressure applied equally from all sides. According to this definition we have [cf. equation (20) (p. 166)]

$$\kappa = \frac{1}{k} = -\frac{V'' - V}{VP} = -\frac{e_{11} + e_{22} + e_{33}}{P}. \tag{48}$$

Putting in the values of $e_{ik}$ from (46), with $P_{11} = P_{22} = P_{33} = -P$ for a pressure $P$ applied equally from all sides, we have

$$\kappa = -\frac{1 - 2\sigma}{EP}(P_{11} + P_{22} + P_{33}) = \frac{3(1 - 2\sigma)}{E}. \tag{49}$$

Since the application of tension to the surface must cause an increase in volume, $\sigma$ cannot be greater than 1/2. Were $\sigma = 1/2$, the substance would be incompressible.
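Equations (47) and (49) can be evaluated directly. The short sketch below uses round illustrative values, $E = 200 \times 10^9$ Pa and $\sigma = 0.3$, which are assumptions of the order found for steel and are not taken from the text; the function names are hypothetical.

```python
def shear_modulus(E, sigma):
    """Equation (47): mu = E / (2 (1 + sigma))."""
    return E / (2.0 * (1.0 + sigma))

def compressibility(E, sigma):
    """Equation (49): kappa = 3 (1 - 2 sigma) / E."""
    return 3.0 * (1.0 - 2.0 * sigma) / E

def bulk_modulus(E, sigma):
    """Reciprocal of the compressibility; undefined for sigma = 1/2 (incompressible)."""
    return 1.0 / compressibility(E, sigma)

E = 200e9      # assumed Young's modulus in Pa
sigma = 0.3    # assumed Poisson's ratio

mu = shear_modulus(E, sigma)   # about 7.7e10 Pa
k = bulk_modulus(E, sigma)     # about 1.67e11 Pa
```

As $\sigma$ approaches 1/2 the compressibility tends to zero and the bulk modulus grows without limit, which is the incompressible case mentioned above.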

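Solving the equations (46) for the components of the stress tensor, we obtain

$$\begin{aligned}
P_{11} &= \frac{E}{1 + \sigma}\left\{e_{11} + \frac{\sigma}{1 - 2\sigma}(e_{11} + e_{22} + e_{33})\right\}, & P_{12} &= \frac{E}{2(1 + \sigma)}e_{12},\\
P_{22} &= \frac{E}{1 + \sigma}\left\{e_{22} + \frac{\sigma}{1 - 2\sigma}(e_{11} + e_{22} + e_{33})\right\}, & P_{23} &= \frac{E}{2(1 + \sigma)}e_{23},\\
P_{33} &= \frac{E}{1 + \sigma}\left\{e_{33} + \frac{\sigma}{1 - 2\sigma}(e_{11} + e_{22} + e_{33})\right\}, & P_{31} &= \frac{E}{2(1 + \sigma)}e_{31}.
\end{aligned} \tag{50}$$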
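That (50) is indeed the inverse of (46) may be confirmed by a round trip. The following sketch stores $e_{ik}$ and $P_{ik}$ as 3 × 3 arrays, the mixed entries of `e` holding the full angle changes $e_{12}$, $e_{23}$, $e_{31}$ as defined in (21); the material constants and the stress values are assumed for illustration.

```python
def strains_from_stresses(P, E, sigma):
    """Equations (46)."""
    trace = P[0][0] + P[1][1] + P[2][2]
    e = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for k in range(3):
            if i == k:
                e[i][k] = (P[i][i] - sigma * (trace - P[i][i])) / E
            else:
                e[i][k] = 2.0 * (1.0 + sigma) / E * P[i][k]
    return e

def stresses_from_strains(e, E, sigma):
    """Equations (50)."""
    trace = e[0][0] + e[1][1] + e[2][2]
    P = [[0.0] * 3 for _ in range(3)]
    for i in range(3):
        for k in range(3):
            if i == k:
                P[i][k] = E / (1.0 + sigma) * (e[i][i] + sigma / (1.0 - 2.0 * sigma) * trace)
            else:
                P[i][k] = E / (2.0 * (1.0 + sigma)) * e[i][k]
    return P

E, sigma = 200e9, 0.3                     # assumed material constants
P = [[100e6, 20e6, 0.0],                  # assumed stress state in Pa
     [20e6, -40e6, 5e6],
     [0.0, 5e6, 10e6]]

e = strains_from_stresses(P, E, sigma)
P_back = stresses_from_strains(e, E, sigma)

# Simple tension along x: the lateral strains are -sigma times the longitudinal strain.
e_uniaxial = strains_from_stresses([[100e6, 0, 0], [0, 0, 0], [0, 0, 0]], E, sigma)
```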


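If we now substitute these values of the stresses in equations (35) (p. 170), and if we again introduce the values of $e_{ik}$ from equations (21) (p. 166), we obtain a system of differential equations containing only the displacements $u$, $v$, $w$ and their derivatives. First, the x-component becomes

$$g_x + \frac{E}{2(1 + \sigma)}\left\{2\frac{\partial^2 u}{\partial x^2} + \frac{2\sigma}{1 - 2\sigma}\left(\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 v}{\partial x\,\partial y} + \frac{\partial^2 w}{\partial x\,\partial z}\right)\right\} + \frac{E}{2(1 + \sigma)}\left\{\frac{\partial^2 u}{\partial y^2} + \frac{\partial^2 v}{\partial x\,\partial y} + \frac{\partial^2 w}{\partial x\,\partial z} + \frac{\partial^2 u}{\partial z^2}\right\} = 0.$$

This may be written

$$g_x + \frac{E}{2(1 + \sigma)}\left\{\Delta u + \frac{1}{1 - 2\sigma}\frac{\partial}{\partial x}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} + \frac{\partial w}{\partial z}\right)\right\} = 0. \tag{51}$$

Similarly,

$$g_y + \frac{E}{2(1 + \sigma)}\left\{\Delta v + \frac{1}{1 - 2\sigma}\frac{\partial}{\partial y}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} + \frac{\partial w}{\partial z}\right)\right\} = 0, \tag{51'}$$

and

$$g_z + \frac{E}{2(1 + \sigma)}\left\{\Delta w + \frac{1}{1 - 2\sigma}\frac{\partial}{\partial z}\left(\frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} + \frac{\partial w}{\partial z}\right)\right\} = 0. \tag{51''}$$

In vector form, this system becomes

$$\mathbf{g} + \frac{E}{2(1 + \sigma)}\left\{\Delta\mathbf{s} + \frac{1}{1 - 2\sigma}\operatorname{grad}\operatorname{div}\mathbf{s}\right\} = 0. \tag{52}$$

This is the general system of equations for the distortion of an isotropic, elastic substance. In addition, we have the surface boundary conditions, which may be given in the form of information concerning the displacements or the distribution of stresses on the surface.

5. Energy of Elastically Deformed Bodies; Elastic Potential.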

Work is required to bend a bar elastically. This work remains 
stored in the bar in the form of potential energy as long as the defor- 
mation persists. It may be drawn upon by letting the bar go, where- 
upon portions of the bar attain kinetic energy and go beyond the 
equilibrium position. The energy of elastic deformation is potential energy of the forces operating between the particles and is distributed continuously throughout the body. It has its greatest value at places where the deformation is greatest. In view of the continuous distribution, we look for an expression for the energy density, i.e. the limit of the ratio energy/volume of an elementary volume.*

To obtain an expression for the energy density $U$, we start with a cube of edge equal to unity. In this case the energy content of the cube is equal to the energy density. Let the edges of the cube be in the directions of the principal axes of strain. As a result, the cube suffers only extension or compression, but not shear. In its deformed state, the face perpendicular to axis 1 has an area of $(1 + e_2)(1 + e_3)$, and so the force acting on it is $P_1(1 + e_2)(1 + e_3)$. If now the edge parallel to 1, whose deformed length is $(1 + e_1)$, were stretched an amount $\delta e_1$, the stress $P_1$ would do an amount of work equal to

$$\delta A_1 = P_1(1 + e_2)(1 + e_3)\,\delta e_1.$$

(The increase in stress due to the added deformation would make a contribution which would be of only second order.) The above amount of work represents the increase in potential energy due to the assumed change in deformation. Further, the quantities $e_2$ and $e_3$ in the parentheses may be neglected in comparison with unity, so that the increase in potential energy due to the increase in length of the axis 1 will amount to

$$\delta U = P_1\,\delta e_1 \quad\text{or}\quad P_1 = \frac{\partial U}{\partial e_1}. \tag{53}$$

Analogous expressions result for the other axes. Substituting the values of $P_1$, $P_2$, $P_3$ from (42) or (50), we obtain

$$\begin{aligned}
P_1 &= \frac{\partial U}{\partial e_1} = \frac{E}{1 + \sigma}\left[e_1 + \frac{\sigma}{1 - 2\sigma}(e_1 + e_2 + e_3)\right],\\
P_2 &= \frac{\partial U}{\partial e_2} = \frac{E}{1 + \sigma}\left[e_2 + \frac{\sigma}{1 - 2\sigma}(e_1 + e_2 + e_3)\right],\\
P_3 &= \frac{\partial U}{\partial e_3} = \frac{E}{1 + \sigma}\left[e_3 + \frac{\sigma}{1 - 2\sigma}(e_1 + e_2 + e_3)\right].
\end{aligned} \tag{54}$$

* It must be remembered, however, that according to the discussion on p. 3 the element of volume must not be made too small.

It is easy to see that these equations are satisfied by setting

$$U(e_1, e_2, e_3) = \frac{E}{2(1 + \sigma)}\left\{e_1^2 + e_2^2 + e_3^2 + \frac{\sigma}{1 - 2\sigma}(e_1 + e_2 + e_3)^2\right\}. \tag{55}$$

This expression, which gives the energy content as a function of the principal strains, presupposes the use of the principal axes as co-ordinate axes to be used at each point, and this system will change from one point to another. For this reason, it would be desirable to transform the relation to a form that holds for the whole body. To do this, we do not have to use the general transformation, since the quantities involved here may be expressed readily in terms of invariants of the tensor. The second parenthesis is such an invariant as it stands. Further, because of (45),

$$\begin{aligned}
e_1^2 + e_2^2 + e_3^2 &= (e_1 + e_2 + e_3)^2 - 2(e_1e_2 + e_2e_3 + e_3e_1)\\
&= (e_{11} + e_{22} + e_{33})^2 - 2\left[e_{11}e_{22} + e_{22}e_{33} + e_{33}e_{11} - \tfrac{1}{4}(e_{12}^2 + e_{23}^2 + e_{31}^2)\right].
\end{aligned}$$

Then $U$ will be expressed in terms of the $e_{ik}$ as follows:

$$U(e_{ik}) = \frac{E}{2(1 + \sigma)}\left\{\frac{1 - \sigma}{1 - 2\sigma}(e_{11} + e_{22} + e_{33})^2 - 2(e_{11}e_{22} + e_{22}e_{33} + e_{33}e_{11}) + \tfrac{1}{2}(e_{12}^2 + e_{23}^2 + e_{31}^2)\right\}. \tag{56}$$

Comparison with (50) shows again that, in general,

$$P_{ik} = \frac{\partial U}{\partial e_{ik}}. \tag{57}$$

Finally, the deformations in (56) may be expressed in terms of the stresses, giving a quite analogous expression which, however, will not be written down. The quantity $U(e_{ik})$ is rightly called the elastic potential, since the stresses are obtainable from it by differentiation.
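Relation (57) lends itself to a numerical check: differentiating (56) by central differences should reproduce the stresses (50). In the sketch below the six strain quantities are held in the order $(e_{11}, e_{22}, e_{33}, e_{12}, e_{23}, e_{31})$, each mixed component being treated as a single variable; the material constants and strain values are assumed for illustration, and the function names are hypothetical.

```python
def elastic_potential(e, E, sigma):
    """Equation (56) with e = (e11, e22, e33, e12, e23, e31)."""
    e11, e22, e33, e12, e23, e31 = e
    trace = e11 + e22 + e33
    return E / (2.0 * (1.0 + sigma)) * (
        (1.0 - sigma) / (1.0 - 2.0 * sigma) * trace ** 2
        - 2.0 * (e11 * e22 + e22 * e33 + e33 * e11)
        + 0.5 * (e12 ** 2 + e23 ** 2 + e31 ** 2))

def stresses(e, E, sigma):
    """Equations (50), in the order (P11, P22, P33, P12, P23, P31)."""
    e11, e22, e33, e12, e23, e31 = e
    trace = e11 + e22 + e33
    lam = sigma / (1.0 - 2.0 * sigma) * trace
    normal = [E / (1.0 + sigma) * (x + lam) for x in (e11, e22, e33)]
    shear = [E / (2.0 * (1.0 + sigma)) * x for x in (e12, e23, e31)]
    return normal + shear

def numerical_gradient(f, x, h=1e-6):
    """Central differences; for a quadratic form these are exact up to rounding."""
    grad = []
    for i in range(len(x)):
        up = list(x); up[i] += h
        dn = list(x); dn[i] -= h
        grad.append((f(up) - f(dn)) / (2.0 * h))
    return grad

E, sigma = 200e9, 0.3                                  # assumed material constants
e = [1.0e-3, -4.0e-4, 2.0e-4, 3.0e-4, -1.0e-4, 5.0e-4]  # assumed strains

P_direct = stresses(e, E, sigma)
P_from_U = numerical_gradient(lambda x: elastic_potential(x, E, sigma), e)
```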

6. Elementary Treatment of the Bending of a Cantilever Beam. 

The exact solution of an elastostatic problem requires the integration of equation (52), subject to the boundary conditions applying in the special case considered. In general, this procedure leads to extremely complicated calculations. For this reason, approximate solutions have been devised for many technically important problems. Certain simplifications are made in the initial assumptions, and the results are sufficiently accurate for many purposes. As an example of such a simplified treatment, we give J. Bernoulli's theory of the bending of a beam of uniform cross-section.

The Bernoulli theory is based upon the following assumptions: Imagine a certain point, say the centre of gravity, to be marked in every cross-section. The resulting straight line, which we call the axis of the beam, will be bent into a curve which lies in a vertical plane, provided that the bending is caused only by weights or other vertical forces. Further, imagine the beam divided into a number of thin, horizontal layers, each made up of filaments. During the bending, there will be one layer in the middle, the fibres in which are not changed in length, and which is therefore called the neutral surface. Filaments lying below the neutral surface are compressed, those above are stretched (cf. fig. 4). The flexural rigidity or stiffness of the beam is assumed to be conditioned solely by these changes in length. A further assumption of the Bernoulli theory is that plane sections normal to the axis of the beam remain plane and perpendicular to the axis after flexure.

Fig. 4

With these assumptions, it is not difficult to calculate the elastic stresses acting upon any particular section. If we consider two neighbouring cross-sections, distant $dx$ from one another, then these sections will form an angle $d\phi$ with each other after bending. Measure the x-co-ordinate towards the right from the fixed end of the beam, and take the z-axis vertically upward from the neutral fibre, at the fixed end. The y-axis is horizontal, and directed across the beam from the nearer face. If $\rho$ is the radius of curvature of the neutral fibre, then by fig. 4, a layer which is distant $z$ from the neutral surface experiences a relative change in length amounting to

$$\frac{(\rho + z)\,d\phi - \rho\,d\phi}{dx} = z\,\frac{d\phi}{dx} = \frac{z}{\rho}, \tag{58}$$

since $d\phi/dx$ is the curvature $1/\rho$.

The force acting upon an element $dz\,dy$ of the cross-section is, according to the definition of Young's modulus $E$,

$$dF = \frac{Ez}{\rho}\,dz\,dy.$$

The total force acting upon the entire cross-section is then

$$F = \frac{E}{\rho}\iint z\,dz\,dy. \tag{59}$$

This resultant must vanish, since neither a change in cross-section nor a longitudinal displacement of any section can take place in pure bending. This means that the neutral surface always passes through the centre of gravity of the cross-section, for this is the origin for which $\iint z\,dz\,dy$ vanishes. On the other hand, there exists a resultant moment of the elastic forces which tends to turn the cross-section about the line of intersection of the neutral surface with this section. In fig. 4, this moment is in the direction of the negative y-axis. Its magnitude is

$$|M| = \int z\,dF = \frac{E}{\rho}\iint z^2\,dz\,dy = \frac{E}{\rho}\,I. \tag{60}$$

The geometric quantity $\iint z^2\,dz\,dy$ is called the moment of inertia $I$ of the cross-section.

Let the cross-section be rectangular, as assumed in fig. 4; if we call the breadth $b$ and the depth $c$, then the integration is to be extended from $y = 0$ to $y = b$ and from $z = -c/2$ to $z = +c/2$.

The moment of the elastic forces must balance the moment of the external forces. If the bending is caused by a load $Q$ (in force units) applied at the free end of the beam, where $x = a$, then the moment of this load at the point $x$ is, since the portion $a - x$ may be considered a rigid lever arm,

$$|M'| = Q(a - x). \tag{61}$$

The direction of this moment vector is that of the positive y-axis. The condition for equilibrium of a cross-section distant $x$ from the fixed end is thus

$$\frac{EI}{\rho} = Q(a - x). \tag{62}$$

We can now readily obtain the equation of the neutral axis after flexure. Since the curve departs but little from the horizontal direction, we may write

$$\frac{1}{\rho} = -\frac{d^2z}{dx^2}$$

at all points.* Thus we obtain the equation

$$\frac{d^2z}{dx^2} = -\frac{Q}{EI}(a - x). \tag{63}$$

* The negative sign comes from the fact that the curve is concave toward the negative z-axis.

Integrating twice, there results

$$z = -\frac{Q}{EI}\left(\frac{ax^2}{2} - \frac{x^3}{6}\right) + c_1x + c_2. \tag{64}$$

Since $z$ and also $dz/dx$ vanish at $x = 0$, both constants of integration are zero, and we have for the depression of the end of the beam (at $x = a$)

$$z_a = -\frac{Q}{EI}\,\frac{a^3}{3}. \tag{65}$$
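As a numerical illustration of (63) to (65), the sketch below evaluates the moment of inertia of a rectangular section, $I = bc^3/12$ (which follows from the integration limits stated above), and compares the closed-form end depression with a step-by-step integration of (63). All dimensions, the load and Young's modulus are assumed values; the function names are hypothetical.

```python
def rectangle_moment_of_inertia(b, c):
    """Integral of z^2 dz dy over 0 <= y <= b, -c/2 <= z <= c/2."""
    return b * c ** 3 / 12.0

def deflection(x, Q, a, E, I):
    """Equation (64) with both integration constants zero."""
    return -Q / (E * I) * (a * x ** 2 / 2.0 - x ** 3 / 6.0)

def integrate_deflection(Q, a, E, I, steps=1000):
    """Integrate z'' = -(Q / (E I)) (a - x) from x = 0 to x = a with z(0) = z'(0) = 0.

    Since z'' is linear in x, the update over each step is exact up to rounding.
    """
    h = a / steps
    z, slope = 0.0, 0.0
    for n in range(steps):
        x = n * h
        k0 = -(Q / (E * I)) * (a - x)
        k1 = -(Q / (E * I)) * (a - (x + h))
        z += slope * h + h * h * (2.0 * k0 + k1) / 6.0
        slope += h * (k0 + k1) / 2.0
    return z

E = 200e9            # assumed Young's modulus, Pa
b, c = 0.02, 0.01    # assumed breadth and depth, m
a = 1.0              # assumed length, m
Q = 10.0             # assumed end load, N

I = rectangle_moment_of_inertia(b, c)
z_end = deflection(a, Q, a, E, I)            # equation (65)
z_numeric = integrate_deflection(Q, a, E, I)
```

With these assumed values $EI$ is about 333 N m², and the end of the beam sinks by about 1 cm, small compared with its length, as the approximation $1/\rho = -d^2z/dx^2$ requires.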


The deformation of the cross-section, which was not taken into account by Bernoulli, may also be given without difficulty, provided the cross-section is initially rectangular. Since a transverse contraction takes place above the neutral surface, where the fibres are stretched, and a transverse expansion occurs below this surface, the originally vertical sides of the rectangle must converge toward the top, so that the section now has the form of a sector of a circular annulus (fig. 5). Let $\rho'$ be the radius of curvature of the line of intersection of the neutral surface with this cross-section, and call the radius of curvature of the neutral fibre $\rho$, as before. Following the same method as above, we find that the relative transverse extension (in reality, contraction) of a layer which is a distance $z$ above the neutral surface is given by

$$\frac{(\rho' - z)\,d\phi - \rho'\,d\phi}{dy} = -z\,\frac{d\phi}{dy} = -\frac{z}{\rho'}.$$

Now the longitudinal dilatation at the same point was found to be $+z/\rho$, by (58). According to the definition of Poisson's Ratio, we then have

$$\sigma = \frac{\rho}{\rho'} \quad\text{or}\quad \rho' = \frac{\rho}{\sigma}. \tag{66}$$

Thus the originally plane layer at height $z$ is bent into the form of an anticlastic surface whose two radii of curvature are in the ratio $\sigma$. This fact is the basis of an elegant and simple method of determining $\sigma$ (see Ex. 56).

Ex. 55. Calculate the deflection of a tube of length a, radii $r_1$ and $r_2$, supported
at the ends and carrying a concentrated load at the mid-point. Hint: In order to
reduce this to the above case, imagine the middle of the tube clamped fast, and
an upward force equal to half the load applied at each end.

Ex. 56. A plate of rectangular section, supported along its central line, is bent
by loads applied at the ends. A plane parallel plate of glass is laid upon the bent
plate at the centre, and the system is illuminated from above by monochromatic
light. The interference fringes (p. 410), appearing at the thin layer between the
plates, furnish a contour map of the surface resulting from the bending. By
approximate representation of the surface in the neighbourhood of the point
of contact, show that the contour lines are hyperbolas, the angle between the
asymptotes being related to $\sigma$ in a simple way. Hint: Observe that for the point
of contact

$$ \frac{\partial^{2} z}{\partial x^{2}} = \frac{1}{\rho'}, \qquad \frac{\partial^{2} z}{\partial y^{2}} = -\frac{1}{\rho} \qquad\text{and}\qquad \rho' = \frac{\rho}{\sigma}, $$

and that $\rho$ and $\rho'$ may be considered constant in the neighbourhood of this point.

Ex. 57. A wire of length l and radius of cross-section a is clamped at one end,
the other end being twisted through an angle $\phi$. Compute the resulting moment
of torsion, which will be proportional to $\phi$, by imagining the wire divided into
thin, concentric hollow cylinders, and these in turn subdivided into narrow
rectangular elements that suffer pure shear when the wire is twisted.


7. Waves in Unbounded Elastic Media (Seismic Waves). Longitudinal 
Waves in Bars. 

The fundamental equation (52) (p. 174) holds for elastokinetics 
as well as for elastostatics. We wish to apply it to the kinetic case 
and study the disturbances arising in an unbounded medium sub- 
jected to no external body forces. In this connexion, the inertial 
forces given by d’Alembert’s principle are 

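$$ -\rho\,\frac{\partial^{2}\mathbf{s}}{\partial t^{2}} $$

per unit volume, $\rho$ being the density. If all other body forces are
absent, equation (52) becomes

$$ \rho\,\frac{\partial^{2}\mathbf{s}}{\partial t^{2}} = \frac{E}{2(1+\sigma)}\left\{\Delta\mathbf{s} + \frac{1}{1-2\sigma}\,\mathrm{grad}\,\mathrm{div}\,\mathbf{s}\right\}. \tag{67} $$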

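From among the many integrals of this second-order partial differential
equation, we select in the first instance the simple special case
where the displacement vector $\mathbf{s}$ depends only upon one co-ordinate,
say x, and thus derivatives with respect to y and z vanish. In
co-ordinate form, we then have

$$
\begin{aligned}
\rho\,\frac{\partial^{2} u}{\partial t^{2}} &= \frac{E}{2(1+\sigma)}\left(\frac{\partial^{2} u}{\partial x^{2}} + \frac{1}{1-2\sigma}\,\frac{\partial^{2} u}{\partial x^{2}}\right) = \frac{E(1-\sigma)}{(1+\sigma)(1-2\sigma)}\,\frac{\partial^{2} u}{\partial x^{2}}, \\
\rho\,\frac{\partial^{2} v}{\partial t^{2}} &= \frac{E}{2(1+\sigma)}\,\frac{\partial^{2} v}{\partial x^{2}}, \\
\rho\,\frac{\partial^{2} w}{\partial t^{2}} &= \frac{E}{2(1+\sigma)}\,\frac{\partial^{2} w}{\partial x^{2}}.
\end{aligned}
\tag{68}
$$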
These are, however, equations of plane waves propagated in the direction
of the x-axis, as comparison with the wave equation (47) (p. 62)
shows. The coefficient of $\partial^{2}u/\partial t^{2}$ gives the square of the reciprocal of
the wave (phase) velocity, if the coefficient of $\partial^{2}u/\partial x^{2}$ is reduced to
unity. It is evident from (68) that the wave velocity for a displacement
along the x-axis, i.e. in the direction of propagation, differs
from the velocity for displacements normal to this direction. Waves
of the first kind are called longitudinal, those of the second kind
transverse. The wave velocities are

$$ c_{\mathrm{long}} = \sqrt{\frac{E(1-\sigma)}{\rho(1+\sigma)(1-2\sigma)}}, \qquad c_{\mathrm{trans}} = \sqrt{\frac{E}{2\rho(1+\sigma)}}. $$



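There is another less formal way of arriving at the classification into
longitudinal and transverse waves, which has more direct physical significance. If
we replace $\Delta\mathbf{s}$ by grad div $\mathbf{s}$ − curl curl $\mathbf{s}$, according to the familiar rule of vector
analysis (p. 40), then (67) becomes

$$ \rho\,\frac{\partial^{2}\mathbf{s}}{\partial t^{2}} = \frac{E}{2(1+\sigma)}\left\{\frac{2(1-\sigma)}{1-2\sigma}\,\mathrm{grad}\,\mathrm{div}\,\mathbf{s} - \mathrm{curl}\,\mathrm{curl}\,\mathbf{s}\right\}. \tag{67'} $$

We now divide $\mathbf{s}$ into two parts, $\mathbf{s}_1 + \mathbf{s}_2$, which are to satisfy the conditions:

$$ \mathrm{curl}\,\mathbf{s}_1 = 0,\quad \mathrm{div}\,\mathbf{s}_1 \neq 0; \qquad \mathrm{div}\,\mathbf{s}_2 = 0,\quad \mathrm{curl}\,\mathbf{s}_2 \neq 0. $$

This means that $\mathbf{s}_1$ represents a compressional wave, since, by equation (20)
(p. 166), the volume dilatation is measured by

$$ \mathrm{div}\,\mathbf{s} = \frac{\partial u}{\partial x} + \frac{\partial v}{\partial y} + \frac{\partial w}{\partial z}. $$

At the same time, the individual volume elements experience no rotation, since
curl $\mathbf{s}_1$ = 0. On account of the postulated independence of y and z, this part
has only an x-component:

$$ \rho\,\frac{\partial^{2} u_1}{\partial t^{2}} = \frac{E(1-\sigma)}{(1+\sigma)(1-2\sigma)}\,\frac{\partial^{2} u_1}{\partial x^{2}}. $$

Comparison with (68) shows that the compressional wave is identical with the
longitudinal wave. On the other hand, compression is absent for $\mathbf{s}_2$, since div $\mathbf{s}_2$ = 0.
As a consequence of the vanishing of the divergence, and of the vanishing of $\partial v/\partial y$
and $\partial w/\partial z$ separately, $\partial u/\partial x$ also vanishes. The non-compressional wave of
shear, as we may call this part, has therefore no component in the direction of
propagation, and is thus transverse. The actual proof of the identity of this
wave with the transverse waves of (68) is obtained by writing the equations for
the y- and z-components of (67'), which yield

$$ \rho\,\frac{\partial^{2} v_2}{\partial t^{2}} = \frac{E}{2(1+\sigma)}\,\frac{\partial^{2} v_2}{\partial x^{2}}, \qquad \rho\,\frac{\partial^{2} w_2}{\partial t^{2}} = \frac{E}{2(1+\sigma)}\,\frac{\partial^{2} w_2}{\partial x^{2}}. $$

These are the equations (68) obtained above for the transverse waves.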
The ratio of the velocities of propagation of compressional and
shearing waves is

$$ \frac{c_{\mathrm{long}}}{c_{\mathrm{trans}}} = \sqrt{\frac{2(1-\sigma)}{1-2\sigma}}. $$

The value $\sigma = 1/4$, which holds for many substances, makes this
ratio equal to $\sqrt{3}$. In earthquakes, both types of wave are observed;
observation of the ratio of the velocities permits the calculation of $\sigma$
for that part of the interior of the earth in which the waves travel.
The value thus obtained is 0.29.
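The relation between the velocity ratio and Poisson's ratio can be tried out numerically. The sketch below is only illustrative: the function names are hypothetical, and the inversion formula sigma = (r^2 - 2) / (2 (r^2 - 1)) is obtained here by solving the ratio formula for sigma, a step not written out in the text.

```python
import math


def c_long(E, rho, sigma):
    """Velocity of longitudinal (compressional) waves in an unbounded medium."""
    return math.sqrt(E * (1 - sigma) / (rho * (1 + sigma) * (1 - 2 * sigma)))


def c_trans(E, rho, sigma):
    """Velocity of transverse (shear) waves in an unbounded medium."""
    return math.sqrt(E / (2 * rho * (1 + sigma)))


def velocity_ratio(sigma):
    """Ratio c_long / c_trans, which depends on Poisson's ratio only."""
    return math.sqrt(2 * (1 - sigma) / (1 - 2 * sigma))


def poisson_from_ratio(r):
    """Invert the ratio formula: r^2 (1 - 2 sigma) = 2 (1 - sigma)."""
    return (r**2 - 2) / (2 * (r**2 - 1))


if __name__ == "__main__":
    print(velocity_ratio(0.25))                      # compare with sqrt(3)
    print(poisson_from_ratio(velocity_ratio(0.29)))  # recovers the input value
```

This is the route by which an observed ratio of the two seismic velocities yields a value of Poisson's ratio without any knowledge of E or of the density.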

Apart from these waves in the body of the substance, there are 
longitudinal as well as transverse surface waves which follow the sur- 
face of separation of two media having different elastic properties. 
These waves correspond to the electrical surface waves treated on 
p. 345, and are called Rayleigh waves , after their discoverer. 

The propagation of waves in an unlimited medium is to be dis- 
tinguished from propagation along a bar whose transverse dimensions 
are small compared to its length. If we imagine the unbounded medium 
divided into such bars, whose axes are along the direction of propa- 
gation, we see that the boundary conditions are different for the single 
isolated bar and for a bar imagined cut from an unlimited medium. 
In the former case the transverse contraction can take place without 
hindrance, while the surface of the bar in the second case is elastically 
bound to the surroundings, and so a transverse contraction causes 
counter stresses. For this reason it is much easier to treat longitudinal 
waves in thin bars. We need not go back to the general equations, 
but can obtain the differential equation in the following simple way. 
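Consider an element of the bar included between two cross-sections
distant dx apart. Call the extension on the left side of this element

$$ \frac{\partial u}{\partial x}, $$

and that on the right side

$$ \frac{\partial u}{\partial x} + \frac{\partial^{2} u}{\partial x^{2}}\,dx. $$

Then, if the area of the section is A, and if Young's modulus is E, the
resultant force acting on the element is

$$ AE\left(\frac{\partial u}{\partial x} + \frac{\partial^{2} u}{\partial x^{2}}\,dx\right) - AE\,\frac{\partial u}{\partial x} = AE\,\frac{\partial^{2} u}{\partial x^{2}}\,dx, $$

and Newton's Second Law gives

$$ \rho A\,dx\,\frac{\partial^{2} u}{\partial t^{2}} = AE\,\frac{\partial^{2} u}{\partial x^{2}}\,dx, $$

or

$$ \frac{\partial^{2} u}{\partial x^{2}} = \frac{\rho}{E}\,\frac{\partial^{2} u}{\partial t^{2}}. \tag{76} $$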
The velocity of propagation of longitudinal (compressional) waves
in a bar is thus

$$ c = \sqrt{\frac{E}{\rho}}. $$
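To see how much the free transverse contraction lowers the velocity in a thin bar compared with the unbounded medium, the two expressions may be compared directly. This is a minimal sketch; the function names and the material constants (roughly those of steel) are assumptions chosen for illustration only.

```python
import math


def c_bar(E, rho):
    """Longitudinal wave velocity in a thin bar, sqrt(E / rho)."""
    return math.sqrt(E / rho)


def c_long_unbounded(E, rho, sigma):
    """Longitudinal wave velocity in an unbounded medium."""
    return math.sqrt(E * (1 - sigma) / (rho * (1 + sigma) * (1 - 2 * sigma)))


if __name__ == "__main__":
    # Assumed illustrative constants (SI units), not taken from the text.
    E, rho, sigma = 2.0e11, 7850.0, 0.29
    print(c_bar(E, rho), c_long_unbounded(E, rho, sigma))
```

For any Poisson's ratio between 0 and 1/2 the unbounded-medium velocity exceeds the bar velocity, and the two coincide only when the ratio is zero, i.e. when there is no transverse contraction to be hindered.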



8. Transverse Vibration of Stretched Strings and Membranes.

In many cases it proves expedient to start, not with the general 
equations — modifying them to fit the special problem in hand— 
but to proceed as we did in treating longitudinal waves in bars; viz. 
to derive the equations immediately from the fundamental principles 
of the theory of elasticity, thereby taking advantage of the simplifying 
conditions of the problem. The transverse vibrations of stretched 
strings and membranes form a case in point. These vibrations are not 
only of great physical importance, in acoustics, but their discussion 
leads to mathematical operations applicable in the most diverse 
branches of physics, especially in atomic physics. 

[Fig. 6]

By a string we mean an elastic body whose cross-section is so
small compared to its length, that it offers no appreciable resistance
to bending. The fact that the stiffness of an elastic bar diminishes as
the cross-section decreases follows from the results of § 6 (p. 176). As 
the cross-section is made smaller and smaller, we finally have only 
the fibres in the immediate neighbourhood of the central filament, 
and there are no longer any appreciable stresses opposing the bending. 
How is it possible, then, for a string to execute vibrations? The state 
of affairs is completely altered by the fact that the string is stretched 
between two fixed end points. Even in the equilibrium position, 
there exists a stress P normal to the uniform cross-sections A , i.e. the 
tension P is always in the direction of the tangent to the central line 
of the string. Because of the complete flexibility of the string, trans- 
verse forces do not exist. If the string is now drawn aside from its 
position of rest, the resulting increase in length causes an increase in 
tension. But even this increase in tension is not the true cause of the 
transverse vibrations; we shall see, below, that it is a small quantity 
of higher order than the true cause of the vibrations. The latter is 
much more direct: in the strained state, the tangents to the string, 
drawn at the ends of an element dx, have slightly different directions. 
As a consequence, there is a resultant force in the direction normal to 
the rest position of the string (see fig. 6). If we consider only vibrations
in one plane, and take the plane of vibration to be the xy-plane,
then the y-component of the displacement, which was previously denoted
by v, is simply y, since the rest position of the string is the x-axis.
In the displaced position, a force $-AP\,\mathbf{t}(x)$ acts at the left side of an
element dx, if we call the unit tangent vector $\mathbf{t}$. Similarly, a force
$AP\,\mathbf{t}(x + dx)$ acts at the right side. If $\alpha$ is the inclination to the
horizontal, the components of the resultant are

$$
\begin{aligned}
F_x &= AP\cos(\alpha + d\alpha) - AP\cos\alpha, \\
F_y &= AP\sin(\alpha + d\alpha) - AP\sin\alpha.
\end{aligned}
$$


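We limit the displacements to small values, so that we may set

$$ \cos\alpha = 1 - \frac{\alpha^{2}}{2}, \qquad \sin\alpha = \alpha = \tan\alpha = \frac{\partial y}{\partial x}. $$

Also, the change in the tension P is small compared with the value of
the tension itself. Neglecting all quantities of second order, the
resultant in the transverse direction becomes

$$ F_y = AP\,d\alpha = AP\,\frac{\partial^{2} y}{\partial x^{2}}\,dx. $$

(In the longitudinal direction the change in P would need to be taken
into account, since only terms of second order appear here.)

Using Newton's Second Law, the equation of motion of transverse
vibrations of the element becomes

$$ \rho A\,dx\,\frac{\partial^{2} y}{\partial t^{2}} = AP\,\frac{\partial^{2} y}{\partial x^{2}}\,dx \quad\text{or}\quad \frac{\partial^{2} y}{\partial x^{2}} = \frac{\rho}{P}\,\frac{\partial^{2} y}{\partial t^{2}}. \tag{80} $$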

This equation has the familiar form of the wave equation (cf. p. 62). 
It is to be noted that the tension enters here, but that the modulus 
of elasticity of the string does not appear. 

The solution

$$ y_1 = C\,e^{i\omega\left(t - x\sqrt{\rho/P}\right)}, $$

corresponding to a wave travelling in the direction of the positive
x-axis, satisfies the differential equation, but not the boundary
conditions, viz. y vanishes at every instant for x = 0 and for x = l, where
l is the length of the string. Similarly,

$$ y_2 = C\,e^{i\omega\left(t + x\sqrt{\rho/P}\right)} $$

is also a solution of the differential equation, and corresponds to
a wave moving in the opposite direction, but it, too, fails to satisfy
the end conditions. On account of the linearity of the differential
equation, the difference of these two particular solutions is also a
solution. Moreover, we can give at least certain particular values of $\omega$
for which the boundary conditions are satisfied. The difference is

$$ y = y_2 - y_1 = C\,e^{i\omega t}\left\{e^{i\omega x\sqrt{\rho/P}} - e^{-i\omega x\sqrt{\rho/P}}\right\} = 2iC\,e^{i\omega t}\sin\left(\omega x\sqrt{\rho/P}\right). \tag{81} $$

It is evident that the condition that y vanish for x = 0 is satisfied
for every value of t. On the other hand, y = 0 for x = l only when
$\omega l\sqrt{\rho/P}$ is an integral multiple of $\pi$. If n is any positive integer, there
is therefore a corresponding frequency $\nu_n$, given by

$$ \nu_n = \frac{n}{2l}\sqrt{\frac{P}{\rho}}. \tag{82} $$


We may thus consider the vibration of the string to be a standing wave 
resulting from superposition of two oppositely directed waves of the 
same frequency $\nu_n$. Application of the boundary conditions no longer
permits all arbitrary frequencies, but only a discrete set. This set,
it is true, contains infinitely many values, since n ranges from 1 to $\infty$.
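The discrete set of frequencies is easily tabulated. In the sketch below the function name and the sample wire data are hypothetical; note that P is a stress (force per unit area), as in the derivation above, so a given tensile force must first be divided by the cross-sectional area.

```python
import math


def string_frequency(n, length, stress, density):
    """n-th frequency nu_n = n / (2 l) * sqrt(P / rho) of a stretched string.

    stress is the tension per unit area P; density is the mass per unit volume.
    """
    return n / (2 * length) * math.sqrt(stress / density)


if __name__ == "__main__":
    # Assumed illustrative wire: 0.65 m long, 0.3 mm diameter, 70 N tensile force.
    length, diameter, force, density = 0.65, 0.3e-3, 70.0, 7850.0
    area = math.pi * (diameter / 2) ** 2
    stress = force / area
    for n in range(1, 5):
        print(n, string_frequency(n, length, stress, density))
```

The listed values are integral multiples of the lowest one, which is the statement that the boundary conditions admit only the harmonics of the fundamental.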

This result may be obtained in a somewhat different way. On
account of its fundamental importance, this method also will be
explained here. If, as usual, we assume equation (80) to have a solution
of the form

$$ y = w(x)\,e^{i\omega t}, \tag{83} $$

then we have, after removing the time factor $e^{i\omega t}$, the following
differential equation in x alone:

$$ \frac{d^{2} w}{dx^{2}} + \omega^{2}\,\frac{\rho}{P}\,w = 0. \tag{84} $$

If we put

$$ \lambda^{2} = \omega^{2}\,\frac{\rho}{P}, \tag{85} $$

this becomes

$$ \frac{d^{2} w}{dx^{2}} + \lambda^{2} w = 0. \tag{86} $$

This equation has a solution consistent with the boundary conditions
only for certain values of the parameter $\lambda^{2}$, which are called
characteristic values, or proper values. Taking, as a solution,

$$ w = \sin\lambda x, \tag{87} $$

we see that $\lambda$ must be an integral multiple of $\pi/l$, in order that y vanish
for x = l, i.e.

$$ \lambda_n = \frac{n\pi}{l}, \tag{88} $$

and thus, as above,

$$ \nu_n = \frac{n}{2l}\sqrt{\frac{P}{\rho}}. $$

The functions which correspond to the characteristic values, and which
are solutions of the differential equation, and which in addition
satisfy the boundary conditions, are called the characteristic functions,
or proper functions, of the differential equation.* In the present case,
the characteristic functions are the functions

$$ \sin\frac{n\pi x}{l}. $$

Since the real as well as the imaginary parts of equation (83) must
satisfy the differential equation separately, a solution which satisfies
the boundary conditions is given by

$$ a_n\cos 2\pi\nu_n t\,\sin\frac{n\pi x}{l} $$

as well as by

$$ b_n\sin 2\pi\nu_n t\,\sin\frac{n\pi x}{l}. $$

But the series

$$ y = \sum_{n=1}^{\infty} a_n\cos 2\pi\nu_n t\,\sin\frac{n\pi x}{l} + \sum_{n=1}^{\infty} b_n\sin 2\pi\nu_n t\,\sin\frac{n\pi x}{l} $$



is also a solution, since a sum of particular solutions also satisfies
the equation. In the case of an infinite sum, it must also be provided
that the series converges. The coefficients $a_n$ and $b_n$ are determined
by giving the initial displacement $y_0$ and initial velocity $\dot{y}_0$ as
functions of x: i.e. apart from the spatial boundary conditions to be
fulfilled at every instant, certain initial conditions must also be satisfied.
Now

$$ y_0(x) = \sum_{n=1}^{\infty} a_n\sin\frac{n\pi x}{l}, \tag{93} $$

$$ \dot{y}_0(x) = \sum_{n=1}^{\infty} 2\pi\nu_n b_n\sin\frac{n\pi x}{l}. $$



The determination of the coefficients follows the method developed
for those of the Fourier series (p. 54). The right member of (93) is
simply a Fourier series which is to represent the given function in the
interval 0 to l. To determine the coefficient $a_m$, for example, we
multiply both sides of the first equation by

$$ \sin\frac{m\pi x}{l} $$

and integrate from 0 to l. All terms on the right vanish, with the
exception of that containing

$$ \sin^{2}\frac{m\pi x}{l}, $$

which, by p. 55, gives the following value for $a_m$:

$$ a_m = \frac{2}{l}\int_0^{l} y_0(x)\,\sin\frac{m\pi x}{l}\,dx. \tag{95} $$

Similarly,

$$ b_m = \frac{1}{\pi\nu_m l}\int_0^{l} \dot{y}_0(x)\,\sin\frac{m\pi x}{l}\,dx. \tag{96} $$

* The German expressions for "characteristic value" and "characteristic
function" are Eigenwert and Eigenfunktion.


The vibrations corresponding to the various frequencies are of dif- 
ferent strength, depending upon the initial conditions. The vibrations 
of the string are transmitted to the ear by the intervening air, and — 
in so far as they lie within the audible range — are perceived as a tone 
whose pitch is given by the fundamental frequency $\nu_1$ and whose
quality, or timbre, is determined by the amplitudes of the various
harmonics, with frequencies $\nu_n$ which are multiples of $\nu_1$. Thus
equations (95) and (96) are the mathematical expressions of the fact that
a given string tone sounds different, according as it is produced by
plucking or by striking the string. As an example of the determination
of the coefficients, let us choose the struck string; this corresponds,
say, to a pianoforte string. For t = 0, let y be zero everywhere. Let
$\dot{y}_0$ be zero at all points except between (l − a)/2 and (l + a)/2, where
it is to have the value $v_0$. Thus a is the width of the hammer. Then
$a_m = 0$ and

$$ b_m = \frac{1}{\pi\nu_m l}\int_{(l-a)/2}^{(l+a)/2} v_0\,\sin\frac{m\pi x}{l}\,dx = \frac{v_0}{m\pi^{2}\nu_m}\left\{\cos\frac{m\pi}{l}\,\frac{l-a}{2} - \cos\frac{m\pi}{l}\,\frac{l+a}{2}\right\}. \tag{97} $$


If the string is not struck in the middle, as assumed here, the coefficients 
will have other values. 
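The coefficients of the struck string can be checked by comparing the closed form with a direct numerical evaluation of the integral. The following is a sketch under stated assumptions: the function names, the midpoint-rule integration and the sample data are all hypothetical and serve only to illustrate the calculation.

```python
import math


def nu(m, l, P, rho):
    """m-th frequency of the string, nu_m = m / (2 l) * sqrt(P / rho)."""
    return m / (2 * l) * math.sqrt(P / rho)


def b_closed_form(m, l, a, v0, P, rho):
    """Coefficient b_m for a hammer of width a striking the middle with speed v0."""
    k = m * math.pi / l
    bracket = math.cos(k * (l - a) / 2) - math.cos(k * (l + a) / 2)
    return v0 / (m * math.pi**2 * nu(m, l, P, rho)) * bracket


def b_numeric(m, l, a, v0, P, rho, steps=2000):
    """Same coefficient from the integral over the hammer width (midpoint rule)."""
    lo, hi = (l - a) / 2, (l + a) / 2
    h = (hi - lo) / steps
    total = sum(
        v0 * math.sin(m * math.pi * (lo + (i + 0.5) * h) / l) for i in range(steps)
    )
    return total * h / (math.pi * nu(m, l, P, rho) * l)


if __name__ == "__main__":
    # Assumed illustrative data (SI units): 1 m string, 5 cm hammer, v0 = 1 m/s.
    l, a, v0, P, rho = 1.0, 0.05, 1.0, 1.0e9, 7850.0
    for m in range(1, 6):
        print(m, b_closed_form(m, l, a, v0, P, rho), b_numeric(m, l, a, v0, P, rho))
```

For a blow at the middle the two cosines in the bracket coincide whenever m is even, so the even harmonics are absent; striking elsewhere changes this, in line with the remark above.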

It is not difficult to treat the two-dimensional analogue, the
vibrating membrane, in the same way. Here, again, let the membrane
be so thin that it possesses no appreciable stiffness. As in the
case of the string, it is only because the membrane is fastened at the
edge that restoring forces arise. Assume the bounding curve to be
plane, its plane being the xy-plane of our system of co-ordinates. The
constant tension P acts upon every element of area in a section normal
to the plane of the membrane. Consider an element dx dy in the
displaced position of the membrane. Then the tensions parallel to the
yz-plane have a resultant

$$ F_{1z} = DP\,dx\,dy\,\frac{\partial^{2} z}{\partial y^{2}}, $$

where D is the thickness of the membrane. The tensions in a direction
normal to this have a resultant amounting to

$$ F_{2z} = DP\,dx\,dy\,\frac{\partial^{2} z}{\partial x^{2}}. $$

Altogether, we have a total force acting in the direction of the z-axis
of amount

$$ DP\,dx\,dy\left(\frac{\partial^{2} z}{\partial x^{2}} + \frac{\partial^{2} z}{\partial y^{2}}\right). $$

Applying Newton's Second Law, the equation of motion for the element
under consideration becomes

$$ \rho D\,dx\,dy\,\frac{\partial^{2} z}{\partial t^{2}} = DP\,dx\,dy\left(\frac{\partial^{2} z}{\partial x^{2}} + \frac{\partial^{2} z}{\partial y^{2}}\right) $$

or

$$ \frac{\partial^{2} z}{\partial x^{2}} + \frac{\partial^{2} z}{\partial y^{2}} = \frac{\rho}{P}\,\frac{\partial^{2} z}{\partial t^{2}}. $$


The spatial boundary condition to be satisfied in addition to the dif- 
ferential equation is that z = 0 along the bounding curve at all times. 
Generally, at the boundary, we could prescribe definite temporally con- 
stant values of z or of dz/dx, but not of both, as one might conclude from 
the degree of the differential equation. It is easy to see that the prob- 
lem would be over-determined by specifying both z and dz/dx. Imagine 
the integration carried out numerically or graphically, piece by piece, 
in a strip parallel to the y-axis. We begin with a surface element 
whose position and orientation are given, and join the next element 
to this, the orientation being determined by the differential equation, 
which gives the increase of dz/dx. If we proceed in this way, we shall 
not necessarily encounter the boundary a second time; hence only 
the position or the orientation may be specified for the bounding curve. 

We again assume that the solution is of the form

$$ z = w(x, y)\,e^{i\omega t}, \tag{99} $$

and obtain, just as in the case of the string,

$$ \frac{\partial^{2} w}{\partial x^{2}} + \frac{\partial^{2} w}{\partial y^{2}} + \lambda^{2} w = 0. \tag{100} $$

In general, this equation, too, has a solution which satisfies the
boundary conditions only for a discrete sequence of characteristic values $\lambda_n^{2}$.
The nature of the corresponding characteristic functions depends
upon the boundary curve. They may be complicated higher
transcendental functions, e.g. Bessel functions, in the case of a circular
boundary. Nevertheless, for given initial conditions, the coefficients
of a series composed of characteristic functions may be determined
as for the Fourier series. Thus it can be shown that all the
characteristic functions of the equation $\Delta w + \lambda^{2} w = 0$, which vanish on
the boundary, have the property of orthogonality; i.e. in analogy to
the vanishing of the integrals

$$ \int_0^{l} \sin\frac{m\pi x}{l}\,\sin\frac{n\pi x}{l}\,dx \qquad (m \neq n), $$

we have in general

$$ \int w_m w_n\,d\tau = 0 \qquad (m \neq n), \tag{101} $$

the integral being extended over the entire range of w, i.e. in our case,
over the area of the membrane. We give at once the proof for the
three-dimensional case: If $f_m$ and $f_n$ are two different characteristic
functions, then, by Gauss's Theorem,

$$ \oint \left(f_m\,\mathrm{grad}\,f_n - f_n\,\mathrm{grad}\,f_m\right)\cdot d\mathbf{S} = \int \left(f_m\,\Delta f_n - f_n\,\Delta f_m\right)d\tau, \tag{102} $$

which, by (100), is equal to

$$ \int \left(\lambda_m^{2} - \lambda_n^{2}\right) f_m f_n\,d\tau. \tag{103} $$

The surface integral is to be extended over the boundary of the region,
the volume integral over the region itself. But $f_m$ and $f_n$ are always
zero on the boundary, and so the left member of (102) is zero. Since
$\lambda_m$ and $\lambda_n$ are in general different, we have

$$ \int f_m f_n\,d\tau = 0. \tag{104} $$

If we now write an arbitrary point function g(x, y, z) as the sum of
characteristic or proper functions $f_i$,

$$ g(x, y, z) = a_1 f_1 + a_2 f_2 + \dots + a_m f_m + \dots, \tag{105} $$

we may proceed to determine the coefficients exactly as with Fourier's
series. If we multiply both sides by $f_m$ and integrate over the entire
region (in the case of the membrane, over the entire area),
the orthogonality makes all terms of the series on the right, except

$$ \int a_m f_m^{2}\,d\tau, $$

vanish. The coefficients are thus

$$ a_m = \frac{\int g\,f_m\,d\tau}{\int f_m^{2}\,d\tau}. \tag{106} $$

The characteristic functions are determined only to a constant factor.
This multiplier is usually so chosen that the integral

$$ \int f_m^{2}\,d\tau $$

has the value 1. In this case the characteristic or proper functions
are said to be normalized to unity. Consult the additional discussion
on p. 703.
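The orthogonality property and the rule for the coefficients may be tried out on the simplest case, the characteristic functions sin(n pi x / l) of the string. The sketch below is illustrative only: the function names and the midpoint-rule quadrature are assumptions of this example, and the one-dimensional interval stands in for the general region.

```python
import math


def f(n, x, l):
    """Characteristic function of the string, sin(n pi x / l)."""
    return math.sin(n * math.pi * x / l)


def integrate(func, lo, hi, steps=4000):
    """Midpoint-rule approximation of the integral of func from lo to hi."""
    h = (hi - lo) / steps
    return h * sum(func(lo + (i + 0.5) * h) for i in range(steps))


def inner(m, n, l):
    """Integral of f_m f_n over the interval 0..l."""
    return integrate(lambda x: f(m, x, l) * f(n, x, l), 0.0, l)


def coefficient(g, m, l):
    """Expansion coefficient a_m = (integral of g f_m) / (integral of f_m^2)."""
    return integrate(lambda x: g(x) * f(m, x, l), 0.0, l) / inner(m, m, l)


if __name__ == "__main__":
    l = 1.0
    g = lambda x: 3 * f(1, x, l) - 2 * f(3, x, l)  # a known combination
    print(inner(2, 5, l))                           # vanishes (orthogonality)
    print([coefficient(g, m, l) for m in (1, 2, 3)])
```

Since the integral of the squared function over 0 to l is l/2, multiplying each function by sqrt(2/l) would normalize it to unity.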


Ex. 58. Give the solution of the differential equation for a rectangular
membrane of dimensions a and b. [Try a solution of the form w(x, y) = p(x)q(y).]



CHAPTER IX 


The Mechanics of Liquids and Gases 
(Hydro- and Aero-mechanics) 

1. Equilibrium of Fluid Bodies (Hydrostatics).

The liquid and gaseous states — especially the latter — are charac- 
terized by the fact that no resistance is offered to a change in shape 
which is unaccompanied by a change in volume. While liquids have 
extremely small compressibilities (i.e. the forces opposing a change 
in volume are large), those of gases are very great. Moreover, gases 
tend to expand, filling the entire available volume. If we apply to 
fluids the theory developed in the last chapter for deformable bodies, 
then the absence of rigidity of form is expressed by the fact that the 
shearing stresses $P_{12}$, $P_{23}$, $P_{31}$ are zero for an ideal non-viscous fluid.
(It is to be remembered that the term “ fluid ” includes both the 
liquid and gaseous states.) Actual fluids, it is true, are subject to 
tangential forces when in motion. If two layers slide over each other 
with a certain relative speed, they exert upon each other tangential 
forces proportional to the relative speed. Thus these forces vanish 
for a state of rest, so that it is not necessary to distinguish between 
viscous and non-viscous fluids in hydrostatics. The vanishing of the 
tangential (or shearing) stresses $P_{12}$, $P_{23}$, $P_{31}$ implies the vanishing of
the terms in xy, yz and zx in the equation of the tensor ellipsoid, and
this is true for every co-ordinate system. This means that every
co-ordinate system is a system of principal axes, which is possible only
if the ellipsoid is a sphere. Hence, in this case, all the normal stresses
$P_{11}$, $P_{22}$, $P_{33}$ must be equal. We set them equal to −p.* Thus a
normal pressure p , of equal strength in all directions, acts upon any 
element of volume of a fluid. As a consequence, the general conditions 
of equilibrium of a deformable body become greatly simplified. The 
system of equations (35) (p. 170) becomes 

$$ \mathbf{g} = \mathrm{grad}\,p. \tag{1} $$

* Since the stresses in the theory of elasticity are reckoned positive when
in the direction of the outward drawn normal to a closed surface, we must give the
inward pressure p the negative sign.

In hydromechanics it is customary to refer the body forces to unit
mass rather than to unit volume. If $\rho$ is the density of the fluid, and
$\mathbf{G}$ is the body force acting upon one gramme of fluid, then the body
force on unit volume is $\rho\mathbf{G}$, and the fundamental equation of
hydrostatics becomes

$$ \mathbf{G} = \frac{1}{\rho}\,\mathrm{grad}\,p. \tag{1a} $$


Just as it was necessary to introduce a relationship between the 
stress and strain tensors in elastomechanics, so a connexion between 
density and pressure must be supplied in hydrostatics. In general,
the compressibility of liquids may be neglected, and so $\rho$ may be
considered constant; for gases, the required connexion is given by Boyle's
Law (see below).

Of the many special problems of hydrostatics and aerostatics we
select two: the pressure gradient in the atmosphere and the form of
the surface of a rotating liquid. For the first problem, we take a
z-axis vertically upward, and the x- and y-axes in the horizontal plane,
which is the surface of the earth, since we limit ourselves to a small
portion of the surface. If the density at a height z is $\rho(z)$, then the
force acting upon unit mass of air is $-g\mathbf{k}$. Since x and y do not enter,
grad p reduces to $(dp/dz)\,\mathbf{k}$, and we have the equation

$$ \frac{dp}{dz} = -\rho g. \tag{2} $$

Now for gases at constant temperature, the density is proportional to
the pressure, by Boyle's Law; that is,

$$ \frac{\rho}{\rho_0} = \frac{p}{p_0}. \tag{3} $$

Using this relationship, equation (2) becomes

$$ \frac{dp}{p} = -\frac{\rho_0}{p_0}\,g\,dz. $$

Integrating between z = 0 (at the earth's surface) and z = z,

$$ \log\frac{p}{p_0} = -\frac{\rho_0}{p_0}\,g z $$

or

$$ p = p_0\,e^{-(\rho_0/p_0)\,g z}. \tag{4} $$


This formula is due to Halley. It is to be remembered that Halley’s 
formula is true only provided the temperature is constant at all levels. 
Nevertheless, it furnishes a fairly good representation of observed 
pressures at various altitudes. 
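Halley's formula is readily evaluated. In the sketch below the function names are hypothetical, and the default sea-level pressure, sea-level density and gravitational acceleration are assumed standard values (SI units), not figures from the text.

```python
import math


def pressure(z, p0=101325.0, rho0=1.225, g=9.81):
    """Isothermal pressure at height z: p = p0 * exp(-(rho0 / p0) * g * z)."""
    return p0 * math.exp(-rho0 * g * z / p0)


def scale_height(p0=101325.0, rho0=1.225, g=9.81):
    """Height p0 / (rho0 g) over which the pressure falls by a factor e."""
    return p0 / (rho0 * g)


if __name__ == "__main__":
    for z in (0.0, 1000.0, 5000.0, 10000.0):
        print(z, pressure(z))
    print(scale_height())
```

The quantity p0 / (rho0 g) is the only length in the problem, so the whole pressure profile is fixed once the pressure and density at the ground are known, provided the temperature is the same at all levels.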

The integration of the hydrostatic equation is particularly simple
if the density may be considered constant and if the external forces
are derivable from a potential U. In this case

$$ \mathbf{G} = -\mathrm{grad}\,U, \tag{5} $$

and (1a) becomes

$$ \mathrm{grad}\,U + \frac{1}{\rho}\,\mathrm{grad}\,p = 0. $$

If $\rho$ is constant, this equation may be integrated at once, giving

$$ U + \frac{p}{\rho} = \text{const.} \tag{6} $$

This equation states that, under the assumptions made, the surfaces 
of equal pressure are equipotential surfaces of the acting forces. Now 
the free surface of a liquid is always a surface of constant pressure, 
the pressure being that of the atmosphere. Hence the free surface is an 
equipotential surface. For a fluid at rest and subject only to gravity, 
this surface of equal potential is a sphere with centre at the earth’s 
centre and radius equal to the distance of the surface from this point. 

We next investigate the form of the surface of a liquid contained in
a vertical cylindrical vessel and rotating, along with the container,
about the axis of the cylinder with angular speed $\omega$. This is only
apparently a dynamical phenomenon. After a brief period of adjustment
there results a state where the liquid rotates as a whole, along
with the vessel, and relative movement of the particles no longer occurs.
Thus an observer rotating with the system notes a state of equilibrium.
Apart from gravity, the only force acting upon the individual
volume elements is the centrifugal force, which we may treat as an
externally applied force also. Taking the positive z-axis upward along
the axis of the cylinder, the potential of gravity becomes gz. The
centrifugal force is radial, and so is normal to the cylinders

$$ \sqrt{x^{2} + y^{2}} = r = \text{const.} $$

The force vector is directed outward and has the magnitude $\omega^{2} r$, and
hence is the negative of the gradient of a function

$$ -\frac{\omega^{2} r^{2}}{2}. $$

Then, according to equation (6), the surfaces of equal pressure,
including the free surface, are given by

$$ z - \frac{\omega^{2} r^{2}}{2g} = \text{const.}, \tag{7} $$

i.e. they are paraboloids of rotation whose meridian section has the
equation

$$ z - \frac{\omega^{2}}{2g}\,x^{2} = \text{const.} \tag{7'} $$

If the origin of co-ordinates is placed at the lowest point (vertex) of
this curve, the equation of the surface is

$$ z = \frac{\omega^{2}}{2g}\,r^{2}. $$



The vertex of the parabola is, of course, below the plane of the non- 
rotating liquid, while the liquid at the walls is higher than this plane. 
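How far the vertex sinks and the rim rises can be estimated by adding one assumption that is not stated in the text: the liquid is incompressible and none is lost, so the volume under the paraboloid equals the volume at rest. Under that assumption the vertex lies omega^2 R^2 / (4 g) below the original level and the liquid at the wall the same amount above it. The function names and sample values below are hypothetical.

```python
import math


def surface_height(r, omega, g=9.81):
    """Height of the free surface above its vertex, omega^2 r^2 / (2 g)."""
    return omega**2 * r**2 / (2 * g)


def vertex_depression(R, omega, g=9.81):
    """Depth of the vertex below the rest level (assumes volume conservation)."""
    return omega**2 * R**2 / (4 * g)


def wall_rise(R, omega, g=9.81):
    """Rise of the liquid at the wall above the rest level (same assumption)."""
    return surface_height(R, omega, g) - vertex_depression(R, omega, g)


def mean_surface_height(R, omega, g=9.81, steps=10000):
    """Area-weighted mean of the surface height over r <= R (midpoint rule)."""
    h = R / steps
    total = sum(
        surface_height((i + 0.5) * h, omega, g) * (i + 0.5) * h for i in range(steps)
    )
    return 2 * total * h / R**2


if __name__ == "__main__":
    R, omega = 0.10, 2 * math.pi  # assumed: 10 cm radius, one revolution per second
    print(vertex_depression(R, omega), wall_rise(R, omega))
    print(mean_surface_height(R, omega))  # agrees with the vertex depression
```

The numerically computed mean height of the paraboloid above its vertex equals the vertex depression, which is the volume-conservation statement in another form.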

It may be mentioned that the phenomenon just discussed has been used 
to produce a concave parabolic mirror for optical purposes by rotating a vessel 
of mercury. The method, however, was not very successful. 


Ex. 59. Using the hydrostatic equations, derive Archimedes' Principle,
which states that a body immersed in a fluid is buoyed up by a force equal to the 
weight of the displaced fluid. (Remember that the upward force is the resultant 
of the pressure forces.) 


2. The Fundamental Hydrodynamical Equations. 

If a fluid is not in a state of rest (equilibrium), we say that a con- 
dition of flow prevails. This state of flow is completely defined only 
if the flow velocity is known at every point at each instant. In prac- 
tice, however, we deal almost wholly with problems of stationary 
states, i.e. with cases in which the flow velocity vector at a given 
point does not change with time. If, for example, we inquire about 
the lift experienced by an aeroplane flying with uniform velocity, we 
may consider the aircraft stationary and the air blowing past it from 
infinity with uniform velocity. This is actually the arrangement used 
in wind tunnel experiments. The distribution of flow about the aero- 
plane is then steady. In the non-stationary state, two kinds of curve 
are to be distinguished: the lines of motion, or paths described by
single particles of the fluid in the course of time, and the stream lines,
which are defined as follows: At a given instant let the flow velocity 
vectors at each point be drawn in the form of small arrows. These 
arrows may be put together to form curves whose tangents at each 
point have the same direction as the arrows. In the non-stationary 
state, the picture of the stream lines changes from instant to instant. 
Consequently, a particle which moves from P to P' in a time dt has 
a velocity at P' different from that given by the stream-line picture 
corresponding to the time when the particle was still at P. In this 
way we see that the lines of motion and the stream lines coincide only in 
the stationary state. 

In hydromechanics, just as in elastomechanics, we perform the
transition from statics to kinetics by introducing the inertial force
for unit volume,

$$ -\rho\,\frac{d^{2}\mathbf{r}}{dt^{2}}, $$

by means of d'Alembert's principle, thus referring the kinetic problem
back to statics. There will be equilibrium between this force, the
body forces and the pressure forces when, according to equation (1a)
(p. 192),

$$ \rho\,\mathbf{G} - \rho\,\frac{d^{2}\mathbf{r}}{dt^{2}} = \mathrm{grad}\,p, $$

or

$$ \frac{d^{2}\mathbf{r}}{dt^{2}} = \mathbf{G} - \frac{1}{\rho}\,\mathrm{grad}\,p. \tag{8} $$

In anticipation of the most important special problem of stationary
flow we resolve the total rate of increase of velocity per unit time,
i.e. the vector $d^{2}\mathbf{r}/dt^{2} = d\mathbf{v}/dt$, into a purely temporal part and a
purely spatial part. The latter arises from the fact that, in the time
dt, the volume element moves to a place $\mathbf{r} + d\mathbf{r}$, where the velocity is
$\mathbf{v} + d\mathbf{v} = \mathbf{v} + (d\mathbf{r}\cdot\mathrm{grad})\,\mathbf{v}$. Reduced to unit time, this part of the
acceleration is

$$ \frac{d_r\mathbf{v}}{dt} = \left(\frac{d\mathbf{r}}{dt}\cdot\mathrm{grad}\right)\mathbf{v} = (\mathbf{v}\cdot\mathrm{grad})\,\mathbf{v}, $$

where $d_r$ signifies the spatial change.

For non-stationary motion, on the other hand, the flow configuration
changes during the time dt, so that the particle at r + dr is in a
state corresponding to the time increment dt. Neglecting terms of
higher order, we may put for this purely temporal change the value
obtaining at the point r. Let this acceleration be denoted by ∂v/∂t.
The total acceleration is equal to the sum of the purely temporal
and purely spatial rates of change of velocity, and so equation (8)
becomes

$$\frac{\partial\mathbf{v}}{\partial t} + \mathbf{v}\operatorname{grad}\cdot\mathbf{v} = \mathbf{G} - \frac{1}{\rho}\operatorname{grad} p. \qquad (10)$$

Besides this fundamental hydrodynamical equation we have another,
called the equation of continuity. The mass of fluid passing outward
through the surface of an arbitrary volume element, per second, is

$$\oint \rho\,\mathbf{v}\,d\mathbf{S}.$$

The scalar product v dS automatically provides that inward flow be
reckoned negative. The quantity flowing out must be equal to the
decrease, per second, in the amount of fluid within the volume element,
provided we rule out the possibility of the existence of sources or sinks
within the volume. That is,

$$\oint \rho\,\mathbf{v}\,d\mathbf{S} = -\frac{\partial}{\partial t}\int \rho\,d\tau.$$

If we convert the surface integral into a volume integral, using Gauss’s
theorem, then, since the equation must hold for every element of
volume, we have

$$\operatorname{div}(\rho\,\mathbf{v}) = -\frac{\partial\rho}{\partial t}. \qquad (12)$$

For incompressible fluids this equation of continuity simplifies to

$$\operatorname{div}\mathbf{v} = 0.$$
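As a small numerical illustration of the splitting of the acceleration and of the condition div v = 0, the following sketch uses the steady plane flow v = (x, −y). This field is not taken from the text; it is an assumed example, as are the function names and the finite-difference step h. For a steady field the temporal part ∂v/∂t vanishes, so that the whole acceleration is the spatial part v grad · v.

```python
def velocity(x, y):
    """Assumed example field: steady plane flow v = (x, -y)."""
    return (x, -y)


def partials(x, y, h=1e-5):
    """Central-difference estimates of dvx/dx, dvx/dy, dvy/dx, dvy/dy."""
    vx_xp, vy_xp = velocity(x + h, y)
    vx_xm, vy_xm = velocity(x - h, y)
    vx_yp, vy_yp = velocity(x, y + h)
    vx_ym, vy_ym = velocity(x, y - h)
    dvx_dx = (vx_xp - vx_xm) / (2 * h)
    dvy_dx = (vy_xp - vy_xm) / (2 * h)
    dvx_dy = (vx_yp - vx_ym) / (2 * h)
    dvy_dy = (vy_yp - vy_ym) / (2 * h)
    return dvx_dx, dvx_dy, dvy_dx, dvy_dy


def divergence(x, y):
    """div v = dvx/dx + dvy/dy (plane flow, no z-dependence)."""
    dvx_dx, _, _, dvy_dy = partials(x, y)
    return dvx_dx + dvy_dy


def spatial_acceleration(x, y):
    """Components of v grad . v, the purely spatial part of dv/dt."""
    vx, vy = velocity(x, y)
    dvx_dx, dvx_dy, dvy_dx, dvy_dy = partials(x, y)
    return (vx * dvx_dx + vy * dvx_dy, vx * dvy_dx + vy * dvy_dy)
```

At the point (2, 3) this gives an acceleration close to (2, 3), directed away from the origin although the field itself does not change in time, while the divergence vanishes to within rounding error.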



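In view of the meaning of v grad · v given on p. 32, the component
form of the general hydrodynamical equations may now be written

$$\begin{aligned}
\frac{\partial v_x}{\partial t} + v_x\frac{\partial v_x}{\partial x} + v_y\frac{\partial v_x}{\partial y} + v_z\frac{\partial v_x}{\partial z} &= G_x - \frac{1}{\rho}\frac{\partial p}{\partial x},\\
\frac{\partial v_y}{\partial t} + v_x\frac{\partial v_y}{\partial x} + v_y\frac{\partial v_y}{\partial y} + v_z\frac{\partial v_y}{\partial z} &= G_y - \frac{1}{\rho}\frac{\partial p}{\partial y},\\
\frac{\partial v_z}{\partial t} + v_x\frac{\partial v_z}{\partial x} + v_y\frac{\partial v_z}{\partial y} + v_z\frac{\partial v_z}{\partial z} &= G_z - \frac{1}{\rho}\frac{\partial p}{\partial z},
\end{aligned} \qquad (10a)$$

and the equation of continuity becomes

$$\frac{\partial(\rho v_x)}{\partial x} + \frac{\partial(\rho v_y)}{\partial y} + \frac{\partial(\rho v_z)}{\partial z} = -\frac{\partial\rho}{\partial t}. \qquad (12a)$$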


The co-ordinate form of the fundamental hydrodynamical equa- 
tions indicates the special mathematical difficulties arising in hydro- 
dynamics. Since products of velocity components with their deri- 
vatives occur, the equations, as compared with those which have pre- 
viously occurred, are not linear, and the rule for writing the general 
solution as the sum of particular solutions is no longer valid. This 
rule, which has wide applications to linear partial differential equations 
occurring in physics, e.g. the electrical field equations, has been used 
above, in the treatment of the vibrating string (p. 186). 
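
The failure of superposition can be seen in one dimension on the product term u ∂u/∂x alone. In the sketch below the two fields u₁ = x and u₂ = x² are arbitrary illustrative choices, not taken from the text, and the function names are likewise assumptions.

```python
def convective_term(u, x, h=1e-6):
    """Return u * du/dx at x, with du/dx from a central difference."""
    du_dx = (u(x + h) - u(x - h)) / (2 * h)
    return u(x) * du_dx


def u1(x):
    """First illustrative field."""
    return x


def u2(x):
    """Second illustrative field."""
    return x * x


def u_sum(x):
    """Superposition of the two fields."""
    return u1(x) + u2(x)


x0 = 1.0
# Sum of the terms formed from each field separately: 1*1 + 1*2.
separate = convective_term(u1, x0) + convective_term(u2, x0)
# Term formed from the superposed field: (1 + 1) * (1 + 2).
combined = convective_term(u_sum, x0)
```

The two results differ (about 3 against about 6), which is exactly the cross term that a linear equation would not contain.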

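We limit ourselves, in what now follows, to the case of stationary
flow, and so put all pure time derivatives equal to zero. We again
transform the resulting equations for the stationary state and obtain
a further classification of the flow patterns into rotational and
irrotational flow. From the vector formula

$$[\mathbf{A}\,\operatorname{curl}\mathbf{B}] = [\mathbf{A}\,[\nabla\mathbf{B}]] = \nabla(\mathbf{A}_c\mathbf{B}) - \mathbf{A}\nabla\cdot\mathbf{B} = \operatorname{grad}(\mathbf{A}_c\mathbf{B}) - \mathbf{A}\operatorname{grad}\cdot\mathbf{B}$$

derived on p. 38, we have for B = A:

$$[\mathbf{A}\,\operatorname{curl}\mathbf{A}] = \operatorname{grad}(\mathbf{A}_c\mathbf{A}) - \mathbf{A}\operatorname{grad}\cdot\mathbf{A} = \tfrac{1}{2}\operatorname{grad}A^2 - \mathbf{A}\operatorname{grad}\cdot\mathbf{A}.$$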




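If we apply this formula to the expression v grad · v, we have, for
stationary flow,

$$\tfrac{1}{2}\operatorname{grad} v^2 - [\mathbf{v}\,\operatorname{curl}\mathbf{v}] = \mathbf{G} - \frac{1}{\rho}\operatorname{grad} p. \qquad (13)$$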

The integration of the equations becomes relatively simple if we are
dealing with a type of flow for which curl v vanishes in the entire space.
Thus, if curl v is zero everywhere, then v itself may be represented
as the gradient of a scalar quantity Φ, the velocity potential, and the
problem is solved when the single scalar quantity Φ has been found
as a function of the co-ordinates.

Ex. 60. Using equations (10a), calculate the equilibrium rate of flow of a liquid
from a hole in the bottom of a vessel filled to a depth h.

Ex. 61. In the same way, calculate the velocity of escape of a gas confined
under a pressure p₁ − p₀ in excess of outside pressure. Assume that Boyle’s
Law, p/ρ = const., holds here, and neglect the gravity force acting upon the
gas.


3. Irrotational Flow. 

(a) Bernoulli's Equation 

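If the body force per gramme is derivable from a potential, which is
almost always the case, we have G = −grad U. In this instance, if
we integrate equation (13) along a stream line, ds and v have the
same direction, and so [v curl v] ds vanishes. Then

$$\int_{P_0}^{P} \tfrac{1}{2}(\operatorname{grad} v^2)\,d\mathbf{s} - \int_{P_0}^{P} [\mathbf{v}\,\operatorname{curl}\mathbf{v}]\,d\mathbf{s} = \int_{P_0}^{P} (-\operatorname{grad} U)\,d\mathbf{s} - \int_{P_0}^{P} \Big(\frac{1}{\rho}\operatorname{grad} p\Big)\,d\mathbf{s} \qquad (14)$$

becomes, for incompressible fluids,

$$\frac{v^2}{2} + U + \frac{p}{\rho} = \frac{v_0^2}{2} + U_0 + \frac{p_0}{\rho} = \text{const.} \qquad (15)$$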


The values of the constants are in general different for each stream
line. However, if the flow is irrotational, i.e. if curl v vanishes
throughout the entire liquid, then, by equation (14), the integration
may be performed along any arbitrary curve. In this case, the
constants must be the same for all stream lines. Reciprocally, we may
conclude that if the flow originates in an irrotational* region, where
the quantity

$$\frac{v_0^2}{2} + U_0 + \frac{p_0}{\rho}$$

has the same value for all stream lines, then the flow remains
irrotational in the entire region. This is true, for example, for the
outflow of liquid from an orifice in a vessel, since all three terms of
the expression

$$\frac{v_0^2}{2} + U_0 + \frac{p_0}{\rho}$$

have the same values all over the free surface.

* This limitation must be made on account of the possibility of curl v and
v being parallel.
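
For the outflow from an orifice just mentioned, equation (15) can be evaluated directly. The sketch assumes that the free surface sinks so slowly that its velocity may be neglected, that the pressure is the same at the free surface and in the emerging jet, and that U = gz with g = 9.81 m/s²; these assumptions and the function name are illustrative and are not stated in the text. Equation (15) then reduces to v²/2 = gh.

```python
import math


def efflux_speed(depth, g=9.81):
    """Speed of the jet from equation (15) with v0 = 0 and equal pressures.

    depth: height h of the free surface above the orifice, in metres.
    g:     assumed gravitational acceleration in m/s^2.
    """
    if depth < 0:
        raise ValueError("depth must be non-negative")
    return math.sqrt(2.0 * g * depth)
```

Under these assumptions the speed depends only on the depth h and not on the density of the liquid, since ρ cancels when the pressures are equal.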

(b) Calculation of irrotational (or potential) flow

The Bernoulli equation (15) shows that irrotational states of flow 
are not only mathematical simplifications, but also that they must 
occur very frequently in nature. If, for example, air is blown past an 
aeroplane model in wind-tunnel experiments to determine the lift, 
then it is true that the air entering the canal has a given velocity and 
a given pressure, and that the gravitational potential — which, more- 
over, is negligible here — has practically the same value for all stream 
lines. Consequently, according to the above laws, that flow pattern 
which corresponds to the solution based upon the assumption of 
absence of rotation is the one which will form about the model. 

This pure potential flow, however, leads to paradoxical results. 
For example, imagine an infinitely long circular cylinder immersed in
a uniform, parallel flow which is normal to the axis of the cylinder. 
According to the above principles, the fluid will exert no force on the 
cylinder. The discrepancy is to be attributed to the fact that even 
for fluids of low viscosity the friction at the surface of the body is not 
negligible. According to experience, the liquid clings to the body in 
the immediate neighbourhood of the surface, while pure potential 
flow requires smooth, unhindered motion. Due to the clinging of the 
liquid there is a large velocity gradient in the boundary layer and curl v 
no longer vanishes there. Although the viscosity is small, it causes 
the boundary layer — which in the limit is to be thought of as a sur- 
face of discontinuity (see p. 40) — to roll up and detach itself, producing 
individual vortices. Such surfaces of discontinuity also can form 
where two streams of different velocity meet, e.g. behind the trailing 
edge of an airfoil. The fact that the methods of potential flow are still 
usable, especially in the two-dimensional case, is due to the possibility 
of representing vortices mathematically and correcting in this way 
the potential flow. Such cases, however, are no longer irrotational in 
the strict sense of (a).* 

* In many books on hydrodynamics the term “turbulent” is reserved for vortex
motion in the narrower sense, while parallel flow having a velocity gradient normal to
the flow, so that curl v ≠ 0, is referred to as “non-irrotational”.

Let us now consider the mathematical treatment of irrotational
flow. We put

$$\mathbf{v} = \operatorname{grad}\Phi. \qquad (16)$$

The equation of continuity for incompressible fluids then becomes

$$\operatorname{div}\mathbf{v} = \operatorname{div}\operatorname{grad}\Phi = \Delta\Phi = 0, \qquad (17)$$

and the actual hydrodynamical equations become

$$\tfrac{1}{2}\operatorname{grad} v^2 = \mathbf{G} - \frac{1}{\rho}\operatorname{grad} p \qquad (18)$$

for steady flow. Thus every function Φ which satisfies the differential
equation ΔΦ = 0 represents the velocity potential of an irrotational
fluid motion. The problem is then to find the solution which
corresponds to the physical conditions of the special case under
consideration. With this solution at hand, the velocity field may be
obtained by taking the gradient, and the actual hydrodynamical
equations (18) are necessary only for the calculation of the pressure
distribution.
As the simplest example of potential flow in space, we investigate the
spherically symmetric solution of the equation ΔΦ = 0. Since Φ
depends on r alone, the equation has the familiar form (cf. p. 63)

$$\frac{d^2\Phi}{dr^2} + \frac{2}{r}\frac{d\Phi}{dr} = 0.$$

As is at once evident, a solution is

$$\Phi = \frac{a}{r} + b. \qquad (20)$$

By taking the gradient we obtain

$$\mathbf{v} = -\frac{a}{r^2}\,\mathbf{r}_0. \qquad (21)$$

This means that the flow is directed radially outward from (or inward
toward) the point O. In the former instance, a source exists at O; in
the second case, a sink. The constant a is determined by the volume
of fluid Q flowing through a sphere about O each second. This amounts
to

$$Q = -4\pi r^2\,\frac{a}{r^2} = -4\pi a. \qquad (22)$$

Thus a is determined if Q is given. The additive constant b does not
affect the flow, since it drops out when the gradient is taken. Usually,
however, the potential is so normalized that it vanishes at infinity,
in which case b = 0.
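
Equation (22) states that the flux through a sphere about O does not depend on its radius. A short numerical check, in which the function names and the step h are assumptions made only for this illustration, differentiates the potential (20) and multiplies by the area of the sphere.

```python
import math


def potential(r, a, b=0.0):
    """Potential (20): Phi = a/r + b."""
    return a / r + b


def radial_velocity(r, a, b=0.0, h=1e-6):
    """Radial component dPhi/dr, estimated by a central difference."""
    return (potential(r + h, a, b) - potential(r - h, a, b)) / (2 * h)


def flux_through_sphere(r, a, b=0.0):
    """Volume of fluid per second through a sphere of radius r about O."""
    return 4.0 * math.pi * r * r * radial_velocity(r, a, b)
```

For a = −1 the flux comes out close to 4π at every radius tried, and changing b leaves it unaltered, in agreement with the remark that the additive constant drops out.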

A problem which is rather more troublesome to solve is that of
the flow pattern resulting when a spherical obstacle is placed in a
parallel stream whose potential function is Φ = vx. The special
conditions in this case are: At a great distance from the sphere, the 
fluid motion must be the undisturbed parallel flow; and at the surface 
of the sphere, the normal components of the flow must vanish. The 
fluid, considered to be ideal and non-viscous, may move freely along 
the surface of the sphere. In this connexion, the newer hydrodynamics, 
which corresponds better with observation, differs from the classical 
theory. The more recent theory starts with the assumption of a cling- 
ing boundary layer, as indicated above (p. 198). 

We shall not reproduce the solution of this problem of the sphere 
here, since the essential point may be seen from the two-dimensional 
problem of the flow past an infinitely long cylinder, viz. that the 
resultant force on the obstacle vanishes, which is, of course, com- 
pletely at variance with experience. 

If the flow is taken to be two-dimensional, i.e. if the flow pattern 
is the same in all planes normal to the z-axis, then a large number of 
solutions of the potential equation can be obtained with the aid of 
conformal representation (p. 69). The connexion between the two- 
dimensional potential problem and conformal mapping of the com- 
plex plane is as follows. 

Introducing the potential function, we have

$$v_x = \frac{\partial\Phi}{\partial x}, \qquad v_y = \frac{\partial\Phi}{\partial y},$$

and div v = ΔΦ = 0 (equation of continuity). (17)

The condition for irrotational motion is

$$\frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y} = 0,$$

the co-ordinate z being absent. This condition is automatically fulfilled
by our assumption. If, now, we take another function Ψ and put

$$v_x = \frac{\partial\Psi}{\partial y}, \qquad v_y = -\frac{\partial\Psi}{\partial x}, \qquad (24)$$

then the equation of continuity is identically satisfied. In this case,
the condition that there be no rotation requires of Ψ that it satisfy the
equation ΔΨ = 0. If, then, in the two-dimensional case, we have any
function Ω which satisfies the potential equation, we can obtain a
system of stream lines from it in two ways: first, by taking the gradient;
secondly, by means of the correspondence (24). If, reciprocally, a given
flow is specified on one hand by the potential function Φ, and on the
other hand by the stream function Ψ according to equation (24), then
since the components v_x and v_y obtained in these two ways must be
equal, we have

$$\frac{\partial\Phi}{\partial x} = \frac{\partial\Psi}{\partial y}, \qquad \frac{\partial\Phi}{\partial y} = -\frac{\partial\Psi}{\partial x}.$$

But these are precisely the Cauchy-Riemann differential equations
(see p. 71), satisfied by the real part Φ and the imaginary part Ψ of a
function of the complex variable z = x + iy. Since, in addition, Φ
and Ψ satisfy Laplace’s equation, we may say that a flow pattern is
given by every complex function

$$\Omega = \Phi + i\Psi = f(x + iy); \qquad (26)$$

the curves Φ = const. are the equipotential lines, and their orthogonal
trajectories are the stream lines. But according to the theory of
conformal mapping, the curves Φ = const. are orthogonal to the curves
Ψ = const., so that the curves Ψ = const. are really the stream lines.
By interchanging the functions Φ and Ψ it follows that a second flow
pattern is given by the conformal representation if we take the curves
Ψ = const. to be the equipotentials and the curves Φ = const. to be the
stream lines. If we take the derivative*

$$w = \frac{d\Omega}{dz} = \frac{\partial\Omega}{\partial x} = \frac{\partial\Phi}{\partial x} + i\frac{\partial\Psi}{\partial x} = v_x - i v_y, \qquad (27)$$

we see that the complex number w agrees in magnitude with the flow
velocity, and is the reflection of the vector v in the x-axis. Hence w
is called the conjugate complex velocity; Ω is called the complex
potential.

* The equation dΩ/dz = ∂Ω/∂x follows from ∂Ω/∂x = (dΩ/dz)(∂z/∂x),
since ∂z/∂x = 1.

We now take a particular form for f(z), which, as will be seen at
once, corresponds to cylindrical symmetry, viz.

$$f(z) = \Phi + i\Psi = a\log z \quad\text{or}\quad z = e^{(\Phi + i\Psi)/a} = e^{\Phi/a}\Big(\cos\frac{\Psi}{a} + i\sin\frac{\Psi}{a}\Big). \qquad (28)$$

Separating reals and imaginaries, we find

$$x = e^{\Phi/a}\cos\frac{\Psi}{a}, \qquad y = e^{\Phi/a}\sin\frac{\Psi}{a}.$$
By eliminating Ψ or Φ, these equations give

$$x^2 + y^2 = e^{2\Phi/a} \quad\text{or}\quad \Phi = a\log r, \qquad (30)$$

and

$$\frac{y}{x} = \tan\frac{\Psi}{a}. \qquad (31)$$

The equipotential curves Φ = const. are circles about O; the
stream lines are therefore straight lines radiating from O (fig. 1, a).
This flow thus represents the two-dimensional analogue of the
spherically symmetric flow from a source, considered on p. 199. The
constant a is determined by the amount of fluid passing through unit
length of a circular cylinder whose axis contains the source.
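
The statement about the constant a can be checked from the complex potential itself. In the sketch below, which uses assumed helper names, an illustrative value of a and a finite-difference derivative, the velocity components are read off from w = dΩ/dz = v_x − i v_y as in equation (27), and the outward flow through a circle about O (unit length of the cylinder) comes out as 2πa for every radius.

```python
import cmath
import math


def complex_velocity(omega, z, h=1e-6):
    """w = dOmega/dz, estimated by a central difference along the x-axis."""
    return (omega(z + h) - omega(z - h)) / (2 * h)


def velocity(omega, z):
    """Velocity components from w = v_x - i v_y, equation (27)."""
    w = complex_velocity(omega, z)
    return (w.real, -w.imag)


def outward_flux(omega, radius, n=720):
    """Flow per unit time through a circle of the given radius about O."""
    total = 0.0
    dtheta = 2.0 * math.pi / n
    for k in range(n):
        theta = (k + 0.5) * dtheta          # midpoints avoid theta = pi exactly
        z = radius * cmath.exp(1j * theta)
        vx, vy = velocity(omega, z)
        v_normal = vx * math.cos(theta) + vy * math.sin(theta)
        total += v_normal * radius * dtheta
    return total


def source(z, a=2.0):
    """Complex potential (28), Omega = a log z, with an illustrative a."""
    return a * cmath.log(z)
```

With a = 2 the flux is close to 4π both for radius 1 and for radius 3, and the velocity at z = 1 + i is directed radially, with components close to (1, 1).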

On the other hand, if we interpret the curves Ψ = const. as
equipotentials, then the circles are stream lines (fig. 1, b). This flow
pattern is the prototype of a circulation. If the point O is isolated from
the field of flow by circumscribing a small circle, then curl v is zero in
all the remaining space. On the other hand, the line integral ∮ v ds taken
along one of the circles (or more generally, along a curve enclosing O),
no longer vanishes, for ds always has the direction of v along the
circles, and all contributions to the integral are positive. We shall
return to this type of flow in § 5 below.

Fig. 1

We now consider the two-dimensional analogue of the problem
of the sphere referred to above (p. 199), viz. the case of a circular
cylinder immersed in a parallel stream. We take a co-ordinate system
such that the undisturbed flow is parallel to the x-axis, and look for
a function Ω = f(z) which allows the entire Ω-plane to be mapped
on that portion of the z-plane which lies outside a circle of radius R.
It is further required that the normal components of the velocity
vanish at the surface of the cylinder, i.e. that the circle itself be a
stream line, for which Ψ = const., and finally, that the complex
velocity w = dΩ/dz shall pass over into the real quantity v = |v| for
large values of z. As will be shown immediately, the function

$$\Omega = v\Big(z + \frac{R^2}{z}\Big) \qquad (32)$$

fulfils these conditions. This is equivalent to

$$\Phi + i\Psi = vx\Big(1 + \frac{R^2}{x^2 + y^2}\Big) + ivy\Big(1 - \frac{R^2}{x^2 + y^2}\Big). \qquad (32')$$

By solving the equation for z, it will be seen that for every value of
Ω there is one, and only one value of z whose modulus is greater than
R; hence there is a unique correspondence between the points of the
Ω-plane and the portion of the z-plane outside the circle. Moreover,
we see that the curve Ψ = 0 gives the circle x² + y² = R² and the
straight line y = 0, i.e. the circle coincides with a stream line. Finally,
the value of Ω becomes vz for z large, and so the complex velocity
w = dΩ/dz becomes equal to the real quantity v.

Because of the complete symmetry of the flow pattern (fig. 2) in front
of and behind the obstacle, we see that the magnitude of the complex
velocity w is the same at any two points on the circle which are
symmetric with respect to the y-axis. From Bernoulli’s equation, it
follows that the pressures are the same at two such points, so that the
resultant of all the pressure forces vanishes. The concept of a simple
potential flow thus leads to a paradoxical result in this case. The
reasons for this have been given above (p. 198).
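
The two properties used above, that the circle is a stream line and that the speed is the same at points symmetric with respect to the y-axis, can be verified numerically from the potential Ω = v(z + R²/z) of equation (32). The values of v and R and the function names in the sketch are illustrative assumptions.

```python
import cmath
import math

V = 1.0   # speed of the undisturbed stream (illustrative value)
R = 1.0   # radius of the cylinder (illustrative value)


def omega(z):
    """Complex potential (32): Omega = v (z + R^2 / z)."""
    return V * (z + R * R / z)


def w(z):
    """Complex velocity dOmega/dz = v (1 - R^2 / z^2)."""
    return V * (1.0 - R * R / (z * z))


def on_cylinder(theta):
    """Point of the circle |z| = R at polar angle theta."""
    return R * cmath.exp(1j * theta)


def mirror_in_y_axis(z):
    """Reflect z = x + iy into -x + iy."""
    return complex(-z.real, z.imag)
```

Evaluating `omega(on_cylinder(theta)).imag` gives zero to rounding error for every angle, and `abs(w(z))` agrees with `abs(w(mirror_in_y_axis(z)))`; on the surface the speed works out to 2v|sin θ|, which follows directly from (32).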





4. General Theorems concerning Vortex Motion and Circulation* 


(a) Distinction between vorticity and circulation

We call the line integral ∮ v ds, taken along a closed curve C, the
circulation K along the curve. In view of Stokes’s theorem,

$$\oint \mathbf{v}\,d\mathbf{s} = \int (\operatorname{curl}\mathbf{v})\,d\mathbf{S},$$

we might at first sight expect that in a flow where curl v is zero
everywhere the circulation along every curve would have to vanish. But,
as has already been seen in the example of circular flow about a point
O, where O itself is excluded (fig. 1, b above), the circulation may, in
certain cases, differ from zero even when the flow is irrotational. This
contradiction arises because the region is no longer simply connected
after we cut out the point O, i.e. there now exist closed curves which
cannot be closed in to a point without passing out of the region. These
are all curves which enclose the small circle drawn about O. In the
proof given of Stokes’s theorem (p. 27), the simple connectivity of the
space was tacitly assumed. Hence, in the two-dimensional case, if
we make the region multiply connected by excluding a portion of the
plane, the circulation can be different from zero, despite the fact that
the flow is irrotational in the region under consideration. Stokes’s
theorem may be retained, however, if the excluded region is replaced
by suitably disposed vortices, i.e. if we take a certain distribution of
curl v on the boundary of the region. This procedure is especially
important in the three-dimensional case.
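
The distinction can be made concrete with a circulating flow about a point O. In the following sketch the complex velocity w = K/(2πi z) = v_x − i v_y of such a flow, the value of K, and the helper names are assumptions chosen for illustration. The line integral ∮ v ds is evaluated numerically along a circle; it returns K when the circle encloses O and zero when it does not, although curl v vanishes everywhere outside O.

```python
import cmath
import math

K0 = 3.0   # illustrative circulation of the flow about O


def vortex(z):
    """Complex velocity w = v_x - i v_y of a circulating flow about O."""
    return K0 / (2j * math.pi * z)


def circulation(w, radius, center=0j, n=2000):
    """Line integral of v ds along a circle, by the midpoint rule."""
    total = 0.0
    dtheta = 2.0 * math.pi / n
    for k in range(n):
        theta = (k + 0.5) * dtheta
        z = center + radius * cmath.exp(1j * theta)
        wz = w(z)
        vx, vy = wz.real, -wz.imag
        # Tangential line element of the circle, traversed counter-clockwise.
        dsx = -radius * math.sin(theta) * dtheta
        dsy = radius * math.cos(theta) * dtheta
        total += vx * dsx + vy * dsy
    return total
```

For example, `circulation(vortex, 1.0)` and `circulation(vortex, 5.0)` both give K0 to within rounding, while `circulation(vortex, 1.0, center=3 + 0j)`, a circle that leaves O outside, gives zero to within rounding.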


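(b) Thomson’s Theorem on the conservation of circulation

Consider the particles of the fluid which form a closed curve C₀ at
time t = 0. Since we rule out discontinuities in the body of the fluid,
particles originally adjacent will remain so as time goes on, i.e. these
particles will continue to form a closed curve which, however, may
undergo changes in shape. The location of a particle on a curve C
may be given by a single parameter λ, say the length of arc. We now
write the circulation along C₀:

$$K = \oint_{C_0} \mathbf{v}\,d\mathbf{s} = \oint_{C_0} \mathbf{v}\,\frac{d\mathbf{r}}{d\lambda}\,d\lambda.$$

Differentiate with respect to the time. Since λ has nothing to do with
time, the order of differentiation with respect to time and with respect
to λ may be reversed, and we have*

$$\frac{dK}{dt} = \oint_{C_0}\frac{d\mathbf{v}}{dt}\frac{d\mathbf{r}}{d\lambda}\,d\lambda + \oint_{C_0}\mathbf{v}\,\frac{d}{d\lambda}\frac{d\mathbf{r}}{dt}\,d\lambda = \oint_{C_0}\frac{d\mathbf{v}}{dt}\frac{d\mathbf{r}}{d\lambda}\,d\lambda + \oint_{C_0}\frac{d}{d\lambda}\Big(\frac{v^2}{2}\Big)\,d\lambda.$$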

We now assume that the external forces are conservative, i.e.

$$\frac{d\mathbf{v}}{dt} = \mathbf{G} - \frac{1}{\rho}\operatorname{grad} p = -\operatorname{grad}[U + f(p)] = -\operatorname{grad} V.$$

Then, since (grad V) dr/dλ = dV/dλ,

$$\frac{dK}{dt} = -\oint_{C_0}\frac{dV}{d\lambda}\,d\lambda + \oint_{C_0}\frac{d}{d\lambda}\Big(\frac{v^2}{2}\Big)\,d\lambda.$$

Inasmuch as the integrand functions V and v²/2 are definite functions
of position, the integrals taken along a closed curve C₀ vanish, and
we have

$$\frac{dK}{dt} = 0 \quad\text{or}\quad K = \text{const.} \qquad (35)$$

The circulation is constant for a closed curve composed of a given set of
fluid particles if the external forces are conservative. This result was
given by Sir W. Thomson (Lord Kelvin).

* Although two variables t and λ occur here, total derivatives are written, since
the partial derivatives ∂/∂t were used in another sense in § 2 (p. 195) above. Here
we are concerned with the total change of v in time, not merely the change occurring
when the position is held fixed.


(c) Helmholtz’s Vortex Theorems

A number of important theorems, found by Helmholtz in another
way, may be derived from the Thomson theorem. Imagine the field
lines of the vector curl v drawn, i.e. the curves whose tangents at any
point of space give the direction of curl v, or the axis of rotation ω
of the fluid particles. In order to obtain a measure of the strength as
well, we assume that a number of field lines proportional to curl v
pass through every square centimetre of a surface normal to curl v.
These field lines will now be called vortex lines.

From the general theorem of vector analysis

$$\operatorname{div}\operatorname{curl}\mathbf{v} = 0,$$

we conclude that the vortex lines can neither begin nor terminate within
the fluid; thus they are either closed curves or, in the case of a bounded
fluid, they begin and end on the surface.

A tube-like surface made up of vortex lines is called a vortex tube.
If the cross-section of this structure is so small that curl v may be
considered constant over it, we speak of a vortex filament. For such
vortex filaments we have the theorem that the product of the
cross-section by the magnitude of curl v, the so-called vortex strength,
is constant along the filament; for if we take the surface integral in (a)
over a portion of a vortex filament bounded by two cross-sections
A₁ and A₂ normal to curl v, the contribution of the lateral surface
drops out, since dS is normal to curl v everywhere on this surface,
and there remains only the contribution of the two ends A₁ and A₂.
Since no vortex lines can originate on the inside, and since curl v
has the same direction as dS at one end, and the opposite direction
at the other end,

$$|\operatorname{curl}\mathbf{v}|_1\,A_1 = |\operatorname{curl}\mathbf{v}|_2\,A_2. \qquad (36)$$

Further, we can show that every vortex filament always consists of the
same fluid particles, i.e. the filament travels along with the fluid.
Draw a closed curve C (fig. 3) which does not encircle the vortex
tube. By Stokes’s theorem, the circulation ∮ v ds along this curve is
zero, since curl v remains normal to dS at all points of the surface
enclosed by C. According to Thomson’s theorem, the circulation along a
curve composed of the particles which constitute the curve C at time t
must remain zero. This means that the curve C remains on the
surface of the vortex tube. Since these considerations are valid for
every curve on the lateral surface which does not encircle the tube,
we may imagine the entire surface made up of elements bounded by
such curves, the circulation being zero along their boundaries. This
means, however, that the vortex tube consists always of the same fluid
particles.

Fig. 3


This law holds for vortex filaments also, since a filament may be 
thought of as the common element of two vortex tubes. The motion 
of vortex filaments can be observed, for example, with smoke rings 
produced by blowing tobacco smoke from the mouth. The smoke 
particles serve to make the motion of the air visible. The vortex ring 
moves onward with the air expelled from the mouth. Rotation of the 
air around the axis of the ring may be observed only in the first stages 
of the motion, since it is soon destroyed by the viscosity of the air, 
which has been left out of account here. 

We now consider a curve R which encircles the vortex filament. 


The circulation along this curve is equal to the vortex strength of the 
filament, by Stokes’s theorem. Now by Thomson’s theorem, the cir- 
culation round this fluid curve is constant in time, so that the vortex 
strength must be constant also. We thus obtain the theorem: In a
frictionless fluid, the strengths of the vortex filaments are constant in time,
and hence vortices can be neither created nor destroyed.


The remarkable properties of circular vortex rings — their indestructibility, 
indivisibility, &c. — led Sir W. Thomson to formulate the ingenious theory 
that atoms are vortices in the ether. This idea was developed by Sir J. J. Thomson 
in an Adams Prize Essay (1883). The matter is only of historical interest now. 


(d) The law of Biot and Savart for the flow generated by a vortex filament

If the vector curl v is given as a function of position, the field of
flow may be calculated from it. We take the simple case of a single
vortex filament of strength K. This problem corresponds exactly
to the calculation of the electromagnetic field of a thin wire carrying
a current I, since for a curve of area A encircling the vortex filament,

if we integrate along the profile of the body, we have, since the obstacle
is bounded by stream lines,

$$\frac{dy}{dx} = \frac{v_y}{v_x},$$

or

$$v_x\,dy - v_y\,dx = 0. \qquad (39')$$

Consequently [cf. equation (27) (p. 201)],

$$K = \oint \mathbf{v}\,d\mathbf{s} = \oint [v_x\,dx + v_y\,dy + i(v_x\,dy - v_y\,dx)] = \oint w\,dz. \qquad (40)$$

The evaluation of the complex integral becomes particularly simple
if we use Cauchy’s Integral theorem. Since the disturbance vanishes
at a great distance from the body, i.e. the flow passes over into the
undisturbed uniform flow, the complex velocity must be representable
by a series in powers of 1/z, the point z = 0 being within the body:

$$w = w_\infty + \frac{a_{-1}}{z} + \frac{a_{-2}}{z^2} + \cdots \qquad (41)$$

According to Cauchy’s theorem (p. 72), we then have

$$K = \oint w\,dz = 2\pi i\,a_{-1}. \qquad (42)$$

(b) The Kutta-Joukowski lift formula

We can also calculate very simply, by means of complex functions,
the resultant pressure exerted by a stream upon an obstacle. The
force acting on unit length of a cylindrical obstacle is given by

$$\mathbf{F} = \oint p\,\mathbf{n}\,ds, \qquad (43)$$

where n is the direction of the normal directed toward the interior of
the body (fig. 4). Since the normal component of the velocity vanishes
at the surface of the obstacle, the integral is taken along a stream
line, and we can apply Bernoulli’s formula

$$p + \frac{\rho}{2}v^2 = k,$$
so that

$$\mathbf{F} = -\frac{\rho}{2}\oint v^2\,\mathbf{n}\,ds + \oint k\,\mathbf{n}\,ds. \qquad (44)$$

The second integral on the right vanishes, however, for a plane curve,
as may be shown readily. Thus, if we call the unit tangent vector t,
and the unit vector normal to the plane of the curve b, then n = [bt],
whence

$$\oint k\,\mathbf{n}\,ds = k\oint [\mathbf{b}\,\mathbf{t}]\,ds = k\Big[\mathbf{b}\oint \mathbf{t}\,ds\Big] = k\Big[\mathbf{b}\oint d\mathbf{s}\Big] = 0, \qquad (45)$$

since ∮ ds vanishes for every closed curve. In order to evaluate the
first integral, it is convenient to introduce a complex number

$$Z = F_y + iF_x, \qquad (46)$$

whose radius vector in the complex plane is thus a reflection of the
force vector in the line bisecting the first quadrant. Now, by equations
(43) and (44),

$$F_y = -\frac{\rho}{2}\oint v^2\cos(ny)\,ds, \qquad F_x = -\frac{\rho}{2}\oint v^2\cos(nx)\,ds. \qquad (47)$$

But from fig. 4 we see that

$$ds\cos(ny) = dx, \qquad ds\cos(nx) = -dy. \qquad (48)$$

We thus obtain

$$Z = -\frac{\rho}{2}\oint v^2\,(dx - i\,dy) = -\frac{\rho}{2}\oint (v_x^2 + v_y^2)(dx - i\,dy).$$

If we add under the integral sign the expression

$$2i\,(v_x\,dy - v_y\,dx)(v_x - i v_y),$$

which is zero, by equation (39') (p. 208), we obtain

$$Z = -\frac{\rho}{2}\oint (v_x^2 - 2i v_x v_y - v_y^2)(dx + i\,dy) = -\frac{\rho}{2}\oint w^2\,dz. \qquad (50)$$

Since the disturbance caused by the obstacle vanishes at a great
distance from it, the complex velocity w may be represented, according
to equation (41), by a series of descending powers of z, if we take the
origin within the obstacle:

$$w = w_\infty + \frac{a_{-1}}{z} + \frac{a_{-2}}{z^2} + \cdots \qquad (41)$$

The corresponding complex potential is, by integration,

$$\Omega = a_0 + w_\infty z + a_{-1}\log z - \frac{a_{-2}}{z} - \cdots \qquad (41')$$

By equation (42), the coefficient a₋₁ is connected with the circulation
by the equation

$$a_{-1} = \frac{K}{2\pi i}.$$

If we form the integral ∮ w² dz, we are concerned only with the term in
1/z. In squaring w, this term obtains the coefficient 2w_∞ a₋₁ and the
value of the integral becomes

$$\oint w^2\,dz = 2\pi i\cdot 2w_\infty\,\frac{K}{2\pi i} = 2Kw_\infty, \qquad (51)$$

whence, by (50),

$$Z = -\rho K w_\infty = -\rho K\,(v_{x\infty} - i v_{y\infty}). \qquad (52)$$

From the definition of Z, in (46), we thus have

$$F_x = I(Z) = +\rho K v_{y\infty}, \qquad F_y = R(Z) = -\rho K v_{x\infty}. \qquad (53)$$

These equations constitute the Kutta-Joukowski Lift Formula,
so named after its discoverers. It shows at once that the simple
potential flow about a circular cylinder causes no resultant force on the
cylinder, since the complex velocity

$$w = v - \frac{vR^2}{z^2},$$

which, by equation (32) (p. 202) occurs in that case, contributes no
term in 1/z to the integral ∮ w² dz. In reality, however, the breaking
away of the boundary layer from the cylinder causes a vortex region
to form behind the body, and this invalidates our calculations. The
turbulent detachment of the boundary layer can be checked by setting
the cylinder into rotation. In this case we have, superimposed upon
the solution (32) (p. 202), another flow which streams around the
cylinder in circular paths. The corresponding complex potential has
already been encountered in § 3 (p. 202), and is A log z. The complete
flow is then given by the potential function

$$\Omega = v\Big(z + \frac{R^2}{z}\Big) + A\log z, \qquad (54)$$

and the complex velocity

$$w = v - \frac{vR^2}{z^2} + \frac{A}{z}. \qquad (55)$$

The circulation is

$$K = 2\pi i A, \quad\text{i.e.}\quad A = \frac{K}{2\pi i}. \qquad (56)$$
If we assume complete adhesion of the boundary layer, the circulation
has the value 2πωR², where ω is the angular velocity of rotation of
the cylinder and R its radius. If the original flow is along the x-axis,
we have a force in the direction of the y-axis,

$$F_y = -2\pi\rho\,\omega R^2 v. \qquad (57)$$

The force is directed toward the side on which the stream and
peripheral velocities have the same direction, since K is positive
when the surface is on the left.* This phenomenon is known as the
Magnus Effect, after its discoverer. This somewhat unexpected
transverse force has recently had an interesting practical application
in the rotor ship.
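
The lift formula can be compared numerically with the contour integral from which it is derived. The sketch below evaluates Z = F_y + iF_x = −(ρ/2)∮ w² dz along the circle |z| = R for the complex velocity w = v − vR²/z² + A/z of the rotating cylinder, with A = K/(2πi) and K = 2πωR², and returns the two force components. The numerical values of ρ, v, R and ω, and the function names, are assumptions made purely for illustration.

```python
import cmath
import math


def force_from_w(w, rho, radius, n=2000):
    """Return (F_x, F_y) from Z = F_y + i F_x = -(rho/2) * integral of w^2 dz."""
    total = 0j
    dtheta = 2.0 * math.pi / n
    for k in range(n):
        theta = (k + 0.5) * dtheta
        z = radius * cmath.exp(1j * theta)
        dz = 1j * z * dtheta              # dz = i z d(theta) on the circle
        total += w(z) ** 2 * dz
    Z = -0.5 * rho * total
    return (Z.imag, Z.real)


def rotating_cylinder_velocity(v, R, K):
    """Complex velocity v - v R^2/z^2 + A/z with A = K / (2 pi i)."""
    A = K / (2j * math.pi)
    return lambda z: v - v * R * R / (z * z) + A / z


rho, v, R, spin = 1.2, 10.0, 0.5, 4.0     # illustrative values (SI units)
K = 2.0 * math.pi * spin * R * R          # circulation for complete adhesion
Fx, Fy = force_from_w(rotating_cylinder_velocity(v, R, K), rho, R)
```

With these values F_x vanishes to rounding error and F_y agrees with −ρKv = −2πρωR²v, while setting K = 0 gives no force at all, as stated for the simple potential flow.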

By means of relatively simple trans- 
formation functions the flow pattern of 
a combined parallel flow and circulation about a circular cylinder 
may be converted into the flow about a profile similar to the 
wing sections used in aeronautics. According to the magnitude of 
the circulation about the cylinder, we obtain flow patterns of types 
a or c (fig. 5). Case b represents the transition. In case a, where the
circulation is small, the flow would have to pass around the trailing 
edge to the upper surface; in case c, the flow would be required to 
leave from the lower surface. Since it is an empirical fact that a fluid 
does not pass around sharp corners in this way, the circulation which 
actually occurs must be that corresponding to case b, where the fluid 
leaves the profile smoothly at the trailing edge. Thus the lift of a given 
wing section may be calculated in advance if the section is obtainable 
from the circle by conformal transformation. It is to be remembered, 
however, that the two-dimensional computation is a somewhat rough 
approximation to the actual case. 

It may be asked how the creation of circulation about the profile
is to be reconciled with Thomson’s theorem concerning the constancy of the
circulation. If we imagine the wing section brought into an ordinary parallel
stream for which no circulation exists for any curve, then no circulation can
come about according to Thomson’s theorem. But we have already emphasized that
friction at the boundary layer cannot be neglected, even for a fluid of low
viscosity. As a consequence, Thomson’s theorem may be applied only to a fluid
curve enclosing the body and at a great distance from it. The circulation remains
zero along such a curve. To the same degree that a circulation develops about the
profile, an oppositely directed vortex, the so-called starting vortex†, develops
at the trailing edge. When its strength becomes equal to the final value of the
circulation about the profile, this vortex is shed downstream, and the flow pattern
remains steady thereafter. On account of the presence of both this vortex and
the circulation about the profile, the circulation remains zero about a curve
which encloses the profile and is at a great distance from it, since it encloses
both the vortex and the circulation about the profile.

† Ger. Anfahrwirbel.

6. Undulatory Propagation of Disturbances in Fluids. (Sound Waves.) 

Compressions and expansions which are propagated in liquids and
gases in the form of waves are among the non-stationary kinetic
phenomena lending themselves most readily to mathematical treatment.
The reason is that the velocities and density changes which
occur are so small that products of these quantities may be neglected.
For this case, then, we may strike out the term v grad · v from the
equations of motion (p. 195). In the absence of external forces these
equations then become

$$\frac{\partial\mathbf{v}}{\partial t} = -\frac{1}{\rho}\operatorname{grad} p. \qquad (58)$$

Let the density be represented by

$$\rho = \bar\rho\,[1 + \sigma(t, x, y, z)], \qquad (59)$$

where ρ̄ is the average density and σ is a measure of the departures
from this normal value. Putting this expression into the equation of
continuity (12) (p. 196), and neglecting terms of the second order, we
obtain after division by ρ̄:

$$\frac{\partial\sigma}{\partial t} + \operatorname{div}\mathbf{v} = 0. \qquad (60)$$

Since (58) is equivalent to three scalar equations, we have four
equations for the five variables v_x, v_y, v_z, p, σ; thus we need one more
equation. This comes from the relation between p and ρ, or between
p and σ. For gases, Boyle’s Law

$$\frac{p}{\bar p} = \frac{\rho}{\bar\rho}$$

(p̄ = average pressure) suggests itself. However, this leads to erroneous
values for the velocity of sound, the reason being that Boyle’s Law
holds for constant temperature, while the compressions and expansions
follow one another so rapidly here that the heat generated according
to the laws of thermodynamics (cf. p. 506) cannot be dissipated
sufficiently rapidly. We must apply the adiabatic law (p. 506),
which holds for complete thermal insulation. In terms of the density,
which is inversely proportional to the volume, this law is
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| = (0 = (1 + < t )* = 1 + y°, . . ■ (61) 

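where $\gamma$ is the ratio of the specific heat at constant pressure to that
at constant volume. Then

$$\operatorname{grad} p = \bar{p}\,\gamma\operatorname{grad}\sigma, \tag{62}$$

and, again neglecting products of small quantities,

$$\frac{1}{\rho}\operatorname{grad} p = \frac{\gamma\bar{p}}{\bar{\rho}}\operatorname{grad}\sigma. \tag{63}$$

If we substitute this value in the simplified hydrodynamical equation
(58), we obtain

$$\frac{\partial\mathbf{v}}{\partial t} = -\frac{\gamma\bar{p}}{\bar{\rho}}\operatorname{grad}\sigma. \tag{64}$$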

It is easy to eliminate $\mathbf{v}$ from equations (60) and (64). Since derivatives
with respect to position and time may be taken in either
order, we need only differentiate equation (60) once more with respect
to $t$, take the divergence of equation (64), and equate the two
resulting expressions for

$$\frac{\partial}{\partial t}(\operatorname{div}\mathbf{v}).$$

The result is

$$\operatorname{div}\operatorname{grad}\sigma = \Delta\sigma = \frac{\bar{\rho}}{\gamma\bar{p}}\frac{\partial^2\sigma}{\partial t^2}. \tag{65}$$

But according to p. 63, this is the differential equation of a wave
motion propagated with the wave velocity

$$w = \sqrt{\frac{\gamma\bar{p}}{\bar{\rho}}}. \tag{66}$$

Thus we have the formula for the velocity of sound in gases. The
average density $\bar{\rho}$ is still a function of the temperature. By p. 494
we have

$$\bar{\rho} = \frac{\rho_0}{1 + \alpha\theta}, \tag{67}$$

where $\rho_0$ is the density at 0° C., $\theta$ is the centigrade temperature, and
$\alpha$ is the coefficient of expansion of the gas. Hence

$$w = \sqrt{\frac{\gamma\bar{p}\,(1 + \alpha\theta)}{\rho_0}}. \tag{68}$$

Since $\rho_0$ is proportional to $\bar{p}$, $w$ is independent of the pressure.
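As a numerical illustration of the formula for the velocity of sound, the following minimal Python sketch evaluates $w = \sqrt{\gamma\bar{p}(1 + \alpha\theta)/\rho_0}$ and compares it with the value Boyle's Law would give (the same expression with $\gamma = 1$). The constants for dry air and the name `sound_speed` are assumptions introduced here for illustration; they are not taken from the text.

```python
import math


def sound_speed(gamma, p_mean, rho0, alpha, theta):
    """Velocity of sound w = sqrt(gamma * p_mean * (1 + alpha*theta) / rho0).

    gamma  : ratio of specific heats (gamma = 1 reproduces Boyle's Law)
    p_mean : average pressure in Pa
    rho0   : density at 0 deg C and pressure p_mean, in kg/m^3
    alpha  : coefficient of expansion of the gas, per degree
    theta  : centigrade temperature
    """
    return math.sqrt(gamma * p_mean * (1.0 + alpha * theta) / rho0)


# Assumed illustrative values for dry air (not from the text).
GAMMA_AIR = 1.40
P_MEAN = 101325.0        # Pa
RHO0_AIR = 1.293         # kg/m^3 at 0 deg C and P_MEAN
ALPHA = 1.0 / 273.15     # per degree

w_adiabatic_0 = sound_speed(GAMMA_AIR, P_MEAN, RHO0_AIR, ALPHA, 0.0)
w_isothermal_0 = sound_speed(1.0, P_MEAN, RHO0_AIR, ALPHA, 0.0)
w_adiabatic_20 = sound_speed(GAMMA_AIR, P_MEAN, RHO0_AIR, ALPHA, 20.0)

# Doubling the pressure doubles rho0 as well, so w should not change.
w_double_pressure = sound_speed(GAMMA_AIR, 2 * P_MEAN, 2 * RHO0_AIR, ALPHA, 0.0)

print(f"adiabatic,  0 deg C: {w_adiabatic_0:.1f} m/s")
print(f"isothermal, 0 deg C: {w_isothermal_0:.1f} m/s")
print(f"adiabatic, 20 deg C: {w_adiabatic_20:.1f} m/s")
```

With these assumed constants the adiabatic value at 0° C. comes out near 331 m/s, whereas the isothermal (Boyle) value is near 280 m/s, which shows the size of the error mentioned above.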

eibiEty Lg“ ** * the ““P™* 
7. Hydrodynamics of Viscous Fluids.

(a) Simple linear laminar flow; Poiseuille's Law 

The term “ laminar ” is applied to a flow pattern in which the 
fluid may be divided into parallel layers which flow past one another 
with various velocities. The simplest case is that where the velocities 
of all layers have the same direction, say that of the x-axis. As a 
result of internal friction (viscosity), the more rapidly moving layer 
tends to drag the neighbouring layer along with it, and thus accelerates 
it. Reciprocally, the slower layer tends to retard the faster one. The 
correct definition of this force goes back to Newton. The force is 
proportional to the common area of contact S of the two layers and 
to the velocity gradient normal to the direction of flow. Thus if the
velocity $v$ increases along the positive z-axis, the layer just above
that for which $z = 0$ exerts a tangential force upon an area $S$ of the
latter of amount

$$F = \eta S\,\frac{dv}{dz}. \tag{69}$$

In this case the force is in the direction of the positive x-axis, and
thus is to be reckoned positive. The factor of proportionality $\eta$ is
called the coefficient of viscosity. Laminar flow occurs, for example,
when a viscous fluid flows slowly through a circular cylindrical tube.
From experience we know that the liquid adheres to the walls; hence
the velocity increases as we go toward the axis of the tube, and $dv/dr$
is negative. Take the element of volume to be a hollow cylinder of
fluid with radii $r$ and $r + dr$. The adjacent layer on the inside exerts
a force on this cylinder which is along the positive x-axis and amounts to

$$F_1 = -2\pi\eta l\,r\,\frac{dv}{dr},$$

where $l$ is the length of the tube. On account of the decrease of velocity
toward the outside, the contiguous layer of fluid on the outside tends
to retard the cylinder with a force

$$F_2 = +2\pi\eta l\,(r + dr)\left(\frac{dv}{dr} + \frac{d^2v}{dr^2}\,dr\right).$$

If the pressure at the inlet end of the tube is $p_1$ and that at the outlet
$p_0$, then the resultant hydrostatic force on the ends of the volume
element is $2\pi r\,dr\,(p_1 - p_0)$. In the stationary state corresponding to
uniform motion the pressure force and the forces of viscosity must be
in equilibrium. Hence, neglecting higher order terms,

$$2\pi\eta l\,\frac{dv}{dr} + 2\pi\eta l\,r\,\frac{d^2v}{dr^2} = 2\pi\eta l\,\frac{d}{dr}\left(r\,\frac{dv}{dr}\right) = -2\pi r\,(p_1 - p_0). \tag{70}$$
This equation may be integrated at once:

$$r\,\frac{dv}{dr} = -\frac{(p_1 - p_0)\,r^2}{2\eta l} + C. \tag{71}$$

Integrating a second time,

$$v = -\frac{(p_1 - p_0)\,r^2}{4\eta l} + C\log r + D. \tag{72}$$

The constants of integration $C$ and $D$ are determined in the following
way: Since $v$ must remain finite along the axis ($r = 0$), $C$ must be
zero. Moreover, $v$ must vanish at the wall, where $r = a$. Hence,

$$D = \frac{(p_1 - p_0)\,a^2}{4\eta l}.$$

The solution is thus

$$v = \frac{p_1 - p_0}{4\eta l}\,(a^2 - r^2). \tag{73}$$

If the velocity vectors of individual fluid elements are drawn with
their initial points in a given cross-section, their termini lie on a surface
of rotation whose meridian section is a parabola. In this case
we speak of a parabolic velocity profile. The total volume $W$ issuing
per second is obtained by taking the integral $\int v\,dS$ over a cross-section.
In this case we have

$$W = \int_0^a \frac{p_1 - p_0}{4\eta l}\,(a^2 - r^2)\,2\pi r\,dr = \frac{\pi\,(p_1 - p_0)\,a^4}{8\eta l}. \tag{74}$$


This is Poiseuille’s Formula, which states that the quantity of fluid 
issuing each second is directly proportional to the pressure difference
and to the fourth power of the radius of the tube, and inversely pro- 
portional to the length of the tube and to the coefficient of viscosity. 
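The integration leading to Poiseuille's Formula can be checked numerically. The sketch below, in Python, integrates the parabolic profile $v(r) = (p_1 - p_0)(a^2 - r^2)/(4\eta l)$ over the cross-section with a simple midpoint rule and compares the result with $\pi(p_1 - p_0)a^4/(8\eta l)$. The function names and the numerical values (roughly those of water in a tube of 1 mm radius) are assumptions chosen only for illustration.

```python
import math


def poiseuille_velocity(r, a, dp, eta, length):
    """Parabolic profile: v(r) = dp * (a^2 - r^2) / (4 * eta * length)."""
    return dp * (a * a - r * r) / (4.0 * eta * length)


def flux_by_quadrature(a, dp, eta, length, n=10_000):
    """Midpoint-rule estimate of W = integral from 0 to a of v(r) * 2*pi*r dr."""
    h = a / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * h  # midpoint of the i-th annulus
        total += poiseuille_velocity(r, a, dp, eta, length) * 2.0 * math.pi * r * h
    return total


def flux_poiseuille(a, dp, eta, length):
    """Closed form: W = pi * dp * a^4 / (8 * eta * length)."""
    return math.pi * dp * a ** 4 / (8.0 * eta * length)


# Assumed illustrative values in SI units (not from the text).
A = 1.0e-3       # tube radius, m
DP = 100.0       # pressure difference p1 - p0, Pa
ETA = 1.0e-3     # coefficient of viscosity, Pa s
LENGTH = 0.1     # tube length, m

W_numeric = flux_by_quadrature(A, DP, ETA, LENGTH)
W_formula = flux_poiseuille(A, DP, ETA, LENGTH)
W_double_radius = flux_poiseuille(2.0 * A, DP, ETA, LENGTH)

print(f"quadrature : {W_numeric:.6e} m^3/s")
print(f"closed form: {W_formula:.6e} m^3/s")
print(f"ratio for doubled radius: {W_double_radius / W_formula:.1f}")
```

The last line illustrates the fourth-power dependence on the radius: doubling $a$ multiplies the flux by sixteen.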

Simple laminar flow, however, is not the only form which can
develop in a tube. At greater velocities the flow breaks down into
another state where (1) the velocity of efflux is no longer proportional
to the pressure difference, but increases more slowly, (2) the values
of velocity and pressure at a given point are not constant in time, as
before, but oscillate about mean values, and (3) the paths of individual
particles are no longer rectilinear; there is a continual intermixing of
fluid elements, as may easily be demonstrated by injecting filaments
of coloured liquid. This state of flow is called turbulence. The initiation
of turbulence is determined by the value of a dimensionless quantity
called Reynolds's number, after its discoverer. This quantity, for a
circular tube of radius $a$, is given by

$$R = \frac{\rho a v}{\eta}, \tag{75}$$

where $\rho$ is the density of the fluid and $v$ is its velocity. There is a
rather definite lower limit of the value of R corresponding to the 
beginning of turbulence. Turbulent flow has not been produced for 
values of R below about 2200. Whether or not turbulence appears 
for values of R above 2200 depends upon the degree to which the fluid 
is free of disturbances, especially at the inlet. The theory of turbulence
and of the breakdown of laminar flow is still an incompletely solved 
problem of hydrodynamics. 


(b) Extension to arbitrary states of flow


In order to calculate the stresses on a volume element of a viscous 
fluid we can avail ourselves of the formulae of the theory of elasticity, 
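provided we bear in mind that in place of the displacements $u$, $v$, $w$
(p. 162), their time derivatives appear, since the viscous forces operate
only when motion occurs. But these derivatives are merely the components
of the flow velocity. Thus we must put into equations (21)
(p. 166),

$$\begin{aligned}
e_{11} &= \frac{\partial v_x}{\partial x}, & e_{22} &= \frac{\partial v_y}{\partial y}, & e_{33} &= \frac{\partial v_z}{\partial z},\\
e_{12} &= \frac{\partial v_x}{\partial y} + \frac{\partial v_y}{\partial x}, & e_{23} &= \frac{\partial v_y}{\partial z} + \frac{\partial v_z}{\partial y}, & e_{31} &= \frac{\partial v_z}{\partial x} + \frac{\partial v_x}{\partial z}.
\end{aligned} \tag{76}$$

We shall assume, in what follows, that the fluid is incompressible;
hence $e_{11} + e_{22} + e_{33} = 0$. We must now introduce the coefficient of
viscosity $\eta$ in place of the elastic constants $E$ and $\sigma$. To see how this
is to be done, we return to the above special case of linear laminar flow.
Let the flow now be along the y-axis, and let the velocity increase in
the direction of the positive x-axis. Thus

$$v_x = 0,\qquad v_y = v,\qquad v_z = 0. \tag{77}$$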


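The elastic equations of p. 174 give the shearing stress on unit area
normal to the x-axis, acting in the direction of the y-axis, as

$$\frac{E}{2(1 + \sigma)}\,e_{12} = \frac{E}{2(1 + \sigma)}\frac{dv}{dx}, \tag{78}$$

while equation (69) (p. 214) gives

$$\eta\,\frac{dv}{dx}$$

for this quantity. Thus

$$\frac{E}{2(1 + \sigma)}$$

is to be replaced by $\eta$, and we have

$$\begin{aligned}
P_{11} &= 2\eta\,\frac{\partial v_x}{\partial x}, & P_{12} &= \eta\left(\frac{\partial v_x}{\partial y} + \frac{\partial v_y}{\partial x}\right) = P_{21},\\
P_{22} &= 2\eta\,\frac{\partial v_y}{\partial y}, & P_{23} &= \eta\left(\frac{\partial v_y}{\partial z} + \frac{\partial v_z}{\partial y}\right) = P_{32},\\
P_{33} &= 2\eta\,\frac{\partial v_z}{\partial z}, & P_{31} &= \eta\left(\frac{\partial v_z}{\partial x} + \frac{\partial v_x}{\partial z}\right) = P_{13}.
\end{aligned} \tag{79}$$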


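According to p. 170, the components of the resultant of the stress
forces per cubic centimetre become

$$F_x = \frac{\partial P_{11}}{\partial x} + \frac{\partial P_{12}}{\partial y} + \frac{\partial P_{13}}{\partial z} = \operatorname{div}\mathbf{P}_1 = 2\eta\,\frac{\partial^2 v_x}{\partial x^2} + \eta\left(\frac{\partial^2 v_x}{\partial y^2} + \frac{\partial^2 v_y}{\partial x\,\partial y}\right) + \eta\left(\frac{\partial^2 v_x}{\partial z^2} + \frac{\partial^2 v_z}{\partial x\,\partial z}\right), \tag{80}$$

or

$$F_x = \eta\left(\Delta v_x + \frac{\partial}{\partial x}\operatorname{div}\mathbf{v}\right) = \eta\,\Delta v_x,$$

since $\operatorname{div}\mathbf{v} = 0$. Similarly,

$$F_y = \eta\,\Delta v_y,\qquad F_z = \eta\,\Delta v_z. \tag{81}$$

If these forces are added to the previously considered forces acting on
unit volume of a fluid, we obtain the additional term $\eta\,\Delta\mathbf{v}$ in the vector
formulation. But by the familiar vector formula

$$\operatorname{curl}\operatorname{curl}\mathbf{v} = \operatorname{grad}\operatorname{div}\mathbf{v} - \Delta\mathbf{v},$$

the viscosity term $\eta\,\Delta\mathbf{v}$ becomes simply $-\eta\operatorname{curl}\operatorname{curl}\mathbf{v}$, since $\operatorname{div}\mathbf{v} = 0$.
If, as above, we denote the external force per gramme by $\mathbf{G}$, the fundamental
hydrodynamical equation for incompressible viscous fluids becomes

$$\frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\,\mathbf{v} = \mathbf{G} - \frac{1}{\rho}\operatorname{grad} p - \frac{\eta}{\rho}\operatorname{curl}\operatorname{curl}\mathbf{v}. \tag{82}$$

If the stresses due to viscosity are allowed to remain in the equation,
this expression becomes, using (80),

$$\frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\operatorname{grad})\,\mathbf{v} = \mathbf{G} - \frac{1}{\rho}\operatorname{grad} p + \frac{1}{\rho}\,(\mathbf{i}\operatorname{div}\mathbf{P}_1 + \mathbf{j}\operatorname{div}\mathbf{P}_2 + \mathbf{k}\operatorname{div}\mathbf{P}_3). \tag{83}$$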
(c) Stokes's Law for the fluid resistance to a moving sphere


Stokes was the first to succeed in determining the resistance ex- 
perienced by a sphere moving uniformly through a viscous fluid. 
Naturally, it amounts to the same thing if we consider the sphere 
fixed and the fluid to be flowing by it with the opposite velocity $\mathbf{v}_0$.
We are thus concerned with finding a special solution of (82) which
corresponds to the flow about a spherical obstacle and calculating
the resultant force acting on the sphere from the flow pattern obtained.
Stokes assumed the speed to be so small that the expression $(\mathbf{v}\cdot\operatorname{grad})\,\mathbf{v}$,
containing the products of the velocity components and their derivatives,
could be neglected. That is, in the stationary state, which alone
interests us, the left member of (82) may be set equal to zero. This
approximation, however, leads to erroneous results in the case of a
cylinder, as Stokes himself pointed out. As Oseen has shown, the
term $(\mathbf{v}\cdot\operatorname{grad})\,\mathbf{v}$ must be taken into account, even for low speeds, at
least to the extent that it is included in the form $(\mathbf{v}_0\cdot\operatorname{grad})\,\mathbf{v}$, i.e. the
flow $\mathbf{v}$ is replaced by the undisturbed flow $\mathbf{v}_0$ except in the derivatives
of velocity. For the sphere, this furnishes only a small correction term, 
Stokes’s analysis being thus justified, and his result verified. For the 
sake of simplicity, we shall therefore give a brief reproduction of 
Stokes’s mathematics. Since we are concerned only with viscous 
forces, we leave the external force — the force of gravity — out of con- 
sideration. Its only effect is to produce a hydrostatic buoyancy. We 
proceed, then, by replacing all forces not connected with viscosity by 
the hydrostatic forces, since the velocity is low. However, in computing 
the total force, these contributions must not be forgotten. Thus, in 
the problem of a sphere falling through a viscous liquid we must not 
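forget the buoyant force. In order to calculate the viscous forces, we
thus consider the fluid freed from gravity, and the hydrodynamic
equations reduce to

$$\operatorname{grad} p = \eta\,\Delta\mathbf{v};\qquad \operatorname{div}\mathbf{v} = 0. \tag{84}$$

We try a solution for $\mathbf{v}$ which is the sum of a potential flow $\mathbf{v}_1 = \operatorname{grad}\Phi$
and a flow $\mathbf{v}_2$ not derivable from a potential. Then

$$\operatorname{grad} p = \eta\,\Delta\operatorname{grad}\Phi + \eta\,\Delta\mathbf{v}_2 = \eta\operatorname{grad}\Delta\Phi + \eta\,\Delta\mathbf{v}_2. \tag{85}$$

We further assume that $\mathbf{v}_2$ satisfies the equation $\Delta\mathbf{v}_2 = 0$. Integration
of (85) then gives

$$p = \eta\,\Delta\Phi + p_0. \tag{86}$$

The condition of incompressibility, $\operatorname{div}\mathbf{v} = 0$, furnishes the additional
equation

$$\operatorname{div}\operatorname{grad}\Phi + \operatorname{div}\mathbf{v}_2 = \Delta\Phi + \operatorname{div}\mathbf{v}_2 = 0. \tag{87}$$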


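The simplest assumption we can make regarding $\mathbf{v}_2$ is that there is
only one component $v_{2x}$ different from zero. This quantity must then
satisfy the equation $\Delta v_{2x} = 0$. The simplest solution of this equation,
corresponding to spherical symmetry, is

$$\frac{1}{r}. \tag{88}$$

Thus we put

$$\mathbf{v}_2 = \frac{a}{r}\,\mathbf{i}. \tag{89}$$

According to (87), this gives the following differential equation for $\Phi$:

$$\Delta\Phi + a\,\frac{\partial}{\partial x}\frac{1}{r} = 0. \tag{90}$$

We may verify in a moment that a particular integral of this equation is

$$\Phi = -\frac{a}{2}\frac{\partial r}{\partial x}, \tag{91}$$

for this gives

$$\Delta\Phi = -\frac{a}{2}\,\Delta\frac{\partial r}{\partial x} = -\frac{a}{2}\frac{\partial}{\partial x}\left(\frac{\partial^2 r}{\partial x^2} + \frac{\partial^2 r}{\partial y^2} + \frac{\partial^2 r}{\partial z^2}\right) = -\frac{a}{2}\frac{\partial}{\partial x}\left(\frac{3}{r} - \frac{r^2}{r^3}\right) = -a\,\frac{\partial}{\partial x}\frac{1}{r}.$$

In order to have sufficient constants to satisfy the boundary conditions,
we add to this solution functions which satisfy the equation
$\Delta\Phi = 0$. Thus these functions drop out when substituted in equation
(90). Since our previous particular solutions vanish at infinity, we must
add a term which fulfils the condition that the flow at a great distance
from the sphere is parallel to the x-axis. A suitable term is $v_0 x$. Another
function which satisfies $\Delta\Phi = 0$ is, for example, $b/r$. But further, on
account of the interchangeability of the operators $\partial/\partial x$ and $\Delta$, all
derivatives of $1/r$ also satisfy the equation $\Delta\Phi = 0$. We shall show
that the boundary conditions can be satisfied by adding the solution

$$b\,\frac{\partial}{\partial x}\frac{1}{r};$$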
3 - 
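our assumed solution is thus

$$\mathbf{v} = \operatorname{grad}\left(v_0 x + b\,\frac{\partial}{\partial x}\frac{1}{r} - \frac{a}{2}\frac{\partial r}{\partial x}\right) + \frac{a}{r}\,\mathbf{i}.$$

Written in component form this is

$$\begin{aligned}
v_x &= v_0 + b\,\frac{\partial^2}{\partial x^2}\frac{1}{r} - \frac{a}{2}\frac{\partial^2 r}{\partial x^2} + \frac{a}{r} = v_0 + b\left(\frac{3x^2}{r^5} - \frac{1}{r^3}\right) + \frac{a}{2}\left(\frac{1}{r} + \frac{x^2}{r^3}\right),\\
v_y &= b\,\frac{\partial^2}{\partial x\,\partial y}\frac{1}{r} - \frac{a}{2}\frac{\partial^2 r}{\partial x\,\partial y} = 3b\,\frac{xy}{r^5} + \frac{a}{2}\frac{xy}{r^3},\\
v_z &= b\,\frac{\partial^2}{\partial x\,\partial z}\frac{1}{r} - \frac{a}{2}\frac{\partial^2 r}{\partial x\,\partial z} = 3b\,\frac{xz}{r^5} + \frac{a}{2}\frac{xz}{r^3}.
\end{aligned} \tag{92}$$

For $r = R$ all components of $\mathbf{v}$ must vanish for all values of $x$, $y$, $z$.
This is true for the y- and the z-components if we put

$$b = -\frac{a R^2}{6}. \tag{93}$$

Then the coefficient of $x^2$ in the x-component vanishes of itself, and
by putting the remaining terms equal to zero we obtain

$$a = -\frac{3}{2}\,v_0 R \quad\text{and}\quad b = \frac{v_0 R^3}{4}. \tag{94}$$

The solution corresponding to the boundary conditions is then

$$\begin{aligned}
v_x &= v_0\left(1 - \frac{3R}{4r} - \frac{R^3}{4r^3}\right) - \frac{3}{4}\frac{v_0 R}{r^3}\left(1 - \frac{R^2}{r^2}\right)x^2,\\
v_y &= -\frac{3}{4}\frac{v_0 R}{r^3}\left(1 - \frac{R^2}{r^2}\right)xy,\\
v_z &= -\frac{3}{4}\frac{v_0 R}{r^3}\left(1 - \frac{R^2}{r^2}\right)xz.
\end{aligned} \tag{95}$$

By equations (86), (90), and (94) the pressure becomes

$$p = p_0 + \eta\,\Delta\Phi = p_0 - a\eta\,\frac{\partial}{\partial x}\frac{1}{r} = p_0 - \frac{3}{2}\frac{v_0\eta R}{r^3}\,x. \tag{96}$$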


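The resultant force acting on the sphere is obtained by evaluating the
stresses from the velocity components by equation (79) (p. 217), and
taking the integral of the stresses and of the pressures over the sphere.
We thus have (cf. p. 168):

$$\mathbf{F} = -\oint p\,d\mathbf{S} + \oint\{\mathbf{P}_1\cos(n, i) + \mathbf{P}_2\cos(n, j) + \mathbf{P}_3\cos(n, k)\}\,dS, \tag{97}$$

the pressure having the negative sign (see footnote, p. 191). On account
of symmetry, the resultant can have only an x-component. The value
of this is

$$F_x = F = -\oint p\,\mathbf{i}\cdot d\mathbf{S} + \oint\{P_{11}\cos(n, i) + P_{21}\cos(n, j) + P_{31}\cos(n, k)\}\,dS. \tag{98}$$

Because of the symmetry of the stress tensor ($P_{12} = P_{21}$), we have

$$\{P_{11}\cos(n, i) + P_{12}\cos(n, j) + P_{13}\cos(n, k)\}\,dS = \mathbf{P}_1\cdot d\mathbf{S}, \tag{99}$$

and so the resultant becomes

$$F = \oint(\mathbf{P}_1 - p\,\mathbf{i})\cdot d\mathbf{S}. \tag{100}$$

This integral is to be taken over the sphere of radius $R$. We can save
much calculation, however, by noting that the divergence of $(\mathbf{P}_1 - p\,\mathbf{i})$
vanishes. For

$$\operatorname{div}(\mathbf{P}_1 - p\,\mathbf{i}) = \operatorname{div}\mathbf{P}_1 - \frac{\partial p}{\partial x},$$

but this expression is zero, according to our assumptions, as may be
seen from the component form of equation (83) (p. 217). Application
of Gauss's theorem then shows that the integral has the same value if
taken over a concentric sphere whose radius is allowed to become
infinitely large. Since the surface element of this sphere increases as
the second power of $r$, we can drop all terms of higher order than $1/r$,
e.g. terms like $x^2/r^5$, in (92). For $r$ large, the velocity components
then simplify to

$$v_{x\infty} = v_0 + \frac{a}{2r} + \frac{a}{2}\frac{x^2}{r^3},\qquad v_{y\infty} = \frac{a}{2}\frac{xy}{r^3},\qquad v_{z\infty} = \frac{a}{2}\frac{xz}{r^3}.$$

Here $a$ stands for $-\tfrac{3}{2}v_0 R$. In order to calculate the stresses we need
the derivatives

$$\begin{aligned}
\frac{\partial v_x}{\partial x} &= \frac{a}{2}\left(\frac{x}{r^3} - \frac{3x^3}{r^5}\right),\\
\frac{\partial v_x}{\partial y} &= -\frac{a}{2}\left(\frac{y}{r^3} + \frac{3x^2 y}{r^5}\right), & \frac{\partial v_y}{\partial x} &= \frac{a}{2}\left(\frac{y}{r^3} - \frac{3x^2 y}{r^5}\right),\\
\frac{\partial v_x}{\partial z} &= -\frac{a}{2}\left(\frac{z}{r^3} + \frac{3x^2 z}{r^5}\right), & \frac{\partial v_z}{\partial x} &= \frac{a}{2}\left(\frac{z}{r^3} - \frac{3x^2 z}{r^5}\right).
\end{aligned} \tag{101}$$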
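Then by equations (79) and (96),

$$\begin{aligned}
F &= \oint(\mathbf{P}_1 - p\,\mathbf{i})\cdot d\mathbf{S}\\
&= \oint\left\{a\eta\,\frac{x}{r^3}\left(1 - \frac{3x^2}{r^2}\right)\mathbf{i} - 3a\eta\,\frac{x^2 y}{r^5}\,\mathbf{j} - 3a\eta\,\frac{x^2 z}{r^5}\,\mathbf{k} - \left(p_0 + a\eta\,\frac{x}{r^3}\right)\mathbf{i}\right\}\cdot d\mathbf{S}\\
&= -\oint 3a\eta\,\frac{x^3}{r^5}\,\mathbf{i}\cdot d\mathbf{S} - \oint 3a\eta\,\frac{x^2 y}{r^5}\,\mathbf{j}\cdot d\mathbf{S} - \oint 3a\eta\,\frac{x^2 z}{r^5}\,\mathbf{k}\cdot d\mathbf{S} - \oint p_0\,\mathbf{i}\cdot d\mathbf{S}\\
&= -3a\eta\oint\frac{x^2}{r^5}\,(x\mathbf{i} + y\mathbf{j} + z\mathbf{k})\cdot d\mathbf{S} - \oint p_0\,\mathbf{i}\cdot d\mathbf{S}\\
&= -3a\eta\oint\frac{x^2}{r^5}\,\mathbf{r}\cdot d\mathbf{S} - \oint p_0\,\mathbf{i}\cdot d\mathbf{S}.
\end{aligned} \tag{102}$$

The last integral vanishes on account of the constancy of the integrand.
In the first one,

$$\mathbf{r}\cdot d\mathbf{S} = r\cdot r^2\sin\theta\,d\theta\,d\phi,\qquad \frac{x^2}{r^2} = \cos^2\theta, \tag{103}$$

so that

$$F = -3a\eta\int_0^{2\pi}\!\!\int_0^{\pi}\cos^2\theta\,\sin\theta\,d\theta\,d\phi = +2\pi a\eta\,\bigl[\cos^3\theta\bigr]_0^{\pi} = -4\pi a\eta. \tag{104}$$

Inserting the value of $a$ as given by (94) (p. 220), we obtain the final
formula

$$F = 6\pi\eta v_0 R. \tag{105}$$

The force is thus directly proportional to the radius of the sphere and
to the relative velocity of sphere and fluid. This law is of especial
importance in the study of the atomic nature of electricity (cf. pp. 418
and 425). The law has been well substantiated for spheres falling with
low velocities in viscous fluids.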
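The two key properties of Stokes's solution can be checked numerically: the velocity field must vanish on the sphere $r = R$ and tend to the undisturbed flow $v_0$ along the x-axis at great distances, and the angular integral for the force must reproduce $6\pi\eta v_0 R$. The Python sketch below does this for the velocity components $v_x = v_0(1 - 3R/4r - R^3/4r^3) - \tfrac{3}{4}(v_0 R/r^3)(1 - R^2/r^2)x^2$, $v_y$ and $v_z$ analogous, and for $F = -3a\eta\int\!\!\int\cos^2\theta\sin\theta\,d\theta\,d\phi$ with $a = -\tfrac{3}{2}v_0 R$. The function names and the numerical values are assumptions made only for illustration.

```python
import math


def stokes_velocity(x, y, z, v0, R):
    """Velocity components of Stokes's flow past a fixed sphere of radius R."""
    r = math.sqrt(x * x + y * y + z * z)
    # Common factor of the terms proportional to x^2, xy and xz.
    common = 0.75 * v0 * R / r ** 3 * (1.0 - R * R / (r * r))
    vx = v0 * (1.0 - 3.0 * R / (4.0 * r) - R ** 3 / (4.0 * r ** 3)) - common * x * x
    vy = -common * x * y
    vz = -common * x * z
    return vx, vy, vz


def stokes_force_by_quadrature(eta, v0, R, n=2000):
    """F = -3*a*eta * (integral over phi) * (integral of cos^2 sin over theta)."""
    a = -1.5 * v0 * R
    h = math.pi / n
    theta_integral = sum(
        math.cos((i + 0.5) * h) ** 2 * math.sin((i + 0.5) * h) for i in range(n)
    ) * h  # midpoint rule; the exact value is 2/3
    return -3.0 * a * eta * 2.0 * math.pi * theta_integral


def stokes_force(eta, v0, R):
    """Closed form F = 6 * pi * eta * v0 * R."""
    return 6.0 * math.pi * eta * v0 * R


# Assumed illustrative values in SI units (not from the text).
ETA = 1.0e-3   # coefficient of viscosity, Pa s
V0 = 1.0e-3    # undisturbed flow speed, m/s
R = 1.0e-4     # sphere radius, m

v_surface = stokes_velocity(0.6 * R, 0.0, 0.8 * R, V0, R)     # a point with r = R
v_far = stokes_velocity(3.0e5 * R, 4.0e5 * R, 0.0, V0, R)     # a point with r = 5e5 R

F_numeric = stokes_force_by_quadrature(ETA, V0, R)
F_formula = stokes_force(ETA, V0, R)

print("velocity on the sphere:", v_surface)
print("velocity far away     :", v_far)
print(f"force: quadrature {F_numeric:.6e} N, closed form {F_formula:.6e} N")
```

The quadrature only tests the final angular integration, not the stress calculation that precedes it.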


8. Surface Tension of Liquids. 

(a) Definition of the surface tension constant

A large number of familiar phenomena (for example, those associated
with soap bubbles) indicate that the surface of a liquid
behaves like a stretched membrane, and that work
must be done to increase the area of the surface.

Fig. 6

What causes this tension? In order to explain the
cohesion of liquids we must assume that the mole- 
cules are subject to mutually attractive forces which 
decrease rapidly as the distance increases, and are 
therefore of little effect in the gaseous state.* A group 
of molecules shielded from all external forces would 
thus assume the configuration in which the particles are at the mini- 
mum distance apart, i.e. a sphere. Every deformation of the sphere 
enlarges the surface and increases the average distance apart of the 
molecules, thus requiring that work be done against the attractive 
forces. If we pursue these ideas mathematically, and calculate the 
total potential energy of a quantity of liquid, we obtain a part 
proportional to the volume and another proportional to the surface. 
Since the former increases as the cube of the linear dimensions, the 
latter as the square, the surface phenomena will be more prominent 
when the volume considered is small. 

We begin by considering the surface energy

$$U = \alpha S. \tag{106}$$

The factor of proportionality $\alpha$ is called the surface tension. If we
mark off a portion of the surface (fig. 6) by means of a closed curve $C$,
then a force $\tau$, lying in the surface, acts on each unit length of this
curve. Let the curve $C$ pass into the curve $C'$ when the surface is
enlarged. Let $\delta n$ be the displacement of an element of arc $ds$
in the direction of the normal to $C$ lying in the surface. Then
the work done against surface tension in displacing the curve $C$ is

$$-\delta W = \oint\tau\,ds\,\delta n. \tag{107}$$

Since $\tau$ has the same value at all points of the surface, as in the case
of a membrane, it may be brought outside the sign of integration.
Also, since $\oint ds\,\delta n$ is equal to the increase in area $\delta S$, we have by (106)
and (107),

$$\delta U = -\delta W = \tau\,\delta S = \alpha\,\delta S. \tag{108}$$

The surface tension $\alpha$ is therefore equal to the force $\tau$ acting upon unit
length of a bounding curve.


* Gravitational attraction must be assumed to exist between molecules, but it is
of a much lower order of magnitude than the true molecular forces which [...]
come into play [...] together. It is these forces which cause the molecules
to occupy space.
(b) Normal pressure at a curved surface. Differential equation of the surface

We consider first a cylindrical surface (fig. 7) and consider the
forces acting upon a surface element bounded by two generators and two
curves normal to them. Since the tensions acting on these two curves lie in
the surface, they are parallel to the generators, and hence can produce no
resultant normal to the surface. On the other hand, the tensions acting on the
generators form an angle $d\phi$ with one another, and have components (fig. 7)
directed toward the concave side of the surface. Calling the length of the
generators $ds_1$, that of the other bounding curves $ds_2$, the inward
component of the force acting on each generator amounts to $\alpha\,ds_1\,d\phi/2$.
With $R$ the radius of curvature of the section normal to the generators
($ds_2 = R\,d\phi$), the resultant normal force on the element is therefore

$$dF = \alpha\,ds_1\,d\phi = \alpha\,\frac{ds_1\,ds_2}{R} = \alpha\,\frac{dS}{R},$$

i.e. the normal pressure is

$$p = \frac{\alpha}{R}. \tag{109}$$

Fig. 7

For a surface of double curvature, we bound the surface element by
two pairs of neighbouring lines of curvature. The lines of curvature
are characterized by the fact that neighbouring normals to the surface,
at points along these lines, intersect each other, and that the directions
of their tangents are those of the normal sections of greatest and least
curvature of the surface. Since the two families of lines of curvature
are orthogonal, the above considerations concerning the cylinder are
valid for each pair of sides of an element of a doubly-curved surface.
The total normal force is then

$$dF = \alpha\,dS\left(\frac{1}{R_1} + \frac{1}{R_2}\right),$$

and the inward-directed normal pressure is thus

$$p = \alpha\left(\frac{1}{R_1} + \frac{1}{R_2}\right). \tag{110}$$

The principal radii of curvature $R_1$ and $R_2$ are to be considered positive
if the corresponding centre of curvature lies within the fluid. The
normal pressure may assume negative values for concave surfaces,
i.e. it becomes a tension.

To compute the equilibrium form of the surface we apply the principle of
virtual displacements. Imagine the surface deformed in such a manner (fig. 8)
that each element is given a displacement $\delta n$ in the direction of its
normal. Calling the displacement positive when directed outward, the work
done against the normal pressure caused by surface tension, which is equal
to the increase in potential energy, is

$$\delta U_1 = \int\alpha\left(\frac{1}{R_1} + \frac{1}{R_2}\right)dS\,\delta n. \tag{111}$$

Fig. 8

In addition to this change of potential energy there is another change
connected with the potential of the external forces, in particular
gravity; for fluid may be transferred to points of different potential
as a result of the displacement. Since the energy-changes accompanying
the virtual displacements are small compared with the actual
differences of potential along the surface, we may, in calculating this
change of potential energy, take the values for the undeformed surface,
say $\Phi$ per unit mass. Then this part of the energy change is

$$\delta U_2 = \int\rho\,\Phi\,dS\,\delta n. \tag{112}$$

Altogether we have, for equilibrium,

$$\delta U = \int\alpha\left(\frac{1}{R_1} + \frac{1}{R_2}\right)dS\,\delta n + \int\rho\,\Phi\,dS\,\delta n = 0. \tag{113}$$

The displacements $\delta n$, however, are not entirely arbitrary. They must
be so related that the volume of the liquid is not altered. The
change in volume is given by the integral $\int dS\,\delta n$, which must vanish.
Hence the vanishing of the integral of equation (113) does not allow
us to conclude that the integrand is zero. We multiply the auxiliary condition

$$\delta V = \int dS\,\delta n = 0 \tag{114}$$

by an undetermined multiplier $\lambda$, which we may write under the
integral sign, since it is constant over the surface. We then add this
equation to (113). Now, from the same considerations as on p. 116,
we may conclude that the vanishing of the integral implies the vanishing
of the integrand, whence

$$\alpha\left(\frac{1}{R_1} + \frac{1}{R_2}\right) + \rho\,\Phi + \lambda = 0. \tag{115}$$

This is the differential equation of the surface of a liquid. The constant
$\lambda$ must be determined from the known values of $R_1$, $R_2$ and $\Phi$
at some point on the surface. If, for example, we consider the rise of
a liquid surface at the walls of a vessel, $\lambda$ may be given readily for the
middle point, where the surface may be considered
plane. It is also convenient to reckon the gravitational
potential from this point, so that $\lambda = 0$ in this
case.

(c) Boundary conditions

In order to integrate the differential equation, the
boundary conditions must also be known. The usual
case is that in which the liquid is confined in a vessel
whose walls intersect the surface in a boundary curve.
Fig. 9 represents a section normal to the bounding curve.
At the surface of contact between the liquid and the wall
there exists a surface stress $\alpha_{lf}$, whose magnitude depends upon the
nature of the liquid and of the material composing the wall.* For
equilibrium, the component of the surface tension $\alpha$ parallel to the wall
must balance the boundary surface stress $\alpha_{lf}$. No condition of equilibrium
need be satisfied by the normal component, for such a component
manifests itself solely by a force against the wall; on the other
hand, if the tangential forces were not in balance, the bounding curve
would move along the wall. In the case where the liquid wets the
wall, the acute angle between wall and liquid surface (fig. 9), the
so-called angle of contact, is denoted by $\theta$. In this case

$$\alpha\cos\theta + \alpha_{lf} = 0, \tag{116}$$

or

$$\cos\theta = -\frac{\alpha_{lf}}{\alpha}. \tag{116'}$$

Fig. 9

This means that $\alpha_{lf}$ is negative in this case. If

$$\left|\frac{\alpha_{lf}}{\alpha}\right| > 1,$$

there exists no finite angle of contact, and the liquid spreads along
the wall in the form of a thin layer. This is the case for the contact
between water and glass. It corresponds to the boundary condition
$\theta = 0$. The spreading takes place even along a vertical wall. For a
thin film, the work done against gravity is so small that it sets practically
no limit to the spreading of the liquid.

* Since the surface tension depends upon the nature of the two media, the value
of the constant $\alpha$ introduced above really is contingent upon the nature of the gas
above the liquid. However, the differences for various gases are not great. Strictly
speaking, $\alpha$ is the surface tension of the liquid with respect to its own vapour.

(d) Liquid films 

A liquid film (e.g. the wall of a soap bubble) is bounded by two 
parallel surfaces which are close together. Consequently, the normal 
pressure of a curved film is twice that corresponding to each free 
surface. In the interior of a spherical soap bubble of radius $R = R_1 = R_2$
the excess pressure is thus

$$p = \frac{4\alpha}{R}. \tag{117}$$

If a bubble of this size is to be in equilibrium, the internal pressure
must exceed the outside air pressure by this amount. For an unclosed
film, the total pressure on both sides is the same; hence the part due
to the existence of surface tension is zero, and the differential equation
of an unclosed film is

$$\frac{1}{R_1} + \frac{1}{R_2} = 0. \tag{118}$$


The expression on the left is called the mean curvature. Surfaces for 
which this quantity vanishes are called minimal surfaces, since they 
are the surfaces of smallest area for a given boundary curve. It is at 
once evident that a stretched skin, corresponding as it does to a liquid 
film, assumes a form having minimum area. 
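The relations for the normal pressure lend themselves to a short numerical illustration. The Python sketch below evaluates the normal pressure $\alpha(1/R_1 + 1/R_2)$ of a single surface and the excess pressure $4\alpha/R$ inside a soap bubble, and shows that a saddle-shaped film element with $R_1 = -R_2$ carries no pressure difference. The function names and the value assumed for the surface tension of a soap solution (0.025 N/m) are illustrative assumptions, not data from the text.

```python
def normal_pressure(alpha, R1, R2):
    """Normal pressure of one curved surface: alpha * (1/R1 + 1/R2)."""
    return alpha * (1.0 / R1 + 1.0 / R2)


def bubble_excess_pressure(alpha, R):
    """Excess pressure in a spherical soap bubble: two surfaces, each with R1 = R2 = R."""
    return 2.0 * normal_pressure(alpha, R, R)


# Assumed illustrative values in SI units (not from the text).
ALPHA_SOAP = 0.025   # surface tension of a soap solution, N/m
RADIUS = 0.01        # bubble radius, m

p_bubble = bubble_excess_pressure(ALPHA_SOAP, RADIUS)
p_drop = normal_pressure(ALPHA_SOAP, RADIUS, RADIUS)      # a single free surface
p_saddle = normal_pressure(ALPHA_SOAP, 0.02, -0.02)       # vanishing mean curvature

print(f"soap bubble   : {p_bubble:.3f} Pa")
print(f"single surface: {p_drop:.3f} Pa")
print(f"saddle element: {p_saddle:.3f} Pa")
```

Because the pressure varies as $1/R$, halving the radius of the bubble doubles the excess pressure.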


Ex. 63. Calculate the height to which a liquid will rise in a narrow tube of
radius $r_0$ whose walls are wet by the liquid ($\cos\theta = 1$).

Ex. 64. A soap film is stretched between two equal circular wire rings lying
in parallel planes, their centres being on a line normal to these planes. [...]
rotational symmetry about [...]


CHAPTER X 


Relativistic Mechanics 


1. Space and Time in Newtonian Mechanics. 


When we spoke of the “ path ” and of the “ velocity ” of a particle 
in Chap. V we tacitly assumed the existence of a frame of reference 
(co-ordinate system) in which we could specify the position of the 
particle from instant to instant by giving certain numbers, and further, 
a means of measuring time (a clock) which marked off definite equal 
intervals of time at which the position could be recorded. A co-or- 
dinate system may be given, for example, by the walls of a room, or 
by the positions of the stars and the direction of the plumb-line. For 
a measure of time, we may use any subdivision of the day, i.e. the 
period of rotation of the earth. Actually we find the laws of me- 
chanics, as derived in the preceding chapters, verified when we use 
such reference frames and measures of time, at least to a very good 
approximation. But there are also frames of reference in which the 
fundamental law

$$\mathbf{F} = m\,\frac{d^2\mathbf{r}}{dt^2}$$


is not valid. For example, if we set our room, along with its fixed 
co-ordinate system, into rapid rotation, and if we allow the given 
applied forces to remain unchanged, then the paths of particles in the 
rotating co-ordinate system will be different. We can restore the 
validity of Newtonian mechanics by adding centripetal forces corre- 
sponding to the rotation to the other impressed forces. But what is 
the cause of these inertial forces? Newton gave an exact answer to 
this question. Despite the fact that his solution is based upon a fiction, 
it constitutes a foundation of the mechanics of macroscopic masses 
which has served for more than two centuries. Newton assumes that 
there exists a cosmic substratum, absolute space, which “in its own
nature, and without reference to anything external, remains always
similar and immovable”.* Newton continues by saying that relative
space, on the other hand, is “some movable dimension or measure of
the absolute space which our senses determine by its position to other
bodies, and which is commonly taken for immovable space”. Thus
the Second Law holds only for reference frames which are at rest in
absolute space or, as we shall see presently, are in uniform rectilinear
motion with respect to it. The centripetal or centrifugal forces
involved in rotation are caused by an acceleration of our reference
frame with respect to absolute space. Thus space, which according
to Kant constitutes, together with time, the perceptual form of human
cognition, attains physical reality, and we can perceive its effect in
the invalidity of Newtonian mechanics for accelerated frames. The
newer views, on the other hand, lead to a negation of absolute space
such as the ether of electrical theory. These more recent ideas recognize
only relationships between parts of matter occupying space.
The cause of centrifugal force is taken to be acceleration not with
respect to absolute space, but with respect to the material bodies of
the universe. Thus, if all the stars were removed, centrifugal force
would no longer exist. Newton also makes a similar assumption concerning
the second element in the form of our knowledge, viz. time.
Absolute time is assumed to flow uniformly, regardless of the existence
of any thinking being or any phenomenon or process suitable
for measuring it. We see at once that this fiction of an absolute time
also fails to stand strict examination. We need only propound the
question: what happens if the velocity of all physical and chemical
processes, and hence of our thought processes also, is suddenly doubled?
Since we would have no means of detecting this, the absurdity underlying
the very formulation of the question becomes apparent at once.
Hence the concept of absolute time, which assumes that a standard
clock exists somewhere in the universe, loses its justification, for it is
contrary to the spirit of physics to form concepts which are neither
directly nor indirectly accessible to our senses. Nevertheless, the fact
that Newton was able to formulate a thoroughly adequate foundation for
the mechanics of macroscopic bodies by introducing these two concepts,
whose difficulty could scarcely have escaped his notice, is to be regarded
as an outstanding stroke of genius.

* Principia, Book I, Scholium to Definition VIII. English translation by Andrew
Motte, New York, Daniel Adee, 1840, p. 77.

2. Inertial Frames. The Galilean Transformation. 

We start with the empirical fact that there exist reference frames for
[...] accord with experiment. [...]
which employs a stationary reference frame at the centre of gravity
[...] criterion as to whether our frame of reference, together with our clock,
represents an inertial system of space and time? For this purpose we 
perform, at least in thought, the simplest of mechanical experiments — 
rectilinear motion of a particle subject to no forces. If we divide the 
line of motion into equal segments, we can take the time between the 
passing of two successive marks as the unit of time. However, one 
direction is not sufficient. If our measured path were accelerated with 
respect to a true inertial frame, we would obtain a non-uniform clock 
which would give impossible results for other experiments. It is readily 
seen that the necessary and sufficient condition for an inertial frame 
is that three particles projected in non-coplanar directions describe 
straight paths. Then, by dividing the path of any one of the particles 
into equal intervals, we can obtain an inertial measure of the time. 

We now prove that if Σ is shown to be an inertial frame, Newtonian
mechanics is also valid in every co-ordinate frame Σ' moving uniformly
in a straight line with respect to Σ, provided that the same measure
of time obtains in both systems. We shall find later that this tacit
transfer of the time unit to the moving system is precisely the weak
point of these almost self-evident considerations. Let the origin of
Σ' move with the constant velocity u. Then if we denote by accents
all quantities referred to the moving frame, we have


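$$\mathbf{r} = \mathbf{r}' + \mathbf{u}t \tag{1}$$

Differentiation with respect to the time gives

$$\dot{\mathbf{r}} = \dot{\mathbf{r}}' + \mathbf{u}, \tag{2}$$

$$\ddot{\mathbf{r}} = \ddot{\mathbf{r}}' \tag{3}$$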


Equation (2) states that the “absolute” velocity $\dot{\mathbf{r}}$ is equal to the
vector sum of the relative velocity $\dot{\mathbf{r}}'$ and the velocity u. This theorem
is general, since no use was made of the constancy of u in deriving
formula (2). Equation (3) states that the acceleration is the same in
both systems, so that

$$\mathbf{F} = m\ddot{\mathbf{r}} = m\ddot{\mathbf{r}}', \tag{4}$$

which was to be demonstrated. The transformation $\mathbf{r} = \mathbf{r}' + \mathbf{u}t$
which, together with the equality of the time reckoning, corresponds
to the concept of absolute space and time, is called the Galilean
transformation. If we take the x- and the x'-axes in the direction of the
velocity of motion of Σ', whose magnitude is u, and if the two frames
coincide at time t = 0, then the co-ordinate representation of the
Galilean transformation is

$$x = x' + ut, \quad y = y', \quad z = z', \quad t = t' \tag{1'}$$

The validity of the Galilean transformation carries with it the 
impossibility, by means of mechanical experiments, of detecting
uniform rectilinear motion with respect to absolute space; e.g. we
could not determine the earth’s orbital motion, which is approximately 
of this nature over a short interval. However, the extension of these 
ideas to electrodynamics (optics) shows that an experiment is possible, 
by means of which the reality of absolute space (the ether) might be 
verified. But experiments designed to test this point yielded negative 
results (see p. 239); this led to a revision of the previous conceptions 
of space and time. 
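
The content of equations (1) to (4) can be illustrated numerically. The following Python sketch is only an illustration under stated assumptions: the function names, the sample motion and the numerical values of the acceleration and of the frame velocity are chosen here and do not come from the text. It describes one and the same uniformly accelerated motion in Σ and in Σ' and estimates the acceleration in each frame by finite differences.

```python
def to_moving_frame(x, t, u):
    """Galilean transformation (1') solved for the accented quantities."""
    return x - u * t, t   # x' = x - u t,  t' = t


def acceleration(position, t, h=1e-3):
    """Central-difference estimate of the second time derivative."""
    return (position(t + h) - 2.0 * position(t) + position(t - h)) / h**2


G = 9.81   # assumed constant acceleration, m/s^2 (illustrative value)
U = 30.0   # assumed velocity of the frame Sigma', m/s (illustrative value)


def x_in_sigma(t):
    """A uniformly accelerated motion as described in the frame Sigma."""
    return 5.0 * t - 0.5 * G * t**2


def x_in_sigma_prime(t):
    """The same motion described in the moving frame Sigma'."""
    x_prime, _ = to_moving_frame(x_in_sigma(t), t, U)
    return x_prime


a_sigma = acceleration(x_in_sigma, 1.0)
a_sigma_prime = acceleration(x_in_sigma_prime, 1.0)
print(a_sigma, a_sigma_prime)   # the two estimates agree up to rounding
```

The two velocities differ by u, as in equation (2), while the two accelerations coincide, as in equation (3); this is all that the proof of equation (4) requires.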

3. Accelerated Frames of Reference. Free Fall on the Rotating 
Earth. 

We now consider accelerated motions of the frame Σ' with respect
to the inertial frame Σ. Two special cases of particular significance
are selected for discussion.


(a) Reference frames in uniformly accelerated rectilinear motion

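We assume our reference frame Σ' to move with uniform acceleration
a in the direction of the negative z-axis. Let this frame coincide
with Σ at the time t = 0, and let it start from rest. Such a frame is
represented by a falling box. The connexion between the quantities
pertaining to Σ and to Σ' is then

$$\mathbf{r} = \mathbf{r}' - \tfrac{1}{2}at^2\,\mathbf{k}, \tag{5}$$

$$\dot{\mathbf{r}} = \dot{\mathbf{r}}' - at\,\mathbf{k}, \tag{6}$$

$$\ddot{\mathbf{r}} = \ddot{\mathbf{r}}' - a\,\mathbf{k} \tag{7}$$

In Σ we have $\mathbf{F} = m\ddot{\mathbf{r}} = m\ddot{\mathbf{r}}' - ma\mathbf{k}$; hence for the same impressed
forces in Σ',

$$m\ddot{\mathbf{r}}' = \mathbf{F} + ma\mathbf{k} \tag{8}$$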

In addition to the resultant of the “impressed” forces we now have an
apparent force or inertial force mak. If a is identical with the acceleration
of gravity, the inertial force exactly balances the gravity force
−mgk. Hence the force of gravity does not operate in the falling
chamber. On the other hand, if we imagine the chamber to be at a
place where there is no gravitational force, and if we move it with
uniform acceleration g in the direction of the +z-axis, then an apparent
force of amount mg, directed along the negative z-axis, acts upon
every body within the box. Thus an observer within the box would 
say that he is in the gravitational field of the earth. This equivalence 
of an accelerated frame of reference and a gravitational field is of 
fundamental importance in the general theory of relativity (cf. p. 258). 
The equivalence can be maintained, however, only if the ratio of
inertial mass, i.e. the mass entering in Newton’s Second Law, and
gravitational mass, i.e. the mass occurring in the law of gravitation, is the
same for all bodies. In this case the ratio may be set equal to unity; 
this determines the numerical value of the gravitational constant. 

Following this preliminary glimpse we turn to the other important 
special case. 

(b) Uniformly rotating frames of reference

In the previously considered types of motion the unit vectors i', j',
k' maintained their directions, and we assumed these directions to
coincide with those of i, j, k respectively. For a rotational motion of
Σ', however, the directions of these vectors change. If we suppose
the origins of the two frames to coincide, and if we take the axis of
rotation through the common origin, we have

$$\mathbf{r} = x\mathbf{i} + y\mathbf{j} + z\mathbf{k} = x'\mathbf{i}' + y'\mathbf{j}' + z'\mathbf{k}' \tag{9}$$


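In taking time derivatives, the unit vectors will thus be variable:

$$\frac{d\mathbf{r}}{dt} = \frac{dx'}{dt}\mathbf{i}' + \frac{dy'}{dt}\mathbf{j}' + \frac{dz'}{dt}\mathbf{k}' + x'\frac{d\mathbf{i}'}{dt} + y'\frac{d\mathbf{j}'}{dt} + z'\frac{d\mathbf{k}'}{dt} \tag{10}$$

The first three terms represent the velocity relative to Σ', the last
three give the velocity of a point rigidly attached to Σ'. If we denote
the angular velocity vector by ω, then

$$\frac{d\mathbf{i}'}{dt} = [\boldsymbol{\omega}\,\mathbf{i}'], \quad \frac{d\mathbf{j}'}{dt} = [\boldsymbol{\omega}\,\mathbf{j}'], \quad \frac{d\mathbf{k}'}{dt} = [\boldsymbol{\omega}\,\mathbf{k}'] \tag{11}$$

If these values are substituted in equation (10), the last three terms
may be combined:

$$x'[\boldsymbol{\omega}\,\mathbf{i}'] + y'[\boldsymbol{\omega}\,\mathbf{j}'] + z'[\boldsymbol{\omega}\,\mathbf{k}'] = [\boldsymbol{\omega}\,(x'\mathbf{i}' + y'\mathbf{j}' + z'\mathbf{k}')] = [\boldsymbol{\omega}\,\mathbf{r}]. \tag{12}$$

If we denote differentiation referred to Σ' by d'/dt, then (10) may be
written

$$\frac{d\mathbf{r}}{dt} = \frac{d'\mathbf{r}}{dt} + [\boldsymbol{\omega}\,\mathbf{r}] \tag{13}$$

This is true for any vector, since we can insert the components of any
vector whatsoever in place of x', y', z'. Symbolically, we may thus write

$$\frac{d}{dt} = \frac{d'}{dt} + [\boldsymbol{\omega}\,\cdot\,] \tag{14}$$

Differentiating (13) again with respect to t and applying the above
rule, we have, remembering that ω is constant,

$$\frac{d^2\mathbf{r}}{dt^2} = \frac{d'^2\mathbf{r}}{dt^2} + 2\left[\boldsymbol{\omega}\,\frac{d'\mathbf{r}}{dt}\right] + [\boldsymbol{\omega}\,[\boldsymbol{\omega}\,\mathbf{r}]] \tag{16}$$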

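The first term represents the acceleration relative to Σ', the third
represents the acceleration in the system Σ of a point fixed to Σ'.
However, it is no longer true, as in the case of velocity, that the absolute
acceleration is equal to the sum of these two terms; there is,
in addition, a third term, twice the vector product of angular velocity
by relative velocity. This term, which occurs only when the particle
moves in Σ', is called the Coriolis acceleration, after its discoverer.
The Second Law then becomes

$$\mathbf{F} = m\frac{d^2\mathbf{r}}{dt^2} = m\frac{d'^2\mathbf{r}}{dt^2} + 2m\left[\boldsymbol{\omega}\,\frac{d'\mathbf{r}}{dt}\right] + m[\boldsymbol{\omega}\,[\boldsymbol{\omega}\,\mathbf{r}]],$$

or, in the moving system,

$$m\frac{d'^2\mathbf{r}}{dt^2} = \mathbf{F} - 2m\left[\boldsymbol{\omega}\,\frac{d'\mathbf{r}}{dt}\right] - m[\boldsymbol{\omega}\,[\boldsymbol{\omega}\,\mathbf{r}]];$$

that is, we have, in addition to the impressed forces, the ordinary
centrifugal force $-m[\boldsymbol{\omega}\,[\boldsymbol{\omega}\,\mathbf{r}]]$ and the Coriolis force
$-2m[\boldsymbol{\omega}\,d'\mathbf{r}/dt]$.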

As an example of the forces arising from rotation of the frame 
of reference we take the case of free fall on the rotating earth. Co- 
ordinate frames rigidly attached to the earth are, strictly speaking, 
not inertial frames, on account of the daily rotation. The angular 
velocity of the orbital motion, on the other hand, can be neglected, 
having to the other the ratio 1 : 365 . However, the deviation from a true 
inertial frame is so small that it usually may be neglected. Neverthe- 
less, there are various mechanical experiments which permit the rotation 
of the earth to be demonstrated. These experiments were formerly 
interpreted as proof of the rotation with respect to absolute space; 
from the point of view of the present day, however, they may be 
considered merely as evidence of rotation with respect to the other 
matter of the universe. Take an inertial frame whose z-axis is in the
direction of the earth's axis. Let the frame Σ' be oriented as follows:
the k'-axis is in the direction of the plumb line at the geographical
latitude φ. This direction is that of the resultant of gravity and centrifugal
force. This direction will not pass through the centre of the
earth if we consider the earth to be spherical, but it is sufficiently
accurate to take the angle between this direction and the earth’s axis
as 90° − φ. We also omit from consideration the polar flattening of the earth.
The positive sense is taken to be that pointing outward from the earth’s
centre. The i'-axis is directed southward, the j'-axis then points eastward
(fig. 1). We then have the equation of motion

$$m\frac{d'^2\mathbf{r}}{dt^2} = -mg\mathbf{k}' - 2m\left[\boldsymbol{\omega}\,\frac{d'\mathbf{r}}{dt}\right] \tag{18}$$

Fig. 1

The centrifugal force is already included in the term −mgk', since we understand
g to represent the resultant acceleration of a falling body at the point considered.
The angular velocity ω has the components $-\omega\,\mathbf{i}'\cos\phi$ and $+\omega\,\mathbf{k}'\sin\phi$
in the frame Σ'. In co-ordinates we thus have


$$m\frac{d^2x'}{dt^2} = 2m\omega\sin\phi\,\frac{dy'}{dt},$$

$$m\frac{d^2y'}{dt^2} = -2m\omega\sin\phi\,\frac{dx'}{dt} - 2m\omega\cos\phi\,\frac{dz'}{dt}, \tag{18'}$$

$$m\frac{d^2z'}{dt^2} = -mg + 2m\omega\cos\phi\,\frac{dy'}{dt}$$


Leaving the rotation of the earth out of consideration, the x'- and
y'-components of velocity vanish for free fall. Hence, even when
the rotation is taken into account, the quantities dx'/dt and
dy'/dt are small compared with dz'/dt, so that we have the simplified
equations

$$\frac{d^2x'}{dt^2} = 0, \quad \frac{d^2y'}{dt^2} = -2\omega\cos\phi\,\frac{dz'}{dt}, \quad \frac{d^2z'}{dt^2} = -g \tag{19}$$
The first of these equations shows that no deviation occurs
in the north-and-south direction. Integrating the third equation
and taking $(dz'/dt)_0 = 0$, there results

$$\frac{dz'}{dt} = -gt \tag{20}$$

Inserting this value in the second equation,

$$\frac{d^2y'}{dt^2} = +2\omega g t\cos\phi \tag{21}$$

Integrating twice and taking $y'_0 = (dy'/dt)_0 = 0$, we obtain

$$y' = \tfrac{1}{3}\,\omega g t^3\cos\phi \tag{22}$$

This result, an eastward deviation of falling bodies proportional to
the cube of the time of fall, agrees with observation.
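
To give an idea of the magnitude of the effect, equation (22) may be evaluated for a definite height of fall and compared with a direct step-by-step integration of the unsimplified equations (18'). The Python sketch below is an illustration only; the function names, the step count, the value of g and the use of the sidereal day for the earth's angular velocity are assumptions made here, not data from the text.

```python
import math

OMEGA = 2.0 * math.pi / 86164.0   # earth's angular velocity, rad/s (assumed: one sidereal day)
G = 9.81                          # assumed local value of g, m/s^2


def eastward_deviation(height, latitude_deg):
    """Equation (22) evaluated at the time of fall t = sqrt(2 h / g)."""
    t = math.sqrt(2.0 * height / G)
    return OMEGA * G * t**3 * math.cos(math.radians(latitude_deg)) / 3.0


def eastward_deviation_numeric(height, latitude_deg, steps=20000):
    """Step-by-step integration of equations (18'), starting from rest."""
    phi = math.radians(latitude_deg)
    s, c = math.sin(phi), math.cos(phi)
    dt = math.sqrt(2.0 * height / G) / steps
    y = 0.0
    vx = vy = vz = 0.0
    for _ in range(steps):
        # accelerations from (18'), divided by m, using the current velocities
        ax = 2.0 * OMEGA * s * vy
        ay = -2.0 * OMEGA * s * vx - 2.0 * OMEGA * c * vz
        az = -G + 2.0 * OMEGA * c * vy
        vx, vy, vz = vx + ax * dt, vy + ay * dt, vz + az * dt
        y += vy * dt
    return y


print(eastward_deviation(100.0, 45.0))          # deviation in metres
print(eastward_deviation_numeric(100.0, 45.0))  # nearly the same value
```

For a fall of 100 m at latitude 45° both routes give a deviation of roughly one and a half centimetres, which indicates that the terms neglected in passing from (18') to (19) are indeed unimportant here.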


Ex. 65. The Foucault pendulum . — Calculate the motion of a freely swinging 
pendulum bob suspended by a thread, taking into account the motion of the 
earth. (The z-component of the velocity may be neglected here.) 

Ex. 66. Using equation (14), express the total change in angular momentum 
of a rigid body in terms of one part relative to the body and another due to rota- 
tion of the system of co-ordinates fixed in the body. Using the result, write the 
equation M = dP/dt in terms of components of ω resolved along the principal axes
of inertia. These equations are called Euler’s equations. 


4. Moving Frames of Reference in Acoustics. The Doppler Effect. 

Before discussing the difficulties which the concept of absolute
space raises in optics, we shall consider the analogous questions for
sound. Here the relations involved are perfectly clear and the effects
are quite definite, on account of the fact that the velocity of sound
in air is not too great compared with mechanically attainable
speeds. We know that the velocity of propagation of sound in
air is c = 333 m./sec. In a co-ordinate system fixed with respect
to the source of sound (taken to be at rest) and with respect to the
air, the sound will have arrived at a sphere of radius c after one
second. Let us now lay out a measuring rod and move it relative to
the air. We then measure the
relative velocity $c_r$ along the moving measuring rod, the direction
of $c_r$ being given by that of the rod. Now the absolute velocity must
always be c; hence if the velocity of the measuring rod is given, in
direction and magnitude, by v,



$$\mathbf{c} = \mathbf{c}_r + \mathbf{v} \quad\text{or}\quad \mathbf{c}_r = \mathbf{c} - \mathbf{v} \tag{23}$$

In particular, if the rod is moved along its own length (position b, fig. 2),

$$c_r = c - v; \tag{23'}$$

for motion normal to its length (position c, fig. 2),

$$c_r = \sqrt{c^2 - v^2} \tag{23''}$$

Moreover, we know that motion relative to the source of sound is
accompanied by a change in pitch of the tone perceived (Doppler
effect). First imagine the source and the observer both at rest. The
source emits $\nu_s$ condensations per second; these follow each other at
distances apart $\lambda_0 = c/\nu_s$. The succession of sound waves may be
thought of as a long rod moving forward with the speed c, the condensations
being located on it, like a series of beads, with a separation
equal to $\lambda_0$. Hence $\nu_M = c/\lambda_0 = \nu_s$ is the number of condensations
passing the position of the observer each second. If we now permit
the observer to move toward the source of sound, then he receives an
additional number of vibrations per second equal to the number of
waves in the distance he covers per second, i.e. a number $v/\lambda_0$. Hence

$$\nu_M = \nu_s + \frac{v}{\lambda_0} = \nu_s + \nu_s\frac{v}{c} = \nu_s\left(1 + \frac{v}{c}\right) \tag{24}$$

For motion away from the source the positive sign in the parenthesis
is to be replaced by the negative.

The Doppler effect due to the motion of the source is essentially 
different, although the effect is in the same direction, i.e. the pitch is 
raised when the distance between source and observer is decreasing, 
and lowered when the distance is increasing. The situation may be 
analysed in this way: If the source were at rest, then the first disturbance
(say a condensation) would be at a distance $\lambda_0$ from the
source after the time of one vibration had elapsed. But the source
has moved on a distance $v/\nu_s = (v/c)\lambda_0$ in this time, thus decreasing
the distance between the source and the first condensation by $(v/c)\lambda_0$.
This decrease holds for all distances between condensations; for after
another vibration, the first disturbance has progressed a distance $\lambda_0$
but the second starts from the new position of the source, and so the
distance between the first and second condensations is again decreased
by $(v/c)\lambda_0$, and so on. Fig. 3 shows how this occurs. The number of
vibrations passing the observer each second is thus

$$\nu_M = \frac{c}{\lambda'} = \frac{c}{\lambda_0(1 - v/c)}, \quad\text{or}\quad \nu_M = \frac{\nu_s}{1 - v/c} \tag{25}$$

If the source moves away from the observer, the positive sign must
be used before the second term in the denominator. Comparing this
result with the case where the observer alone was in motion, we see that
the two are equivalent only to a first approximation, i.e. when higher
powers of v/c can be neglected. The reason is that the medium carrying
the sound (the air) provides a special co-ordinate frame Σ to which
all motion is to be referred. One sees at once that the two cases are
essentially different by taking v = c. In this instance we have $\nu_M = 2\nu_s$
for motion of the observer, while if the source is moving, $\nu_M = \infty$.
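
The difference between equations (24) and (25) is easily exhibited numerically. The short Python sketch below is illustrative only: the function names and the sample frequency of 440 vibrations per second are assumptions introduced here, and the sketch is restricted to speeds not exceeding that of sound.

```python
import math

C_SOUND = 333.0   # velocity of sound quoted in the text, m/s


def moving_observer(nu_s, v, c=C_SOUND):
    """Equation (24): the observer approaches a source at rest in the air."""
    return nu_s * (1.0 + v / c)


def moving_source(nu_s, v, c=C_SOUND):
    """Equation (25): the source approaches an observer at rest in the air."""
    if v > c:
        raise ValueError("this sketch covers only v <= c")
    if v == c:
        return math.inf   # limiting case mentioned in the text
    return nu_s / (1.0 - v / c)


for v in (3.33, 33.3, 333.0):
    print(v, moving_observer(440.0, v), moving_source(440.0, v))
```

For small v/c the two columns differ only by a quantity of the second order in v/c, whereas at v = c the moving observer hears the octave and the moving source gives an unbounded frequency.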

5. Moving Frames of Reference in Optics. The Michelson-Morley 
Experiment. 

According to the older ideas, absolute space — called the ether in 
electrical theory — is the carrier of electromagnetic and therefore of 
optical phenomena. It is therefore natural to apply the results ob- 
tained for sound to optics. Actually, however, this procedure leads 
to unsurmountable difficulties. The existence of an optical Doppler 
effect of approximately the magnitude given by equation (25) has 
been observed in the spectra of stars moving toward or away from 
the earth, as well as in the light from canal (or positive) rays — fast- 
moving luminous atoms (cf. p. 441). If we could increase the velocity 
of the light source, or of the observer, or increase the accuracy of 
measurement to the extent that quantities of the second order in v/c 
could be measured, then we would be able to test the existence of 
the ether. As we saw in Acoustics, if we have a medium at rest, the 
second-order quantities show that the Doppler shift depends not only 
upon the relative velocity of source and observer, but on the velocity 
with respect to the medium. Unfortunately, there is little hope of 
securing an experimental decision concerning the reality of the ether 
concept in this way. Only recently has it been found possible to increase
the accuracy of Doppler-effect measurements on positive rays
to the extent that second-order quantities can be measured (see p. 246).
It is easier to carry out, with sufficient accuracy, an experiment by 
which the velocity of propagation with respect to a frame moving 
through the ether is investigated. Since light is propagated in the 
ether with velocity c, the velocity of light relative to a system moving 
through the ether with velocity v is c — v. A direct measurement or 
comparison of the velocity of light in various directions on the earth, 
which (considering only the orbit about the sun) moves uniformly 
with a velocity of about 30 km. /sec., is again, it is true, beyond experi- 
mental possibility. A comparison is made possible, however, by using 
the light itself, in a somewhat different arrangement, to measure the 



vanishingly small time differences. This scheme, which had already
been suggested by Maxwell, was first utilized by Michelson in 1881 and,
in refined form, by Michelson and Morley in 1887. It has since been
tested many times by various investigators.
In this arrangement (fig. 4) light from the source L is divided into two 
beams, normal to each other, by the half-silvered plane parallel glass 
plate G. Each beam covers the distance l to the mirrors $S_1$ or $S_2$, and
is reflected back over its original path. Assume the arm 1 to be in the
direction of the earth’s motion; then the relative velocity of the light
is c − v on the initial path and c + v on the return. Hence the time
taken to pass to and fro along this arm is

$$t_1 = \frac{l}{c - v} + \frac{l}{c + v} = \frac{2lc}{c^2 - v^2} = \frac{2l}{c}\left(1 + \frac{v^2}{c^2} + \cdots\right) \tag{26}$$

In the direction normal to this path the relative velocity of light is
$\sqrt{c^2 - v^2}$; hence the time required for the double journey along the
arm 2 is

$$t_2 = \frac{2l}{\sqrt{c^2 - v^2}} = \frac{2l}{c}\left(1 + \frac{v^2}{2c^2} + \cdots\right) \tag{27}$$

Thus there exists a difference, a very small one, between the two times:

$$\Delta t = t_1 - t_2 = \frac{l}{c}\cdot\frac{v^2}{c^2} \tag{28}$$



If we take the orbital speed of the earth to be 30 km./sec. and let the
path l be 30 m. long, $\Delta t = 10^{-15}$ sec. In spite of its smallness, this
difference can be detected if we use the light vibrations themselves
instead of a material clock for this measurement. Now the number
of vibrations N described by yellow light ($\lambda = 6 \times 10^{-5}$ cm.) in $10^{-15}$
sec. is

$$N = \frac{\Delta t}{T} = \Delta t\cdot\nu = \Delta t\cdot\frac{c}{\lambda} = 10^{-15}\cdot\frac{3\cdot 10^{10}}{6\cdot 10^{-5}} = \frac{1}{2} \tag{29}$$

The difference is thus one half of one vibration. If the apparatus is 
turned through 90° the two arms interchange their roles, and the 
difference has the opposite sign, so that the entire difference for two 
positions 90° apart amounts to one complete vibration. Since both 
partial beams originate in the same light source, they interfere when 
reunited (see p. 410) and we obtain a system of interference fringes 
which may be thought of as “ curves of equal thickness ” in the wedge 
formed by $S_1$ and the image of $S_2$ produced by reflection in the semi-silvered
plate. Starting with the arm 1 in the direction of the motion
of the earth, we have a definite system of fringes. If, now, the appa- 
ratus is turned through 45°, the two arms are symmetrically disposed, 
and the original time difference of T/2, corresponding to a phase
difference of π, vanishes. The places which were originally dark are
now bright, i.e. the fringe system has been displaced by half a fringe 
width. If we turn the apparatus further in the same sense, the opposite 
difference obtains, and when the rotation amounts to 90° the dis- 
placement is an entire fringe width. This is, of course, subject to the 
assumption of the numerical values adopted above. The result of the 
Michelson-Morley experiment, conducted under approximately these 
conditions, was the absence of any such fringe displacement. When, 
in 1925, D. C. Miller reported a positive effect corresponding to a 
velocity of 10 km./sec., verifications were attempted by various in- 
vestigators. Joos, working with improved apparatus at the Zeiss 
plant in Jena, was able to show that if a positive effect does exist, 
it must be smaller than one thousandth of the width of a fringe. Fig. 1,
Plate I, shows a record of the interference fringes. This record was
obtained by moving a photographic plate in front of a 0.2 mm.-wide
slit mounted normal to the fringes, the entire apparatus being rotated
meanwhile. Instead of a sinuous line repeating after one half turn,
we see that the stripes are perfectly straight. Experiment has thus
decided against our acoustic analogy and against the existence of a stationary
medium carrying light; i.e. the existence of a cosmic ether or absolute
space is disproved.


6. The Relativistic Conception of Space and Time. The Lorentz
Transformation. 


Disregarding hypotheses which are contrived ad hoc , and which are 
therefore unsatisfactory, the result of the Michelson-Morley experi- 
ment leads compulsorily to a revision of our concepts concerning space 
and time. The negative result of this experiment makes it very evident 
that absolute space is a fiction. But absolute time must also be abandoned.
The definitions occurring in the treatment of time by the old
mechanics are imperfect. For example, what is the meaning of the
statement: “Two events took place simultaneously at A and B”?
If we postulate an absolute, uniformly-flowing time, there can be no
doubt as to the meaning of this statement; but physics demands an
experimental criterion of the correctness of this statement, and this is
where the difficulties begin. We must have synchronized clocks at
the stations A and B which show the absolute time. Then the events
may be said to be simultaneous if the clocks indicate the same time at
the instant when the events occur. But how can the clocks be synchronized?
An arrangement which suggests itself is the following:
Place two mirrors, inclined at 45° to the path AB, at the mid-point C
of AB, so that an observer at C can see the two stations A and B
simultaneously. We shall say that the two clocks are synchronized
if they indicate the same time at every instant, when seen at C. But
from the standpoint of absolute concepts, they will not be exactly 
synchronous even in this case, for the experiment is performed on the 
earth, which moves relative to the ether; hence the light by which 
we see the clocks will require a time l/(c — v) to come to us from one 
clock, and a time l/(c + v) to come from the other, assuming the path
AB (of length 2l) to be in the direction of the earth’s motion. What
value is to be put for v? There is certainly a velocity of the solar system 
relative to the fixed stars to be added to the orbital velocity of the 
earth, and probably a very large velocity of the galaxy with respect 
to the extra-galactic systems. From the absolute point of view, how- 
ever, there may also be a common motion of all matter with respect 
to the ether. The result of the Michelson-Morley experiment relieves 
us of the necessity of answering this question, for it shows that the 
velocity of light is the same in all directions on the moving earth. 
Therefore the above experimental criterion for the synchronization of 
the clocks of a system remains intact. But, as was first recognized by 
H. A. Lorentz, the result of the Michelson-Morley experiment requires 
that two systems in relative motion have two different time measures, 
for it is observed in each system that light reaches a sphere of radius 
c after one second. Hence the Galilean transformation, which sets 
t' = t, cannot be correct. Its insufficiency was not recognized sooner 
on account of the fact that the velocity of light is so great in comparison
with all mechanical velocities that the deviations can be detected
only by the most refined optical measurements. The fact that
the Galilean transformation is incompatible with the result of the
Michelson-Morley experiment may be seen by applying the transformation
to a spherical wave observed in a system at rest. The result
is the equation of a sphere eccentric with respect to the origin. We must
therefore transform t along with the other quantities, in such a way
that the transformation formulae give a spherical wave for the propagation
of light in every frame of reference, i.e. the identity

$$x^2 + y^2 + z^2 - c^2t^2 \equiv x'^2 + y'^2 + z'^2 - c^2t'^2 = 0 \tag{30}$$


must be satisfied. 

For the sake of simplicity we take the x-axis in the direction of
the relative velocity of Σ and Σ', thus permitting the x-axis to coincide
with the x'-axis. We then make the obvious assumption that the transverse
co-ordinates remain unchanged. Thus the identity reduces to

$$x^2 - c^2t^2 \equiv x'^2 - c^2t'^2 = 0 \tag{30'}$$

Fig. 6


Since there is but one relative velocity v,
the transformation formula must further fulfil
the condition that O' has the co-ordinate vt
in Σ, and that O has the co-ordinate −vt' in Σ' (fig. 6). The
simplest relations which accomplish this end are

$$x' = k(x - vt) \quad\text{and}\quad x = k'(x' + vt') \tag{31}$$


We eliminate x' from the second relation by means of the value
obtained from the first, and find

$$t' = k\left[t - \frac{x}{v}\left(1 - \frac{1}{kk'}\right)\right]$$

Putting the values of x' and t' in the right member of (30'), we have

$$x^2 - c^2t^2 \equiv k^2(x^2 - 2xvt + v^2t^2) - c^2k^2\left[t - \frac{x}{v}\left(1 - \frac{1}{kk'}\right)\right]^2$$

The identity requires that the coefficients of $x^2$, $xt$ and $t^2$ vanish separately;
to accomplish this, we have only the two constants k and k'
at our disposal. The coefficient of $x^2$ yields

$$1 - k^2 + \frac{c^2k^2}{v^2}\left(1 - \frac{1}{kk'}\right)^2 = 0$$

The coefficient of $t^2$ requires that

$$-c^2 - k^2v^2 + k^2c^2 = 0 \quad\text{or}\quad k = \frac{1}{\sqrt{1 - v^2/c^2}}$$

The coefficient of xt gives

$$(v^2 - c^2)kk' + c^2 = 0,$$

or, after inserting the value of k,

$$k' = \frac{1}{\sqrt{1 - v^2/c^2}}$$


It is now easy to verify that these values of k and k' make the co- 
efficient of x 2 vanish also. 

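We thus have the formulae of transformation

$$x' = \frac{x - vt}{\sqrt{1 - v^2/c^2}}, \quad y' = y, \quad z' = z, \quad t' = \frac{t - vx/c^2}{\sqrt{1 - v^2/c^2}},$$

$$x = \frac{x' + vt'}{\sqrt{1 - v^2/c^2}}, \quad y = y', \quad z = z', \quad t = \frac{t' + vx'/c^2}{\sqrt{1 - v^2/c^2}} \tag{34}$$

If we let c tend to infinity we obtain in the limit the Galilean transformation,
which is a first approximation to the Lorentz transformation
for finite v. The ratio v/c will, as is customary, be denoted by
β in what follows. Using (34), we readily see by differentiation that the
speed dx/dt of O' with respect to Σ, and the speed dx'/dt' of O with
respect to Σ', are both equal to v.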
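
The defining property of the transformation, namely the identity (30'), and its Galilean limit can both be checked numerically. The Python sketch below is an illustration only; the function names and the sample values (with c set equal to unity) are assumptions introduced here.

```python
import math


def lorentz(x, t, v, c):
    """Lorentz transformation of x and t to a frame moving with velocity v."""
    k = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return k * (x - v * t), k * (t - v * x / c**2)


def galilei(x, t, v):
    """Galilean transformation: x' = x - v t, t' = t."""
    return x - v * t, t


c = 1.0
x, t, v = 3.0, 5.0, 0.6

x_p, t_p = lorentz(x, t, v, c)
interval = x**2 - c**2 * t**2
interval_p = x_p**2 - c**2 * t_p**2
print(interval, interval_p)          # the two expressions agree

x_back, t_back = lorentz(x_p, t_p, -v, c)
print(x_back, t_back)                # changing the sign of v leads back to x, t

print(lorentz(x, t, v, 1.0e9), galilei(x, t, v))   # very large c: Galilean limit
```

That the inverse transformation is obtained simply by reversing the sign of v corresponds to the statement that there is but one relative velocity between the two frames.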


7. Immediate Consequences of the Lorentz Transformation. 

(a) Relativity of simultaneity and of the order of events 

Let two events occur simultaneously at the points $x_1$ and $x_2$ in
the frame Σ, i.e. let $t_1 = t_2$. Then

$$t_1' = \frac{t_1 - (v/c^2)\,x_1}{\sqrt{1 - \beta^2}}, \quad t_2' = \frac{t_2 - (v/c^2)\,x_2}{\sqrt{1 - \beta^2}},$$

i.e. $t_1'$ and $t_2'$ are different. Two events at different places $x_1$ and $x_2$
which are simultaneous for an observer at rest in the frame Σ no longer
appear so to an observer moving relative to Σ. We ask further:
“May even the order of two events depend upon the state of
motion of the observer?” Let us take $t_2 > t_1$, i.e. $t_2 - t_1 > 0$. Then

$$t_2' - t_1' = \frac{t_2 - t_1 - (v/c^2)(x_2 - x_1)}{\sqrt{1 - \beta^2}}$$

In order to obtain a real time difference $t_2' - t_1'$ at all in the accented
system, the velocity v of Σ' must be less than c; $t_2' - t_1'$ certainly
remains positive as long as

$$t_2 - t_1 > \frac{x_2 - x_1}{c}$$


Now we shall see below that the greatest velocity with which an agency 
connecting a cause and an effect can be propagated is c. If there were 
a more rapid process, we would use it in place of light to define simul- 
taneity. Hence, as long as c is the greatest velocity attainable, the 
order of two causally connected events cannot be reversed. However, 
the order of two events which are not causally connected may be 
reversed consistently with the above relationship. 
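
These statements can be tried out on a few numerical cases. The Python sketch below is illustrative only; the function name and the sample separations (in units with c = 1) are assumptions made here.

```python
import math


def time_difference_primed(dt, dx, v, c=1.0):
    """t2' - t1' for two events separated by dt = t2 - t1 and dx = x2 - x1 in Sigma."""
    return (dt - v * dx / c**2) / math.sqrt(1.0 - (v / c) ** 2)


# Two events simultaneous in Sigma but at different places:
simultaneous = time_difference_primed(0.0, 1.0, 0.5)

# Two events which a signal of velocity 0.9 c could connect (dx/dt < c):
connectable = [time_difference_primed(1.0, 0.9, v) for v in (-0.99, -0.5, 0.5, 0.99)]

# Two events too far apart to be connected by any signal (dx/dt > c):
unconnected = time_difference_primed(1.0, 2.0, 0.8)

print(simultaneous, connectable, unconnected)
```

The first value differs from zero, all values in the second list remain positive whatever the velocity of the observer, and the third is negative: only for events which cannot be causally connected is the order reversed.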


(b) The Einstein time dilatation

Let a clock at rest at a point x of Σ give signals at intervals

$$\Delta t = t_2 - t_1.$$

Seen from the moving system, the interval will be, by equation (34),

$$\Delta t' = t_2' - t_1' = \frac{t_2 - t_1}{\sqrt{1 - \beta^2}} = \frac{\Delta t}{\sqrt{1 - \beta^2}} \tag{36}$$

Thus the interval Δt appears to the moving observer to be lengthened.


(c) The Lorentz contraction 


We perform a measurement of length by laying a measuring rod 
alongside the segment to be measured and reading the interval be- 
tween the division marks which coincide at a given instant with the 
ends of the length to be measured. As long as the measuring rod and 
the interval to be measured are at rest with respect to one another, 
this definition is trivial; not so, however, if there exists a relative 
velocity v between the two. Let the distance $l = x_2 - x_1$ between the
ends of the interval be measured by means of a linear scale at rest in
this frame. According to equations (34) (p. 242), a moving observer
will note

$$x_2' - x_1' = \frac{x_2 - x_1 - v(t_2 - t_1)}{\sqrt{1 - \beta^2}} \tag{37}$$

The times $t_2$ and $t_1$, however, are to be determined in such a way that
coincidence with the scale divisions occurs at the same instant in the
observer's system, i.e. we must have, not $t_2 = t_1$, but $t_2' = t_1'$. From
the Lorentz transformation we have, for $t_2' = t_1'$,

$$t_2 - t_1 = \frac{v}{c^2}(x_2 - x_1),$$

and so

$$l' = x_2' - x_1' = (x_2 - x_1)\sqrt{1 - \beta^2} \tag{38}$$

Seen by the moving observer, linear dimensions are reduced in the
ratio $\sqrt{1 - \beta^2} : 1$. A body which appears to be spherical to an
observer at rest relative to it will thus appear to a moving observer
to be an oblate spheroid.
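
The essential step in this argument, the choice of readings which are simultaneous for the moving observer, can be followed in a small computation. The Python sketch below is an illustration under assumptions made here (function names, units with c = 1, sample values); it selects the two readings so that $t_2' = t_1'$ and then forms $x_2' - x_1'$.

```python
import math


def lorentz(x, t, v, c=1.0):
    """Lorentz transformation of x and t to a frame moving with velocity v."""
    k = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return k * (x - v * t), k * (t - v * x / c**2)


def measured_length(x1, x2, v, c=1.0):
    """Length x2' - x1' read off at one instant of the moving observer's time."""
    t1 = 0.0
    t2 = t1 + v * (x2 - x1) / c**2   # this choice makes t2' equal to t1'
    x1_p, t1_p = lorentz(x1, t1, v, c)
    x2_p, t2_p = lorentz(x2, t2, v, c)
    return x2_p - x1_p, t2_p - t1_p


length, time_mismatch = measured_length(0.0, 1.0, 0.6)
print(length, time_mismatch)   # contracted length, and the vanishing difference t2' - t1'
```

With β = 0.6 the unit length is found reduced by the factor $\sqrt{1 - \beta^2}$, in accordance with equation (38).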


(d) Einstein's velocity addition theorem 


In Newtonian mechanics, relative velocities are found by simple 
addition or subtraction (cf. § 2, p. 229). The composition of velocities 
in a mechanics based on the Lorentz transformation is more involved. 
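Let a frame Σ' move relative to Σ in the direction of the x-axis with
velocity v. Let a point in Σ' have the velocity

$$\mathbf{u}' = u_x'\mathbf{i}' + u_y'\mathbf{j}' + u_z'\mathbf{k}',$$

where

$$u_x' = \frac{dx'}{dt'}, \quad u_y' = \frac{dy'}{dt'}, \quad u_z' = \frac{dz'}{dt'}$$

What are its velocity components $u_x$, $u_y$, $u_z$ in Σ? It follows from the
Lorentz transformation, by differentiation, that

$$\frac{dx}{dt} = \frac{\dfrac{dx'}{dt'}\dfrac{dt'}{dt} + v\dfrac{dt'}{dt}}{\sqrt{1 - \beta^2}} \quad\text{and}\quad \frac{dt'}{dt} = \frac{1 - \dfrac{v}{c^2}\dfrac{dx}{dt}}{\sqrt{1 - \beta^2}}, \tag{40}$$

that is

$$u_x = \frac{(u_x' + v)(1 - vu_x/c^2)}{1 - \beta^2}$$

Solving for $u_x$,

$$u_x = \frac{u_x' + v}{1 + vu_x'/c^2} \tag{41}$$

Hence by (40),

$$\frac{dt'}{dt} = \frac{\sqrt{1 - \beta^2}}{1 + vu_x'/c^2} \tag{42}$$

Similarly

$$u_y = \frac{dy'}{dt'}\frac{dt'}{dt} = \frac{u_y'\sqrt{1 - \beta^2}}{1 + vu_x'/c^2}, \quad u_z = \frac{u_z'\sqrt{1 - \beta^2}}{1 + vu_x'/c^2}$$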
On account of the denominator of (41), the sum of two velocities which 
are separately less than c can never exceed the velocity of light. The 
Einstein velocity addition theorem gives the simplest explanation of 
the index of refraction of moving bodies (the Fizeau Effect, p. 477). 
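
A few numerical cases make the content of the addition theorem plain. The Python sketch below is illustrative only; the function names and the use of units in which c = 1 are assumptions introduced here.

```python
import math


def add_parallel(u_x_prime, v, c=1.0):
    """Equation (41): component of velocity along the relative motion."""
    return (u_x_prime + v) / (1.0 + v * u_x_prime / c**2)


def add_transverse(u_y_prime, u_x_prime, v, c=1.0):
    """Component of velocity normal to the relative motion."""
    return u_y_prime * math.sqrt(1.0 - (v / c) ** 2) / (1.0 + v * u_x_prime / c**2)


print(add_parallel(0.9, 0.9))   # less than 1, although 0.9 + 0.9 exceeds 1
print(add_parallel(1.0, 0.5))   # light remains light: the result is c

# A light ray emitted at right angles to the motion in Sigma':
u_x = add_parallel(0.0, 0.6)
u_y = add_transverse(1.0, 0.0, 0.6)
print(u_x, u_y, math.hypot(u_x, u_y))   # the resultant speed is again c
```

The last case shows that the direction of the ray is changed by the transformation while its speed is not, which is the kinematic root of aberration.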

Despite the fact that the velocity of light appears to be an upper 
limit for velocities, it nevertheless cannot be denied that velocities 
greater than c are known. In optics, for example, there are media 
with index of refraction n < 1. This means that the wave velocity u 
is greater than c. But signals can be given only by pulses, or groups 
of waves, which are propagated with the so-called group velocity (see 
p. 65). Sommerfeld has shown that this velocity is less than or equal 
to c for all media. Another example of such a geometric velocity exceeding
c is the following: Let a long ruler make a very small angle
θ with the x-axis of a rectangular system of co-ordinates. If, now,
the ruler be moved in the y-direction with a velocity v, the point of
intersection with the x-axis will move with a speed v/tan θ. It is
evident that this velocity can be made to exceed c by taking θ sufficiently
small, yet such geometric velocities in excess of c can never act
as connexions between cause and effect. 


(e) Doppler effect and aberration 


Let a source of light L', at rest in the frame Σ', send out a spherical
wave in this frame. Let the wave be represented by

$$S' = \frac{A'}{r'}\,e^{2\pi i\nu'(t' - r'/c) + i\delta'} \tag{45}$$

where all quantities observed in Σ' are denoted by accents. Let this spherical
wave meet an observer P at rest in a system Σ. Let the co-ordinates of P be
x and y in Σ and x' and y' in Σ', taking both the x- and
x'-axes in the direction of the relative motion of Σ' with respect to Σ
(fig. 7). In Σ', let the ray from L' to P make an angle θ' with
the x'-axis, so that

$$r' = x'\cos\theta' + y'\sin\theta'$$

and hence

$$S' = \frac{A'}{r'}\,e^{2\pi i\nu'\left(t' - (x'\cos\theta' + y'\sin\theta')/c\right) + i\delta'} \tag{46}$$

Fig. 7

The observer in Σ interprets the approaching wave surfaces as coming
from a point-source of light belonging to his system, so that the relation
which holds for him is

$$S = \frac{A}{r}\,e^{2\pi i\nu\left(t - (x\cos\theta + y\sin\theta)/c\right) + i\delta} \tag{47}$$

Moreover, the co-ordinates and the time in the two systems must be 
connected by the Lorentz transformation. If we employ these re- 
lationships to express the accented quantities in terms of the un- 
accented ones, then comparison of S with S' must yield a relation which 
is valid for every co-ordinate and for every time. This is possible 
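only if the coefficients of x, y and t in the exponents are separately 
equal. Thus from 

$$2\pi i\nu'\left[\frac{t - vx/c^2}{\sqrt{1-\beta^2}} - \frac{(x - vt)\cos\theta'}{c\sqrt{1-\beta^2}} - \frac{y\sin\theta'}{c}\right] = 2\pi i\nu\left(t - \frac{x\cos\theta}{c} - \frac{y\sin\theta}{c}\right) \qquad (48)$$

we obtain, after reduction, 

$$\nu = \nu'\,\frac{1 + (v/c)\cos\theta'}{\sqrt{1-\beta^2}}, \qquad (49)$$

$$\cos\theta = \frac{\cos\theta' + v/c}{1 + (v/c)\cos\theta'}, \qquad \sin\theta = \frac{\sin\theta'\sqrt{1-\beta^2}}{1 + (v/c)\cos\theta'}, \qquad \tan\theta = \frac{\sin\theta'\sqrt{1-\beta^2}}{\cos\theta' + v/c}. \qquad (50)$$
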

Equation (49) gives the exact expression for the optical Doppler 
effect; this agrees with the acoustic value to terms of second order. 
It is worth noticing, however, that for transverse observation ($\theta = \pi/2$) 
there exists a second-order effect here, while in acoustics $\nu = \nu'$ 
exactly. By (50), $\cos\theta' = -v/c$ for $\theta = \pi/2$, and hence (49) gives 
$\nu = \nu'\sqrt{1-\beta^2}$; thus a shift toward the red takes place. 
Unfortunately, no experiment thus far attempted has been able even to 
approach the accuracy needed for the observation of this effect. On the 
other hand, H. E. Ives was able to increase the accuracy of measure- 
ment of the longitudinal effect as far as second-order terms, by ob- 
serving in both directions along a beam of positive rays and measuring 
the displacement of the mean with respect to the frequency of the 
source at rest. By (49), the mean value is 

$$\bar{\nu} = \frac{\nu'}{\sqrt{1-\beta^2}} = \nu'\left(1 + \tfrac{1}{2}\beta^2 + \dots\right),$$

while from (24) and (25), $\bar{\nu} = \nu'(1 + \beta^2)$. 

The experiment decided in favour of the former value. 
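
The mean value used in Ives's method, and the transverse shift, both follow from (49) and may be checked by a short calculation. The sketch below is illustrative only; the function names and the sample value of β are assumptions made here, not part of the text.

```python
import math


def doppler(nu_source, beta, theta_source):
    """Equation (49): frequency observed in the frame of the observer for a
    source of frequency nu_source; theta_source is the ray angle measured in
    the frame of the source."""
    return nu_source * (1.0 + beta * math.cos(theta_source)) / math.sqrt(1.0 - beta**2)


def mean_longitudinal(nu_source, beta):
    """Mean of the frequencies observed in both directions along the beam."""
    forward = doppler(nu_source, beta, 0.0)
    backward = doppler(nu_source, beta, math.pi)
    return 0.5 * (forward + backward)


def transverse(nu_source, beta):
    """Frequency for observation at right angles in the observer's frame,
    for which cos(theta') = -beta by (50)."""
    return doppler(nu_source, beta, math.acos(-beta))


if __name__ == "__main__":
    beta = 0.005  # illustrative value for a beam of positive rays
    print(mean_longitudinal(1.0, beta) - 1.0)  # close to beta**2 / 2
    print(beta**2)                             # the older theory predicts this shift
    print(transverse(1.0, beta))               # slightly below 1: shift toward the red
```

The relativistic shift of the mean is half the value expected from the older theory, which is what makes the comparison decisive.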

The meaning of equations (50) is the following: If we neglect 
second-order terms, we see from the figure that the source is seen 
from Σ at the position it occupied when the wave just arriving at P 
left the source, i.e. it is seen where it was at a time r'/c earlier. This is 
the so-called time of retardation. In this time the source has moved 
a distance vr'/c, and from fig. 7 we have approximately 

$$\cos\theta = \frac{r'\cos\theta' + r'v/c}{r' + (r'v/c)\cos\theta}.$$

Putting $\cos\theta' = \cos\theta$ in the small second term of the denominator, 
we obtain (50). If the two systems remain in uniform rectilinear 
motion relative to each other we do not notice this displacement, but 
since the earth reverses the direction of motion in its orbit every six 
months, the displacement of the apparent position of a star also reverses. 
This is the phenomenon of aberration , first observed by Bradley in 1728. 
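
The size of the effect can be estimated from the first of equations (50). In the sketch below the orbital velocity of the earth, 29.8 km/s, is an assumed round value not given in the text, and the function names are chosen for illustration.

```python
import math

C = 299_792_458.0                      # velocity of light, m/s
ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def apparent_angle(theta_source, beta):
    """First of equations (50): ray angle theta in the observer's frame
    from the angle theta' in the frame of the source."""
    cos_theta = (math.cos(theta_source) + beta) / (1.0 + beta * math.cos(theta_source))
    return math.acos(cos_theta)


def aberration_arcsec(theta_source, beta):
    """Displacement theta' - theta, expressed in seconds of arc."""
    return (theta_source - apparent_angle(theta_source, beta)) * ARCSEC_PER_RADIAN


if __name__ == "__main__":
    beta_earth = 29.8e3 / C            # assumed orbital velocity of the earth
    print(aberration_arcsec(math.pi / 2.0, beta_earth))
```

For a ray at right angles to the motion the displacement is very nearly v/c radians, i.e. about twenty seconds of arc, and it changes sign when the direction of motion is reversed.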

The ordinary explanation of aberration is based on a ballistic analogy. If 
the ray of light were a projectile making a hole in the objective and in the eye- 
piece of a telescope, the line joining the two holes would, on account of the motion 
of the telescope, deviate from the true direction by the angle computed above. 
Since aberration is a first-order effect (i.e. one depending on v/c), it must follow 
correctly to this approximation from the older mechanics and electrical theory 
as well. 

8. Geometric Representation of the Lorentz Transformation. The 
Four-dimensional World. Calculation with World-vectors. 

In order to compare the transformations of Lorentz and Galileo 
we start with a very simple device, a graphical one-dimensional 
time-table (fig. 8a). We choose as abscissa not the time itself, but the 
product ct, which we denote by m. As ordinate we select the distance x 
traversed on a line which we take as the x-axis. Uniform motion will 
then be represented in this diagram by a straight line of slope $\beta = v/c$. 
A light signal is thus represented by a line of slope unity. Now let all 
quantities be converted to a system moving with velocity v. If the 
Galilean transformation $t' = t$, $x' = x - vt$ holds, this means that we 
take a new m'-axis of slope $\beta$ in the diagram, the x-axis remaining 
unchanged. If the unit on the old m-axis is given by the segment OM, 
then the unit on the m'-axis is given by OM', where M' is the 
intersection of the m'-axis with a line through M parallel to the x-axis. The 
new co-ordinate system is oblique, but the x-axis is the same as before. 

Fig. 8 (a: Galileo; b: Lorentz) 

The situation is different if the Lorentz transformation holds in 
place of the Galilean. As we shall show immediately, the new axes 
and unit segments are obtained as follows: We draw (fig. 8b) the 
hyperbolas 

$$m^2 - x^2 = 1 \quad\text{and}\quad m^2 - x^2 = -1, \qquad (51)$$

whose asymptotes $m^2 - x^2 = 0$ correspond to the propagation of light 
signals. The new m'-axis is drawn as a line of slope $\beta$, as above. The 
unit point M', however, is now the intersection of this line with the 
hyperbola $m^2 - x^2 = 1$. Corresponding to the circumstance that the 
time must also be transformed in the relativity theory, a new x'-axis 
must also be found. This is the line of gradient $1/\beta$, i.e. the line 
symmetric to the m'-axis with respect to the line of slope unity. The 
unit along the x'-axis is determined by the point of intersection X' of 
this axis with the hyperbola $m^2 - x^2 = -1$. We proceed to show 
that these constructions make the transformation from the rectangular 
m, x system to the oblique m', x' system equivalent to the Lorentz 
transformation. First of all, the transformation to oblique co-ordinates 
with the same origin is given by a linear homogeneous transformation 
of the co-ordinates: 

$$m = \lambda m' + \mu x', \qquad x = \nu m' + \rho x'. \qquad (52)$$

We determine the coefficients by means of the unit points. For M' 
we have m' = 1, x' = 0, that is, $m = \lambda$. Again, for the abscissa of the 
intersection of the line $x = \beta m$ with the hyperbola $m^2 - x^2 = 1$ we 
have $1/\sqrt{1-\beta^2}$. Hence $\lambda = 1/\sqrt{1-\beta^2}$. For the x-co-ordinate of 
the same point, $x = \nu = \beta m = \beta/\sqrt{1-\beta^2}$. In the same way, from 
the co-ordinates of the point X', 

$$\mu = \frac{\beta}{\sqrt{1-\beta^2}} \quad\text{and}\quad \rho = \frac{1}{\sqrt{1-\beta^2}}.$$

We thus have 

$$m = \frac{m' + \beta x'}{\sqrt{1-\beta^2}}, \qquad x = \frac{x' + \beta m'}{\sqrt{1-\beta^2}},$$

or 

$$t = \frac{t' + (v/c^2)\,x'}{\sqrt{1-\beta^2}}, \qquad x = \frac{x' + vt'}{\sqrt{1-\beta^2}}.$$

This representation provides an interesting partition of the 
xt-continuum. Since the maximum possible velocity for an agency 
connecting cause and effect is c, the straight line representing such an 
instrumentality can have, at most, a slope equal to unity. Hence if 
in fig. 8b the point O represents the present, then in the shaded area to 
the right of O, we find all events which are causally conditioned by O; 
this region therefore represents the future. All events which 
themselves have conditioned O are found in the shaded area to the left of 
this point: the past. Events whose representative points (“world-points”) 
lie in the unshaded portions of the plane cannot stand in any 
causal relation with O. These points, and only these points, may be 
brought into simultaneity with O by transforming to a moving system, 
since we can pass an x'-axis through such points, thereby making 
m' = 0. If we include the other co-ordinates y and z of 
three-dimensional space, the asymptotes become the lateral surface of a 
four-dimensional cone $x^2 + y^2 + z^2 - m^2 = 0$; this does not change any 
essential item of our discussion. 
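
The partition of the plane described above can be expressed as a short classification rule. This is a sketch only; the function names and the returned labels are chosen here for illustration. The second function gives the value of β for the moving system in which a world-point outside the light lines becomes simultaneous with O, obtained from the condition m' = 0.

```python
def classify(m, x):
    """Classify the world-point (m, x), with m = c*t, relative to the origin O."""
    s = m * m - x * x
    if s > 0:
        return "future" if m > 0 else "past"
    if s < 0:
        return "no causal relation"
    return "on the light line"


def simultaneity_beta(m, x):
    """beta = v/c of the system in which (m, x) is simultaneous with O.

    From m' = (m - beta*x)/sqrt(1 - beta**2) = 0 it follows that beta = m/x,
    which is smaller than unity only for points outside the light lines."""
    if m * m >= x * x:
        raise ValueError("point cannot be brought into simultaneity with O")
    return m / x


if __name__ == "__main__":
    print(classify(2.0, 1.0))           # inside the shaded region, m > 0
    print(classify(1.0, 2.0))           # unshaded region
    print(simultaneity_beta(1.0, 2.0))  # velocity (in units of c) making m' = 0
```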

There is, however, another way of representing the situation which 
is much more important for the formal analysis, but which cannot be 
visualized as well as the above representation, since it employs 
complex numbers. If we introduce the co-ordinate l in place of m = ct, 
where l = ict, we have 

$$x^2 + y^2 + z^2 + l^2 = x'^2 + y'^2 + z'^2 + l'^2. \qquad (54)$$


This means that the square of the radius vector remains unchanged 
under a transformation of co-ordinates in the 
four-dimensional x, y, z, l space. As in the 
three-dimensional case, the only transformation, 
aside from reflection and translation, 
which leaves $r^2$ unaltered is a rotation of the 
co-ordinate system. Actually, it can be shown 
that a rotation of the x, l system of axes 
yields the Lorentz transformation. If the 
l'-axis forms an angle $\varphi$ with the l-axis, 
then, according to the familiar relations of 
analytic geometry,* 

$$l' = l\cos\varphi + x\sin\varphi, \qquad x' = -l\sin\varphi + x\cos\varphi. \qquad (55)$$

Fig. 9 

* This transformation, corresponding to rotation of a plane two-dimensional 
system of co-ordinates, may be verified readily by writing the expression for the 
radius vector in both systems: 

$$\mathbf{r} = l\,\mathbf{l} + x\,\mathbf{i} = l'\,\mathbf{l}' + x'\,\mathbf{i}'.$$

From fig. 9 we see that 

$$\mathbf{l} = \mathbf{l}'\cos\varphi - \mathbf{i}'\sin\varphi \quad\text{and}\quad \mathbf{i} = \mathbf{l}'\sin\varphi + \mathbf{i}'\cos\varphi.$$

Inserting these values in the equation for r and equating the coefficients of $\mathbf{l}'$ and $\mathbf{i}'$ 
to zero separately, we obtain (55). 

If we set 

$$\tan\varphi = -i\beta, \qquad (56)$$

that is, 

$$\cos\varphi = \frac{1}{\sqrt{1-\beta^2}}, \qquad \sin\varphi = \frac{-i\beta}{\sqrt{1-\beta^2}},$$

we obtain 

$$x' = \frac{x + i\beta l}{\sqrt{1-\beta^2}} = \frac{x - vt}{\sqrt{1-\beta^2}}, \qquad l' = \frac{l - i\beta x}{\sqrt{1-\beta^2}} \quad\text{or}\quad t' = \frac{t - (v/c^2)\,x}{\sqrt{1-\beta^2}},$$

i.e. the Lorentz transformation. 
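
That the rotation through the imaginary angle reproduces the Lorentz transformation can also be confirmed with complex arithmetic. The sketch below assumes units in which c = 1 unless another value is passed; the function names and the sample numbers are illustrative only.

```python
import math


def rotate(x, t, beta, c=1.0):
    """Rotation (55) of the x, l axes with tan(phi) = -i*beta, where l = i*c*t.

    Returns the rotated pair (x', l')."""
    l = 1j * c * t
    cos_phi = 1.0 / math.sqrt(1.0 - beta**2)
    sin_phi = -1j * beta * cos_phi
    l_new = l * cos_phi + x * sin_phi
    x_new = -l * sin_phi + x * cos_phi
    return x_new, l_new


def lorentz(x, t, beta, c=1.0):
    """Lorentz transformation to a system moving with velocity beta*c."""
    g = 1.0 / math.sqrt(1.0 - beta**2)
    return g * (x - beta * c * t), g * (t - beta * x / c)


if __name__ == "__main__":
    x_rot, l_rot = rotate(2.0, 3.0, 0.6)
    x_lor, t_lor = lorentz(2.0, 3.0, 0.6)
    print(x_rot, x_lor)        # the same value, apart from a zero imaginary part
    print(l_rot, 1j * t_lor)   # l' equals i*c*t'
    # The square of the radius vector, x^2 + l^2, is unchanged by the rotation.
    print(2.0**2 + (3.0j) ** 2, x_rot**2 + l_rot**2)
```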

Thus we have the important relationship discovered by Minkowski, 
that the transformation from one system of reference to another 
moving relative to the first with a velocity v corresponds to a rotation 
of axes in the four-dimensional x, y, z, l world, the angle of rotation 
being given by equation (56). This makes it possible, by extending 
ordinary vector analysis to four dimensions, to derive generally valid 
laws in the form of equations between four-dimensional vectors. In 
what follows, these vectors will be called world-vectors , and will be 
denoted by bars beneath their symbols. Just as the ordinary vector 
equations representing the laws of Newtonian mechanics may be re- 
solved along any co-ordinate system, so the co-ordinate resolution of 
world-vector equations yields a representation for a given system of 
reference. Resolution according to a system turned with respect to 
the first gives the representation for a reference system in motion 
relative to the first one. 

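The following rules hold for calculations with world-vectors: Addition 
and subtraction are defined as the same operations performed on 
corresponding components. Graphical addition would, of course, be 
possible also in four dimensions, according to the methods of descriptive 
geometry. Since the four-dimensional co-ordinates are also orthogonal, 

$$\mathbf{i}\mathbf{i} = 1,\ \ \mathbf{j}\mathbf{j} = 1,\ \ \mathbf{k}\mathbf{k} = 1,\ \ \mathbf{l}\mathbf{l} = 1,\ \ \mathbf{i}\mathbf{j} = 0,\ \ \mathbf{i}\mathbf{k} = 0,\ \ \mathbf{i}\mathbf{l} = 0,\ \ \mathbf{j}\mathbf{k} = 0,\ \ \text{\&c.} \qquad (59)$$

Hence the scalar product of two world-vectors 

$$\mathbf{u} = u_x\mathbf{i} + u_y\mathbf{j} + u_z\mathbf{k} + u_l\mathbf{l} \quad\text{and}\quad \mathbf{v} = v_x\mathbf{i} + v_y\mathbf{j} + v_z\mathbf{k} + v_l\mathbf{l}$$

is defined as 

$$\mathbf{u}\mathbf{v} = u_x v_x + u_y v_y + u_z v_z + u_l v_l. \qquad (60)$$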

The definition of the vector product is somewhat more difficult, since 
this combination really signifies an antisymmetric tensor, and it is 
more or less fortuitous that such a tensor in three dimensions can be 
represented by a vector. In four-dimensional space an antisymmetric 
tensor has six components, a vector has four. We define as the vector 
product [uv] of two world-vectors the antisymmetric tensor 
$\Phi$, whose components are the determinants of the matrix 

$$\begin{Vmatrix} u_x & u_y & u_z & u_l \\ v_x & v_y & v_z & v_l \end{Vmatrix},$$

i.e. 

$$\Phi_{xy} = u_x v_y - u_y v_x, \qquad \Phi_{xz} = u_x v_z - u_z v_x, \qquad \Phi_{xl} = u_x v_l - u_l v_x,$$

$$\Phi_{yz} = u_y v_z - u_z v_y, \qquad \Phi_{yl} = u_y v_l - u_l v_y, \qquad \Phi_{zl} = u_z v_l - u_l v_z.$$
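
The rules for the scalar product (60) and for the six components of the vector product can be written out in a few lines. This is a sketch under the assumption that a world-vector is stored as a tuple of its four components in the order x, y, z, l, the l-component being imaginary; the names are chosen here for illustration.

```python
from itertools import combinations

AXES = ("x", "y", "z", "l")


def scalar_product(u, v):
    """Equation (60): sum of the products of corresponding components."""
    return sum(a * b for a, b in zip(u, v))


def vector_product(u, v):
    """Six components Phi_ab = u_a*v_b - u_b*v_a of the antisymmetric tensor,
    keyed by the pair of axes: xy, xz, xl, yz, yl, zl."""
    return {
        AXES[a] + AXES[b]: u[a] * v[b] - u[b] * v[a]
        for a, b in combinations(range(4), 2)
    }


if __name__ == "__main__":
    u = (1.0, 2.0, 3.0, 4.0j)   # the l-component is imaginary, l = i*c*t
    v = (1.0, 0.0, 0.0, 1.0j)
    print(scalar_product(u, v))
    print(vector_product(u, v))
```

Interchanging u and v reverses the sign of every component, and the product of a world-vector with itself vanishes, as required of an antisymmetric tensor.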

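Corresponding to the three-dimensional $\nabla$-operator we have a 
four-dimensional operator 

$$\Box = \mathbf{i}\frac{\partial}{\partial x} + \mathbf{j}\frac{\partial}{\partial y} + \mathbf{k}\frac{\partial}{\partial z} + \mathbf{l}\frac{\partial}{\partial l},$$

whose combination with world-vectors or scalars gives, in complete 
analogy with the three-dimensional case, the following differential 
operations: 

$$\operatorname{grad} u = \Box u = \mathbf{i}\frac{\partial u}{\partial x} + \mathbf{j}\frac{\partial u}{\partial y} + \mathbf{k}\frac{\partial u}{\partial z} + \mathbf{l}\frac{\partial u}{\partial l}, \qquad (62)$$

$$\operatorname{div}\mathbf{v} = \Box\mathbf{v} = \frac{\partial v_x}{\partial x} + \frac{\partial v_y}{\partial y} + \frac{\partial v_z}{\partial z} + \frac{\partial v_l}{\partial l}, \qquad (63)$$

$$\operatorname{curl}\mathbf{v} = [\Box\mathbf{v}] = \begin{Vmatrix} \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial z} & \dfrac{\partial}{\partial l} \\ v_x & v_y & v_z & v_l \end{Vmatrix}. \qquad (64)$$
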

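In place of the equation div curl v = 0 we have a somewhat more 
complicated property of world-tensors formed by taking the curl of 
world-vectors; viz. if $\Phi = \operatorname{curl}\mathbf{v}$, i.e. 

$$\Phi_{xy} = \frac{\partial v_y}{\partial x} - \frac{\partial v_x}{\partial y}, \quad \dots, \quad \Phi_{zl} = \frac{\partial v_l}{\partial z} - \frac{\partial v_z}{\partial l},$$

then we have 

$$\frac{\partial \Phi_{xy}}{\partial z} + \frac{\partial \Phi_{yz}}{\partial x} + \frac{\partial \Phi_{zx}}{\partial y} = 0, \quad \text{\&c.}$$

9. Newton’s Second Law in the Theory of Relativity. The Variability of 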
Mass and the Inertia of Energy. 

The Galilean transformation forms the basis of Newtonian me- 
chanics. It is our task to modify the laws of mechanics in such a way 
that they become compatible with the experimentally confirmed 
Lorentz transformation. It is true that the velocities which are met 
with in the mechanics of macroscopic bodies are so small that proof 
of the correctness of the transformation by means of terrestrial ex- 
periments has so far not been found possible; but we can impart to 
electrically charged particles (electrons) velocities approaching c, 
which therefore permit of a test. The formulation of the relativistic 
basis of mechanics is usually approached from electrodynamics, but 
it is also possible to derive the new laws from purely mechanical con- 
siderations. Adopting the latter method of approach, we have to 



Fig. 10 


introduce the concepts of quantity of motion (momentum) and of 
energy into the mechanics of the Lorentz transformation, and de- 
termine the corresponding mathematical expressions in such a way 
that the laws of conservation of momentum and of energy are re- 
tained. 

The simplest process whose result is conditioned by these laws of 
conservation is that of the impact of two equal spheres. Following 
Tolman, we use this case for a hypothetical experiment. Imagine an 
observer located on the y-axis of a frame or system Σ and another 
observer on the y'-axis of a second system Σ'. Let the first observer 
throw a ball along the y-axis, the second one along the negative y'-axis 
(fig. 10). The masses of the two spheres are to be equal, when measured 
in the same system. Assume the two velocities of projection, as 
measured each by the corresponding observer, to be equal and 
opposite. Let a collision take place as a result of the motion of Σ', and let 
the instants at which the spheres are thrown be so chosen that the line 
of centres at the instant of impact has the direction of the y-axis. 
Since the spheres are assumed to be smooth, they cannot exert 
tangential forces on one another, and so the x-components of the 
velocities are unchanged by the impact. If we denote the ball thrown by 
observer A in system Σ by a, the other by b, then, according to the 
rule for the addition of velocities, the velocity components before 
impact are 


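$$\begin{array}{lll} \text{as seen from } \Sigma: & w_{ax} = 0, \quad w_{ay} = u, & w_{bx} = v, \quad w_{by} = -u\sqrt{1 - v^2/c^2}; \\[4pt] \text{as seen from } \Sigma': & w'_{ax} = -v, \quad w'_{ay} = u\sqrt{1 - v^2/c^2}, & w'_{bx} = 0, \quad w'_{by} = -u. \end{array} \qquad (66)$$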


Let all quantities after impact be denoted by bars over the symbols. 
In order to be able to apply the law of the conservation of momentum 
we make the assumption that, in relativistic mechanics also, the 
momentum vector has the direction of the velocity vector. Here, 
however, we replace the scalar factor which represents the constant mass 
of the sphere in Newtonian mechanics by a variable multiplier $\alpha$. 
Being a scalar, this can depend only upon a scalar quantity, say the 
magnitude w of the velocity. Under the above hypotheses, the 
x-component in the system Σ yields 

$$\alpha\!\left(\sqrt{v^2 + u^2\,(1 - v^2/c^2)}\right) v = \alpha\!\left(\sqrt{v^2 + \bar{u}^2\,(1 - v^2/c^2)}\right) v,$$

or 

$$\bar{u} = \pm u. \qquad (68)$$

Newtonian mechanics requires that $\bar{u} = -u$ in this case. Since 
relativistic mechanics must reduce to the Newtonian form for small 
velocities, we must use the negative sign here. Further, the 
y-component yields the equation 

$$\alpha(u)\,u - \alpha\!\left(\sqrt{v^2 + u^2\,(1 - v^2/c^2)}\right) u\sqrt{1 - v^2/c^2} = \alpha(\bar{u})\,\bar{u} - \alpha\!\left(\sqrt{v^2 + \bar{u}^2\,(1 - v^2/c^2)}\right) \bar{u}\sqrt{1 - v^2/c^2}.$$

This is satisfied for $\bar{u} = -u$ if 

$$\alpha(u) = \alpha\!\left(\sqrt{v^2 + u^2\,(1 - v^2/c^2)}\right)\sqrt{1 - v^2/c^2}. \qquad (70)$$

If we now let the velocity of projection become smaller and smaller, 
we obtain in the limit 

$$\alpha(0) = \alpha(v)\sqrt{1 - v^2/c^2}.$$

For very small velocities, however, we must arrive at the Newtonian 
expressions, in which the scalar factor $\alpha$ is the mass $m_0$. If we call 
this scalar factor the mass in the relativity theory also, then the result 
states that the mass increases with increasing velocity. Denoting 
the mass corresponding to a state of rest by $m_0$, we obtain the 
fundamental relation 

$$m = \frac{m_0}{\sqrt{1 - v^2/c^2}}.$$

In what follows, the quantity m will be more accurately termed the 
momental mass or the apparent mass. 
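
It may be verified directly that a multiplier of the form $m_0/\sqrt{1 - w^2/c^2}$ satisfies condition (70) for every velocity of projection u, not merely in the limit of small u. The sketch below assumes units in which c = 1 and a unit rest mass; the function names and sample values are illustrative only.

```python
import math


def alpha(w, m0=1.0, c=1.0):
    """Candidate multiplier: rest mass divided by sqrt(1 - w^2/c^2)."""
    return m0 / math.sqrt(1.0 - (w / c) ** 2)


def condition_70(u, v, c=1.0):
    """Return the left and right members of condition (70) for projection
    velocity u and relative velocity v of the two systems."""
    root = math.sqrt(1.0 - (v / c) ** 2)
    # Speed of ball b before impact as seen from the first system:
    # x-component v, y-component -u*sqrt(1 - v^2/c^2).
    w_b = math.sqrt(v**2 + (u * root) ** 2)
    return alpha(u, c=c), alpha(w_b, c=c) * root


if __name__ == "__main__":
    left, right = condition_70(0.3, 0.8)
    print(left, right)   # the two members agree
```

The agreement rests on the identity $1 - w_b^2/c^2 = (1 - v^2/c^2)(1 - u^2/c^2)$. A constant multiplier, as in Newtonian mechanics, would satisfy (70) only for v = 0.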

We now formulate the fundamental law of mechanics in the original 

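Newtonian form, in which the force is set equal to the time rate of 
change of momentum: 

$$\mathbf{F} = \frac{d(m\mathbf{v})}{dt} = \frac{d}{dt}\!\left(\frac{m_0\,\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right). \qquad (73)$$
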
This differs from the old form in that m may no longer be placed 
before the sign of differentiation, since it is now variable. 

According to this equation, the inertial mass, i.e. the resistance 
offered by a body to acceleration, is not now identical with the 
apparent mass; in fact the inertial mass in a frame of reference in which 
the body has a speed v will actually have different values in the two 
cases when the acceleration takes place in the direction of the initial 
velocity and normal thereto. To show this, we take the x-axis of 
our co-ordinate system in the direction of the velocity, while the 
acceleration is taken to be in the xy-plane, i.e. 

$$\frac{d^2x}{dt^2} = a_x, \qquad \frac{d^2y}{dt^2} = a_y.$$

Carrying out the differentiation in equation (73) and putting $v_x = v$, 
we have 

$$F_x = \frac{m_0\,a_x}{(1 - v^2/c^2)^{3/2}}, \qquad F_y = \frac{m_0\,a_y}{\sqrt{1 - v^2/c^2}}.$$

Thus the inertial mass occurring as the factor of proportionality in 
the equation connecting force and acceleration has the magnitude 

$$\frac{m_0}{(1 - v^2/c^2)^{3/2}}$$

for the x-component, and the magnitude 

$$\frac{m_0}{\sqrt{1 - v^2/c^2}}$$

for the y-component. This means that the force vector and the 
acceleration vector have different directions; the acceleration makes 
a greater angle with the velocity than the force does, since the 
longitudinal inertial mass is greater than the transverse inertial mass. 
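
The difference in direction between force and acceleration is easily illustrated numerically. The following sketch assumes units with c = 1 and unit rest mass; the function names and the chosen values are for illustration only.

```python
import math


def inertial_masses(v, m0=1.0, c=1.0):
    """Return (longitudinal, transverse) inertial mass for speed v."""
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return m0 * g**3, m0 * g


def acceleration_angle(force_angle, v, c=1.0):
    """Angle between acceleration and velocity when a force of unit magnitude
    makes the angle force_angle with the velocity."""
    m_long, m_trans = inertial_masses(v, c=c)
    a_x = math.cos(force_angle) / m_long    # component along the velocity
    a_y = math.sin(force_angle) / m_trans   # component normal to the velocity
    return math.atan2(a_y, a_x)


if __name__ == "__main__":
    print(inertial_masses(0.6))
    # A force at 45 degrees to the velocity gives an acceleration at a larger angle.
    print(math.degrees(acceleration_angle(math.radians(45.0), 0.6)))
```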

The formulation of the Second Law given by equation (73) pre- 
supposes a particular frame of reference. For a frame in motion rela- 
tive to this frame, the quantities entering in (73) must be transformed 
according to the Lorentz transformation. This is unsatisfactory from 
the standpoint of the relativity theory. A generally valid relationship 
ought to have the form of an equation connecting world-vectors, 
which may be resolved (i.e. split up into components) in any four- 
dimensional co-ordinate system, so that the relationship remains valid 
for any given frame, or for any frame in motion relative to it. We 
now attempt to formulate such relationships. First, we note that at 
any given instant one special frame is marked out from all others, 
viz. that which has the same instantaneous velocity v as the particle. 
We shall call this the rest-frame. The time in this frame we call the 
local time or proper time $\tau$. Hence for a frame in which the velocity of 
the particle is v, and the time is t, we have, by equation (36) (p. 243), 

$$d\tau = dt\sqrt{1 - v^2/c^2}.$$

Take now the four-dimensional radius vector by which the position 
and time of the particle are specified: 

$$\mathbf{r} = x\,\mathbf{i} + y\,\mathbf{j} + z\,\mathbf{k} + l\,\mathbf{l},$$

where, as at (54), p. 249, l = ict. By differentiating with respect to 
the proper time, we obtain a new world-vector q which we call the 
four-velocity. For a particle at rest, this vector has the value $\mathbf{q} = ic\,\mathbf{l}$, 
on account of the vanishing of the derivatives $dx/d\tau$, $dy/d\tau$ and $dz/d\tau$. 
The square of the magnitude of the vector q, which is independent of 
the co-ordinate system, is $-c^2$, a constant for all motions. In another 
four-dimensional system of co-ordinates with time t, in which the 
particle has the velocity v, 

$$\mathbf{q} = \frac{d\mathbf{r}}{d\tau} = \frac{d\mathbf{r}}{dt}\,\frac{dt}{d\tau} = \left(\frac{dx}{dt}\,\mathbf{i} + \frac{dy}{dt}\,\mathbf{j} + \frac{dz}{dt}\,\mathbf{k} + ic\,\mathbf{l}\right)\frac{1}{\sqrt{1 - v^2/c^2}}.$$

The spatial components of the world-vector are thus given in this 
system by the components of the ordinary vector 

$$\mathbf{q}_r = \frac{\mathbf{v}}{\sqrt{1 - v^2/c^2}},$$

and the time component (l-component) is given by 

$$q_l = \frac{ic}{\sqrt{1 - v^2/c^2}}.$$

Hence in the above form of the Second Law, in (73), p. 254, we can 
put the spatial components of q, combined into a vector $\mathbf{q}_r$, in place 
of $\mathbf{v}/\sqrt{1 - v^2/c^2}$: 

$$\mathbf{F} = \frac{d(m_0\,\mathbf{q}_r)}{dt}.$$

We now introduce in place of the force F a world-vector P, the 
“four-force”, whose spatial components are identical with those of F in 
the rest-frame, but which satisfy the relationship 

$$\mathbf{P}_r = \frac{\mathbf{F}}{\sqrt{1 - v^2/c^2}}$$

in another reference frame. Then the Second Law has the familiar 
form 

$$\mathbf{P} = \frac{d(m_0\,\mathbf{q})}{d\tau}. \qquad (80)$$


This form leads to a further important result. 
If, in analogy with the method of proving the theorem of the 
conservation of energy in classical mechanics, we take the scalar 
product of both members of (80) with q, we obtain 

$$\mathbf{P}\mathbf{q} = \frac{d(m_0\,\mathbf{q})}{d\tau}\,\mathbf{q} = \frac{1}{2}\,\frac{d(m_0\,q^2)}{d\tau} = 0. \qquad (81)$$

The right side vanishes, since the magnitude of q is the constant ic. 

If the left member be written in components, the spatial components 
being combined into ordinary vectors, we have 

$$\mathbf{P}_r\mathbf{q}_r + P_l\,q_l = 0,$$

i.e. 

$$\frac{\mathbf{F}\mathbf{v}}{1 - v^2/c^2} + P_l\,\frac{ic}{\sqrt{1 - v^2/c^2}} = 0.$$

The product Fv is the work done per unit time. If we replace $P_l$ 
on the left side by the value 

$$P_l = \frac{d}{d\tau}\!\left(\frac{m_0\,ic}{\sqrt{1 - v^2/c^2}}\right) = \frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{d}{dt}\!\left(\frac{m_0\,ic}{\sqrt{1 - v^2/c^2}}\right)$$

obtained from equation (80), we obtain 

$$\mathbf{F}\mathbf{v} = \frac{d}{dt}\!\left(\frac{m_0 c^2}{\sqrt{1 - v^2/c^2}}\right) = \frac{dN}{dt}.$$

The quantity N introduced here has the value 

$$N = \frac{m_0 c^2}{\sqrt{1 - v^2/c^2}} = m_0 c^2 + \tfrac{1}{2}\,m_0 v^2 + \tfrac{3}{8}\,m_0\,\frac{v^4}{c^2} + \dots, \qquad (82)$$


to which a constant of integration must still be added. If we give 
this constant the value $-m_0c^2$, we see that the development of the 
radical is a first approximation to the kinetic energy T of Newtonian 
mechanics. Thus the conservation of energy retains its usual form: 
the work done per second is equal to the increase in kinetic 
energy per unit time. 

If we take the constant of integration to be zero, the energy contains 
the quantity $m_0c^2$, which has the dimensions of energy. It is natural to 
interpret this to mean that inertial mass represents a large accumulation 
of energy and that, reciprocally, to every amount of energy E there 
corresponds an inertial mass 

$$m_0 = \frac{E}{c^2}. \qquad (83)$$

This is the Principle of the Inertia of Energy.* 
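
The expansion (82) and the relation (83) lend themselves to a brief numerical check. In the sketch below the function names, the default units (c = 1, unit rest mass) and the sample figure of 4.184 × 10⁵ joules are assumptions made for illustration; they are not taken from the text.

```python
import math

C_SI = 299_792_458.0  # velocity of light in m/s


def total_energy(v, m0=1.0, c=1.0):
    """The quantity N of equation (82), without constant of integration."""
    return m0 * c**2 / math.sqrt(1.0 - (v / c) ** 2)


def kinetic_energy(v, m0=1.0, c=1.0):
    """N with the constant of integration -m0*c^2 added."""
    return total_energy(v, m0, c) - m0 * c**2


def series(v, m0=1.0, c=1.0):
    """First two velocity-dependent terms of the expansion (82)."""
    return 0.5 * m0 * v**2 + 0.375 * m0 * v**4 / c**2


def mass_equivalent(energy_joule, c=C_SI):
    """Equation (83): inertial mass in kilograms corresponding to an energy."""
    return energy_joule / c**2


if __name__ == "__main__":
    print(kinetic_energy(0.1), series(0.1))   # nearly equal at one tenth of c
    # Illustrative energy of about 4.184e5 J (assumed figure): the mass is minute.
    print(mass_equivalent(4.184e5))
```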

According to this law, the inertial mass of a body is increased 
whenever energy is supplied (e.g. by heating). The energy available 
in the laboratory, however, is far too small to make this increase of 
mass observable in any way, but it is otherwise with the enormous 
energy changes involved in nuclear processes (see Chap. XLII). The 
loss in mass of the stars, resulting from radiation of energy, is also 
considerable (cf. Ex. 67). 

Ex. 67. What is the annual loss in mass of the sun, if approximately 2 calories 
of radiated energy are received by each square centimetre of the earth’s surface 
per minute? (The distance from the earth to the sun is about $150 \times 10^6$ km.) 
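
One way of working Ex. 67 is sketched below. The conversion of 4.184 joules per calorie, the rounded velocity of light and the length of the year are assumed values supplied here, and the function name is illustrative. The radiation received per unit area is spread over a sphere whose radius is the distance of the earth from the sun, and the total power is divided by c² in accordance with (83).

```python
import math

CAL_TO_JOULE = 4.184                      # assumed conversion factor
C = 2.998e8                               # velocity of light, m/s
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def solar_mass_loss_per_year(flux_cal_cm2_min=2.0, distance_km=150e6):
    """Annual loss in mass of the sun, in kilograms."""
    # Convert calories per square centimetre per minute to watts per square metre.
    flux = flux_cal_cm2_min * CAL_TO_JOULE / 1e-4 / 60.0
    radius = distance_km * 1e3                       # metres
    power = 4.0 * math.pi * radius**2 * flux         # total radiated power, watts
    return power / C**2 * SECONDS_PER_YEAR


if __name__ == "__main__":
    print(solar_mass_loss_per_year())
```

With these figures the loss comes out of the order of 10¹⁷ kg per year, i.e. several million tons per second.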

10. Fundamental Concepts of the Generalized Theory of Relativity. 

The Lorentz transformation refers only to frames of reference 
which are in uniform rectilinear motion with respect to each other. 
We have seen that the assumption of an absolute space is superfluous 
and in contradiction with experience for this type of motion. But the 
situation is very much more difficult for frames in accelerated motion, 
e.g. in rotation. The phenomena accompanying rotation do seem to 
point to the existence of an absolute space, for which alone Newton’s 
Second Law is valid; for the occurrence of inertial forces, such as 
centrifugal or Coriolis forces, indicates that the bodies are in accelerated 
motion with respect to such a space. But is it not more advisable to 
say that the accelerated motion takes place with respect to the re- 
maining masses of the universe? If these masses, and not a cosmic 
ether, are the cause of centrifugal forces, then such forces would have 
to vanish if the remaining material of the cosmos were removed. But, 
unfortunately, this is an experiment which one cannot perform. Never- 
theless, as early as 1896, B. and J. Friedlander undertook experi- 
ments to detect the existence of feeble centrifugal forces on stationary 
bodies placed near large, rapidly rotating masses, the latter being 
intended to take the place, in a certain measure, of the masses of the 

* The history of this far-reaching law is, briefly, as follows: As early as 1887, Voigt 
took up the discussion of the time transformation in a little-noticed paper, and in 
1892 Lorentz revived the question in connexion with the optics of moving media. In 
1900 Larmor wrote the Lorentz transformation in the form we know it to-day, and 
four years later Lorentz published a paper in which he treated the optics of moving 
media entirely from the standpoint of the Lorentz transformation. In 1905 Poincaré 
and Einstein independently arrived at substantially the conclusions given in § 7. 
Einstein’s paper contained the mass-energy relation. Shortly before this, Hasenöhrl, 
in Austria, treated the special case of a radiation-filled enclosure, showing that the 
contained radiation would oppose any acceleration of the container as though the 
radiation had a virtual mass given by (83). 

universe. On account of the disparity of the masses in question, the 
experiment would be doomed to failure even now. Even so, we may 
claim that a theory which explains inertial forces through the action 
of matter has the advantage of not introducing a hypothetical space 
(cosmic ether), whose existence has not been verified in the realm of 
uniform motion. But the path which leads to a strict theory based 
upon these ideas is long and arduous. In the first place, we must give 
up something which we are naturally inclined to take for granted: 
the validity of Euclidean geometry in our four-dimensional space-time 
continuum. This geometry, which was fundamental in the special 
theory of relativity, appeared in the expression for the square of the 
radius vector $r^2 = x^2 + y^2 + z^2 + l^2$ (Theorem of Pythagoras). 

The idea that the physical space in which we live might not have 
the properties of Euclidean geometry is not new. Riemann expounded 
the view that the metric of space is a physical property, and not a 
matter of intuition. As long as we measure small triangles on the sur- 
face of the earth with ordinary rough apparatus, we find that all the 
propositions of Euclidean geometry — in particular that concerning 
the sum of the angles of a triangle — are valid. However, if we deal 
with larger areas, the straight lines are replaced by chains stretched 
over the earth’s surface, and more exact measurement reveals that the 
sum of the angles of a triangle is greater than $\pi$. In exactly the same 
way, a triangle in space, whose sides are formed, say, by light rays, 
might have an angle sum different from $\pi$; in fact, Gauss tried in this 
way, though without success, to find such a departure from the 
Euclidean metric. 


From the epistemological standpoint, the idea here advanced 
means nothing less than a reintroduction of the ether, since we are 
really once more ascribing physical properties to space. But we can 
dispense with this physical space by using the formula: “It is mere 
assumption to regard space-time as Euclidean. By suitable hypotheses 
with regard to the metric of the space-time frame, a more perspicuous 
representation of the world may be obtained.” 

Einstein’s fundamental idea in the generalized theory of relativity 
is to determine the metric of the space-time continuum by 
means of the distribution and velocity of matter in such a way that 
in a frame of reference relative to which the matter is in accelerated 
motion the inertial forces result automatically. In this connexion 
we recall the equivalence of gravitational and accelerational fields 
mentioned on p. 231. As explained there, this equivalence is based 
on the equality of inertial and gravitational mass for all bodies. This 
equality forms the basis of the generalized theory of relativity, just 
as the failure of the experiment to detect an ether-wind forms 
the basis of the special theory. In the generalized relativity theory the 
fundamental law of mechanics is: A particle (or light ray) upon which 
no electromagnetic forces act describes a line of stationary length 
(geodesic, or generalized straight line) in the space-time continuum, 
the metric of the continuum being determined by the distribution and 
velocities of the material bodies. This connexion is mathematically 
very complicated and cannot be expounded in detail here; the ex- 
pression for a geodesic in the non-Euclidean metric is also compli- 
cated. The new feature is that gravitation, which is not included in 
electromagnetic effects, appears in this manner as a property of space, 
and that the same centrifugal forces would arise whether the earth is 
considered to turn on its axis or the rest of the universe to be rotating 
about this axis. Hence the conflict between the points of view of 
Ptolemy and Copernicus vanishes. 

There are three cases where the generalized theory of relativity 
leads to small deviations from the results of the old physics. The 
calculations yield the result that the perihelia of the planetary orbits 
advance by very small amounts. For the planet Mercury, where the 
effect is greatest, the amount of this advance is computed to be only 
43" per century. Again, on account of the change in the metric in the 
neighbourhood of heavy bodies, a light ray passing near the sun on 
its way to us from a distant star will suffer a small deviation, so that 
the position of a star whose line of sight grazes the sun will be some- 
what displaced. This effect can be looked for at the time of a total 
solar eclipse. Finally, the change in the time metric caused by gravi- 
tation causes spectral lines originating in a strong gravitational field 
(e.g. that of the sun) to be shifted very slightly toward the red, when 
compared by an outside observer with lines generated in his own 
system. All three effects exist. However, it is a matter of extreme 
difficulty to allow correctly for all disturbing influences, so that we 
are not yet justified in speaking of a quantitative verification of the 
generalized theory of relativity. 
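
The order of magnitude of the three effects may be illustrated with the formulas usually quoted for them. These formulas are not derived in the text and are assumed here: an advance of $6\pi GM/[c^2 a(1 - e^2)]$ per revolution, a deflection of $4GM/(c^2 R)$ for a ray grazing the sun, and a relative red shift of $GM/(c^2 R)$. The astronomical constants below are likewise assumed round values, and the function names are illustrative.

```python
import math

GM_SUN = 1.32712e20        # gravitational constant times solar mass, m^3/s^2 (assumed)
C = 2.99792458e8           # velocity of light, m/s
R_SUN = 6.957e8            # radius of the sun, m (assumed)
A_MERCURY = 5.7909e10      # semi-major axis of Mercury's orbit, m (assumed)
E_MERCURY = 0.2056         # eccentricity of Mercury's orbit (assumed)
PERIOD_DAYS = 87.969       # period of revolution of Mercury, days (assumed)
ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def perihelion_advance_arcsec_per_century():
    """Advance of the perihelion of Mercury in seconds of arc per century."""
    per_orbit = 6.0 * math.pi * GM_SUN / (C**2 * A_MERCURY * (1.0 - E_MERCURY**2))
    orbits_per_century = 36525.0 / PERIOD_DAYS
    return per_orbit * orbits_per_century * ARCSEC_PER_RADIAN


def light_deflection_arcsec():
    """Deflection of a ray grazing the sun, in seconds of arc."""
    return 4.0 * GM_SUN / (C**2 * R_SUN) * ARCSEC_PER_RADIAN


def solar_redshift():
    """Relative shift of wave-length for lines originating at the sun's surface."""
    return GM_SUN / (C**2 * R_SUN)


if __name__ == "__main__":
    print(perihelion_advance_arcsec_per_century())
    print(light_deflection_arcsec())
    print(solar_redshift())
```

The first figure reproduces the advance of about 43" per century mentioned above; the other two show how small the remaining effects are.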

Since the idea of determining the metric of the space-time con- 
tinuum in such a way that the laws of nature are automatically con- 
tained therein has been successful in the case of gravitation, it was 
natural to attempt to extend the notion to electromagnetic phe- 
nomena also. Einstein, by means of a still more general geometry, 
actually was able to devise a unified field theory which includes elec- 
tromagnetism. Nevertheless, it must be recognized that such a corre- 
lation embraces only one aspect of the universe; it fails to explain 
atomic structure — the electrons, protons and quanta. 

With the large particle speeds now available in the laboratory, the 
special theory of relativity has become indispensable in atomic and 
nuclear research. However, up to the present time at least, the general 
theory has been applied only to cosmological physics, and more specific 
connexions with experiment are still lacking. 



PART III 

Field Theory of Electromagnetic and Optical Phenomena 


Two different methods are employed in the theoretical treatment of electrical 
and magnetic phenomena. The first method starts with macroscopically measur- 
able quantities and describes mathematically the relationships between such 
quantities found by experiment. An enormous range of phenomena may be 
brought within the scope of this method in the form of a system of differential 
equations. The integration of these equations for particular cases then permits 
us to answer, by rigorous calculation, a large number of questions, including 
those of technical importance. This method employs only experimental laws 
and their mathematical consequences; it is therefore entirely free of hypothesis 
and cannot come into conflict with experience. Since it makes no assumptions 
whatsoever concerning the structure of electrical charges or of matter, we call 
it the continuum theory or the field theory. Nevertheless it was soon recognized 
that a large number of very striking phenomena, e.g. those of electrolysis, 
cannot be explained if this point of view is taken. In order to explain these 
phenomena we must make assumptions concerning the structure of matter and 
electricity. We cannot verify these assumptions by direct observation, 
but only by testing their consequences. This second method will be called the 
atomistic method. While the continuum theory will be treated first, we shall 
not attempt to exclude all reference to atomistic concepts; on the contrary, we 
shall from time to time take a side-glance at the atomistic picture of the phenomena. 




CHAPTER XI 


The Electrostatic Field in a Vacuum (or in Air) 

1. Definitions. 

[Fig. 1: two horizontal metal plates with oil droplets in the space between them] 


(a) Electrical field strength and charge 

In order to arrive at a definition of electrical field strength we make 
use of an arrangement whose real purpose we shall not learn until 
later (p. 425). Two horizontal metal plates (fig. 1) are placed opposite 
each other at a distance of a few centimetres and are held in this 
position by amber supports. By means of an atomizer, a cloud of very 
fine droplets of oil is blown into the space between the plates. 
The drops are illuminated from the side and may be observed in a 
telescope. 

We observe that, under the combined effect of gravity and air 
resistance (cf. Stokes’s theory, p. 217), the drops fall with uniform 
velocity. We now touch the upper plate with a glass rod which has 
been rubbed with silk, the lower plate being connected to the earth 
by a wire. It is observed that the motion of the droplets changes: 
some rise, some fall with various speeds, and others remain suspended 
in space. If we concentrate attention on a single drop, we observe 
that it changes its velocity now and then, especially if some radium 
is placed near by. The direction of motion, however, is always normal 
to the plates. We conclude from this observation that a new force, 
whose cause is connected with the touching of the upper plate by the 
glass rod, acts on the droplets. The space between the plates has thus 
become the seat of a field of force which is superimposed upon the forces 
already present. We find further that the magnitude of the force and 
its algebraic sign depend on the state of the droplet. Hence we set 
the force vector F equal to the product of a vector E, independent of 
the state of the drop, and a number e which we call the charge * on 
the drop: 

F = eE (1) 

E is called the electrical field strength. By observing the changes in 
the operating force, as revealed by changes in the velocity of rise or 
fall of the drop, we conclude that the droplets alter their charges 
under the influence of the radium radiations. 

* The droplet becomes charged when it is formed by atomization of the oil. 

The algebraic sign of the charge is conventionally determined in 
such a way that a particle repelled by the upper plate — the one touched 
by the glass rod — is said to carry a positive charge. In the light of 
our present knowledge, this convention is not a happy one, since the 
most important charged particle, the electron, is thus given the nega- 
tive sign. 

(b) Lines of force and flux 

Suppose that we have determined the field strength E at every 
point of space by means of a test body whose charge we arbitrarily 
take to be unity. We can then specify the field strength at every point 
by a vector. However, such a representation does not afford a very 
good general view of the whole field. It is better to draw curves whose 
tangents give the direction of the field at any point. Then the strength 
of the field must be specified in some other way. We still have a choice 
as to how close together we shall draw these curves, the so-called 
lines of force. We adopt a method which gives the density of the lines 
of force as a measure of the field strength. The lines of force of the 
electrostatic field form an orthotomic system, i.e. they are the ortho- 
gonal trajectories of a family of surfaces. This is not true of all vector 
fields. At every point of space there is a certain plane, which is tangen- 
tial to the orthogonal surface at that point. We arrange matters so 
that the number of lines of force passing through each unit area of the 
orthogonal surface is proportional to the field strength. The number 
of lines of force passing through an arbitrary area is called the flux 
or induction Φ through that area. As may be seen from fig. 16 (p. 23), 
the flux through an arbitrarily oriented element of area dS amounts to 

$d\Phi = \mathbf{E} \cdot d\mathbf{S}$ (2) 

Hence the flux through any finite surface S is 

$\Phi = \int_S \mathbf{E} \cdot d\mathbf{S}$ (3) 
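As a small numerical illustration of equation (2), the following sketch evaluates the flux of a uniform field through a flat element of area for several orientations of the normal. The function name and the sample values are chosen here for illustration only and do not come from the text.

```python
import math

def flux_through_element(E, n, area):
    """Flux dPhi = E . dS through a small flat element of the given area.

    E is the field vector, n any vector along the normal of the element
    (it is normalized here), so that dS = area * n / |n|.
    """
    norm = math.sqrt(sum(c * c for c in n))
    unit = [c / norm for c in n]
    return area * sum(e * u for e, u in zip(E, unit))

E = (0.0, 0.0, 2.0)  # uniform field along z, arbitrary units

print(flux_through_element(E, (0, 0, 1), 3.0))  # element normal to the field
print(flux_through_element(E, (1, 0, 1), 3.0))  # element tilted by 45 degrees
print(flux_through_element(E, (1, 0, 0), 3.0))  # element parallel to the field
```

The flux is greatest when the element is normal to the lines of force and vanishes when the element contains them, as the lines-of-force picture requires.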

2. Electrical Charge (Quantity of Electricity) as the Source of Flux. 

If we take the integral $\oint \mathbf{E} \cdot d\mathbf{S}$ over a closed surface S, this gives 
the excess of the number of lines of force leaving the volume bounded 
by S over the number entering. If, in the experimental arrangement 
of § 1, we take the surface S so that it does not intersect the plates, 
we find that the integral vanishes: * that is, no lines of force originate 
in the space between the plates. 

* With the proviso, of course, that there are now no charged particles between the 
plates. 

The situation is different if our surface encloses, say, the upper plate. 
The surface integral now has a 
definite value, and we can connect the non-vanishing of this integral 
with the fact that the plate was touched by the rubbed glass rod. 
We naturally conjecture that we have thus given the plate an electric 
charge, and can verify this directly by bringing the plate into a similar 
system of larger dimensions; we shall then recognize the presence of 
a charge by the resulting forces, just as in the case of the oil drops. We 
take the value of the integral as a measure of the charge within the 
enclosed region, and set 

$e \propto \oint \mathbf{E} \cdot d\mathbf{S}$, (4) 


the units being still undetermined. The electrical charge or quantity 
of electricity thus plays a double role: an active part as the generator 
of a field, and a passive part as the object upon which the field acts. 
The density of charge at a point P of space is the limit of the ratio of 
the charge within a volume element containing P to the volume of 
this element: 

$\rho = \lim_{\Delta\tau \to 0} \frac{\Delta e}{\Delta\tau} \propto \operatorname{div} \mathbf{E}$. (5) 



That is, at every point of space the divergence of the electrical field 
strength is proportional to the density of charge at that point. The 
limit of Δe/Δτ is not always finite; in nature we find surface distri- 
butions of charge as well as volume distributions. If we construct a 
cylinder whose base is ΔS, and whose generators are normal to a charged 
surface, we can make the altitude of the cylinder small of a higher 
order than the base without changing the amount of electricity en- 
closed by it. In this instance we define the surface density of 
charge σ to be the limit of Δe/ΔS, i.e. the amount of electricity 
on unit area of the surface. Let us take the surface integral 
of the electrical field strength over such a small cylinder (cf. 
p. 43, fig. 27). We distinguish the two sides of the surface by 
the subscripts 1 and 2, and suppose the normal to be directed 
from 1 towards 2. Then, since the lateral surfaces are small of a 
higher order than ΔS, we have 

$\Delta S\,(\mathbf{n} \cdot \mathbf{E}_2 - \mathbf{n} \cdot \mathbf{E}_1) \propto \Delta e$. (6) 

If the limit of Δe/ΔS is not zero, n·E_2 − n·E_1 must be different from 
zero; that is, the normal component of the field strength is 
discontinuous at the surface. The expression n·E_2 − n·E_1 
is called the surface divergence of E (written Div E). 
We may thus say that for a surface distribution of 
charge, the surface density is proportional to the surface divergence of E. 

$\operatorname{Div} \mathbf{E} \propto \sigma$ (7) 


To bring the quantities e, E into connexion with the absolute 
system of units, we start with the field of a single point charge e. From 
symmetry we see that the lines of force are radial and of equal density 
in all directions. The orthogonal surfaces in this case are spheres with 
centre at e. The induction through a sphere of radius r amounts to 

$\Phi = \oint \mathbf{E} \cdot d\mathbf{S} \propto e$. 

Since the density of the lines of force is the same everywhere, |E| is 
constant over the surface of the sphere and 

$\Phi = \oint \mathbf{E} \cdot d\mathbf{S} = 4\pi r^2 |\mathbf{E}| = \gamma e$, i.e. $|\mathbf{E}| = \frac{\gamma e}{4\pi r^2}$, 

where γ is a factor of proportionality. The force on a second charge 
e' becomes 

$F = \frac{\gamma e e'}{4\pi r^2}$. (8) 

This is Coulomb's Law. * 
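The statement that the flux depends only on the enclosed charge, and not on where the charge sits inside the surface, can be checked numerically. The sketch below assumes γ = 4π (the electrostatic system introduced in the next paragraph), so that the expected flux is 4πe for an enclosed charge and zero for a charge outside; the function name and grid sizes are illustrative assumptions.

```python
import math

def flux_through_sphere(e, charge_pos, R, n_theta=200, n_phi=400):
    """Midpoint-rule estimate of the closed-surface integral of E . dS
    over a sphere of radius R centred at the origin, for a point charge e
    at charge_pos with field E = e * s / |s|^3 (gamma = 4 pi)."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # outward unit normal at the surface point
            n = (math.sin(theta) * math.cos(phi),
                 math.sin(theta) * math.sin(phi),
                 math.cos(theta))
            # vector from the charge to the surface point
            s = [R * n[k] - charge_pos[k] for k in range(3)]
            dist = math.sqrt(sum(c * c for c in s))
            E_dot_n = e * sum(s[k] * n[k] for k in range(3)) / dist ** 3
            total += E_dot_n * R * R * math.sin(theta) * d_theta * d_phi
    return total

inside = flux_through_sphere(1.0, (0.3, 0.0, 0.2), 1.0)
outside = flux_through_sphere(1.0, (0.0, 0.0, 3.0), 1.0)
print(inside, 4.0 * math.pi)  # enclosed charge: close to 4 pi e
print(outside)                # charge outside the surface: close to zero
```

Only for a charge at the centre is |E| constant over the sphere; for the displaced charge the field varies from point to point, yet the total flux is unchanged.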

The units are determined by specifying the factor of proportionality 
y. If we place y = 4tt, we obtain e in the absolute electrostatic system. 
In addition, however, there are three other systems of units in use. 
To avoid confusion it is desirable to use only two systems. One is the 
C.G.S., introduced above, in which all electrical quantities are measured 
in electrostatic units (e.s.u.) and all magnetic quantities in electro- 
magnetic units (e.m.u.), to be introduced later. The other is the 
rationalized metre-kilogram-second (M.K.S.) system, sometimes called 
the Giorgi system after one of its first proponents. The fundamental 
mechanical and electrical units in this system are the metre, kilogram, 
second, ampere and volt, the size of the electrical units being given 
by their legal definitions in terms of the amount of silver deposited 
electrolytically and the electromotive force of a standard cell, respectively. 
The M.K.S. unit of force is called one newton, and is equal to 
10^5 dynes. The C.G.S. system is still indispensable in atomic physics, 
for all numerical constants are specified in terms of these units, but 
the M.K.S. system has such great advantages in macroscopic problems 
that its use is becoming more and more widespread. In using M.K.S. 
units, however, it must be remembered that even quantities such as the 
number of molecules per mol differ from the familiar C.G.S. values, for 
in the M.K.S. system the kilogram mol is used in place of the gram mol. 

* If we examine the familiar diagram giving the lines of force in the neighbourhood 
of two equal and opposite point charges, we see that all the lines of force originating 
at the positive charge converge into the negative. Hence the field is no longer 
spherically symmetrical [...] 
above gives the forces acting upon a third charge introduced into the field. 

For comparison, the more important electrical formulas will be 
given here in both systems. The M.K.S. unit of quantity of electricity 
is the ampere-second (coulomb), and that of field strength is the volt / 
metre. Thus the relation between charge and field strength as given 
by (4) contains a dimensional factor of proportionality, called the 
permittivity (or capacitivity) * of free space: 

$K_0 = 8.859 \times 10^{-12}$ amp. sec./metre volt. 


According to (8), we then have, in 

    C.G.S. units (γ = 4π)              M.K.S. units (γ = 1/K_0) 

    ∮ E·dS = 4πe        (4a)           e = K_0 ∮ E·dS          (4b) 
    div E = 4πρ         (5a)           ρ = K_0 div E           (5b) 
    Div E = 4πσ         (7a)           σ = K_0 Div E           (7b) 
    F = ee'/r²          (8a)           F = ee'/(4πK_0 r²)      (8b) 
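The two forms (8a) and (8b) of Coulomb's law must give the same force once the units are converted. The sketch below compares them for two charges of one microcoulomb placed 10 cm apart. The value of K_0 is the one quoted in the text; the conversion factor between the coulomb and the electrostatic unit of charge (about 2.998 × 10^9 e.s.u. per coulomb) is not given in this section and is an assumption introduced here.

```python
import math

K0 = 8.859e-12             # permittivity of free space as quoted in the text
ESU_PER_COULOMB = 2.998e9  # assumed conversion factor, not stated in the text
DYNES_PER_NEWTON = 1.0e5

def coulomb_force_mks(e1, e2, r):
    """Force in newtons; charges in coulombs, r in metres (equation 8b)."""
    return e1 * e2 / (4.0 * math.pi * K0 * r * r)

def coulomb_force_cgs(e1, e2, r):
    """Force in dynes; charges in e.s.u., r in centimetres (equation 8a)."""
    return e1 * e2 / (r * r)

charge_coulomb = 1.0e-6
f_mks = coulomb_force_mks(charge_coulomb, charge_coulomb, 0.1)
f_cgs = coulomb_force_cgs(charge_coulomb * ESU_PER_COULOMB,
                          charge_coulomb * ESU_PER_COULOMB, 10.0)

print("M.K.S. result in newtons:", f_mks)
print("C.G.S. result in newtons:", f_cgs / DYNES_PER_NEWTON)
```

The two results agree to better than one part in a thousand; the small residual difference reflects the rounded constants used, not a difference between the two systems.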


* The terms dielectric constant and specific inductive capacity refer to material [...] 

3. The Electrostatic Potential. 

The work done by the field when the charge e experiences a displacement ds is 

$dW = \mathbf{F} \cdot d\mathbf{s} = e\,\mathbf{E} \cdot d\mathbf{s}$. (9) 

If e is moved a finite distance along a curve from P_0 to P_1, the total 
work done is 

$W = \int_{P_0}^{P_1} e\,\mathbf{E} \cdot d\mathbf{s} = e \int_{P_0}^{P_1} \mathbf{E} \cdot d\mathbf{s}$. (10) 

This work is found to be independent of the path C over which the charge 
is moved, so that for every closed path 

$\oint \mathbf{E} \cdot d\mathbf{s} = 0$. (10') 

This means that the field strength E is irrotational. Now we know 
from vector analysis that every irrotational vector may be represented 
as the gradient of a scalar. This means a great simplification: 
instead of giving the numerical values of three components at each 
point, it is sufficient to give the numerical value of one scalar at every 
point. Then by taking the gradient, i.e. by pure differential operations, 
the vector components may be found. The field considered above is 
of this kind, for in this field 

$dW = \operatorname{grad} W \cdot d\mathbf{s}$, 

and by equation (9), for e = 1, 

$dW = \mathbf{E} \cdot d\mathbf{s}$, i.e. $\mathbf{E} = \operatorname{grad} W$. 

However, it is more convenient to work with the quantity (−W), in 
analogy with potential energy in mechanics. This would be the work 
done against the field when unit charge moves. This quantity is called 
the electrostatic potential V.* It is to be noted that the product eV, 
not V itself, has the dimensions of work. In terms of this concept we 
have 

$\mathbf{E} = -\operatorname{grad} V$; $\int_{P_0}^{P_1} \mathbf{E} \cdot d\mathbf{s} = V_0 - V_1$. (11) 

Taking the divergence of the field strength, we have by (5a) 

$\operatorname{div} \operatorname{grad} V = \Delta V = -4\pi\rho$, or $K_0\,\Delta V = -\rho$. (12) 

This differential equation for the electrostatic potential is known as 
Poisson's Equation. In particular, at all points of a region devoid of 
charge, 

$\Delta V = 0$, (13) 

which is Laplace's Equation. 
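Equations (12) and (13) can be tried out on simple potentials by estimating ΔV with finite differences. The sketch below uses two trial functions chosen here for illustration: the potential 1/r of a unit point charge, which should satisfy Laplace's equation away from the origin, and the quadratic function −(2π/3)ρr², whose Laplacian is −4πρ everywhere. The step size h is an arbitrary choice.

```python
import math

def laplacian(f, x, y, z, h=1e-3):
    """Central-difference estimate of d2f/dx2 + d2f/dy2 + d2f/dz2."""
    return (f(x + h, y, z) + f(x - h, y, z)
            + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h)
            - 6.0 * f(x, y, z)) / (h * h)

def point_potential(x, y, z):
    """Potential 1/r of a unit point charge at the origin (C.G.S. units)."""
    return 1.0 / math.sqrt(x * x + y * y + z * z)

RHO = 1.0  # uniform charge density for the quadratic trial potential

def quadratic_potential(x, y, z):
    """Trial potential -(2 pi / 3) rho r^2, which satisfies Poisson's equation."""
    return -(2.0 * math.pi / 3.0) * RHO * (x * x + y * y + z * z)

print(laplacian(point_potential, 1.0, 2.0, 3.0))      # close to 0, equation (13)
print(laplacian(quadratic_potential, 1.0, 2.0, 3.0))  # close to -4 pi rho, equation (12)
print(-4.0 * math.pi * RHO)
```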

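On account of the importance of the equations of Poisson and Laplace, 
we give the expressions for ΔV in the commonest co-ordinate 
systems (cf. Ex. 22, p. 43): 

In rectangular co-ordinates: 

$\Delta V = \frac{\partial^2 V}{\partial x^2} + \frac{\partial^2 V}{\partial y^2} + \frac{\partial^2 V}{\partial z^2}$. (14) 

In cylindrical co-ordinates: 

$\Delta V = \frac{\partial^2 V}{\partial \rho^2} + \frac{1}{\rho}\frac{\partial V}{\partial \rho} + \frac{1}{\rho^2}\frac{\partial^2 V}{\partial \phi^2} + \frac{\partial^2 V}{\partial z^2}$. (15) 

In spherical co-ordinates: 

$\Delta V = \frac{\partial^2 V}{\partial r^2} + \frac{2}{r}\frac{\partial V}{\partial r} + \frac{1}{r^2 \sin^2\theta}\frac{\partial^2 V}{\partial \phi^2} + \frac{1}{r^2}\frac{\partial^2 V}{\partial \theta^2} + \frac{\cot\theta}{r^2}\frac{\partial V}{\partial \theta}$. (16) 

* More accurately, it is the difference in potential between the end points P_0 and 
P_1 of the path. Cf. equation (11). 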
Ex. 68. Is it possible to produce a unidirectional electrostatic field in 
which the magnitude of the field strength increases in a direction normal to E? 

Ex. 69. At the surface of the earth the electrostatic field in the atmosphere 
is 100 volts/m., and at a height of 1500 m. the field strength is 25 volts/m., the 
field being directed toward the centre of the earth. What is the surface charge 
of the earth and what is the mean space charge in the atmosphere between 0 
and 1500 m. altitude? 

Ex. 70. In a gas discharge between parallel plates, the potential is measured 
as a function of the distance from the cathode, using probes. Explain how the 
field strength and the distribution of space charge can be found from these results. 


4. Simple Examples of the Electrostatic Field in a Vacuum (or in Air). 

The problem of calculating an electrostatic field may be considered solved 
when the distribution of the potential has been found, for the field is determined 
from the potential by taking the gradient, i.e. by simple differentiation. The 
problem is to find solutions of the differential equation for the potential which 
are in agreement with the particular conditions of the problem in hand. This 
problem may be solved at once if the distribution of charges in the entire space 
is given. But this is generally not the case. We must take into consideration 
the fact that there are substances — so-called conductors of electricity — within 
which no electrostatic field can be maintained, but upon which the charges may 
move freely. Every surface of a conductor must therefore be a level surface of 
the field, and we must find a solution of ΔV = 0 which satisfies the boundary 
conditions imposed by the positions of the conductors. It is only after this has 
been done that we can determine the distribution of charge by evaluating 
Div grad V. Under these conditions the problem is considerably more difficult 
to solve. 


A. No Conductors in the Field. 

(a) Potential due to point charges 

We begin with the simplest instance of a given distribution of 
charge, viz. the simple point charge. Its field is spherically symmetrical, 
and has already been calculated in § 2 (p. 266). The corresponding 
potential might be set down at once, but we wish to proceed in a 
somewhat more general way. By equation (16) (p. 268), the equation 
for the potential in a space devoid of charge (Laplace's Equation) is, 
for spherical symmetry, 

$\frac{d^2 V}{dr^2} + \frac{2}{r}\frac{dV}{dr} = 0$. (17) 

The integral is 

$V = \frac{A}{r} + B$. (18) 

B is the value of the potential at infinity. We normalize the potential 
in such a way that it vanishes at infinity. This can always be done, 
since any system of charges in the finite region appears to be con- 
centrated at a single point when viewed from infinity. Hence we may 
specify, as a boundary condition of the differential equation, that V 
vanishes at least to the order 1/r at infinity. The constant A is de- 
termined by taking the surface integral over a sphere with centre at 
the point charge. The value thus found by equation (4a) (p. 267) is e, 
while (4b) yields e/4πK_0. 

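If several point charges are given, their fields are superimposed 
without interference, and so the combined potential is the sum of the 
potentials due to the separate charges: 

$V = \sum_i \frac{e_i}{r_i}$, or (in M.K.S. units) $V = \sum_i \frac{e_i}{4\pi K_0 r_i}$. (18a) 

For a continuous distribution of charge in space or upon surfaces the 
sum becomes 

$V = \int \frac{\rho\,d\tau}{r} + \int \frac{\sigma\,dS}{r}$, or $V = \int \frac{\rho\,d\tau}{4\pi K_0 r} + \int \frac{\sigma\,dS}{4\pi K_0 r}$. (19) 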
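The superposition expressed by (18a) translates directly into a short routine. The following sketch works in C.G.S. units and represents each point charge as a pair (charge, position); the function name and the sample configuration are illustrative assumptions.

```python
import math

def potential(charges, point):
    """Potential V = sum of e_i / r_i at the field point (C.G.S. units).

    charges is a list of (e_i, position_i) pairs; r_i is the distance
    from the i-th charge to the field point.
    """
    total = 0.0
    for e, pos in charges:
        total += e / math.dist(pos, point)
    return total

# two equal and opposite charges placed symmetrically about the xy-plane
pair = [(1.0, (0.0, 0.0, 1.0)), (-1.0, (0.0, 0.0, -1.0))]

print(potential(pair, (3.0, 4.0, 0.0)))  # on the plane of symmetry the potential vanishes
print(potential(pair, (0.0, 0.0, 3.0)))  # nearer to the positive charge
print(potential([(2.0, (0.0, 0.0, 0.0))], (0.0, 3.0, 4.0)))  # single charge, r = 5
```

The vanishing of the potential on the plane of symmetry between two equal and opposite charges is used again in the method of electrical images.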



This equation expressing the independent superposition of the fields 
may be derived directly from the differential equation with the help 
of Green's theorem. We give the procedure for the part due to a space 
distribution of charge. Let V and ψ be two scalar point functions 
which are continuous, together with their first and second spatial 
derivatives, in a given region. By applying Gauss's theorem to the 
vectors V grad ψ and ψ grad V in turn, and subtracting, we have * 

$\oint (V \operatorname{grad}\psi) \cdot d\mathbf{S} - \oint (\psi \operatorname{grad} V) \cdot d\mathbf{S} = \int V\,\Delta\psi\,d\tau - \int \psi\,\Delta V\,d\tau$. (20) 

This is Green's theorem. 

* Gauss's theorem (p. 24) is $\oint \mathbf{v} \cdot d\mathbf{S} = \int \operatorname{div} \mathbf{v}\,d\tau$. 
Here v = V grad ψ. By equation (88) (p. 38), 

div (V grad ψ) = (grad ψ)(grad V) + V div grad ψ = (grad ψ)(grad V) + V Δψ. 

Similarly, for the function ψ grad V, 

div (ψ grad V) = (grad ψ)(grad V) + ψ ΔV. 

Substituting in Gauss's theorem and subtracting, we obtain equation (20). 

[Fig. 2] 

Let V satisfy the differential equation ΔV = −4πρ. Let us try 
the effect of taking the function 1/r as the value of ψ, where r is the 
distance of the element of charge ρ dτ from the point P_0 at which we 
wish to evaluate V (cf. fig. 2). Then this function will be discontinuous 
at P_0, and we must remove P_0 from the region of integration. This is 
accomplished by drawing a small sphere of radius a with P_0 as centre. 
The surface of this sphere is a part of the bounding surface of the 
region of integration. According to the convention adopted in the 
proof given of Gauss's theorem, the normals to this surface point out- 
ward from the region of integration, i.e. toward the centre of the sphere. 
The outer boundary is taken to be a sphere of infinite radius. The in- 
tegral over this infinite sphere vanishes: the surface element r² sin θ dθ dφ 
increases as r²; but V varies as 1/r, and so grad V varies as 1/r², hence 
the products V grad ψ and ψ grad V vary as 1/r³, and so the surface 
integrals vanish for r = ∞. Thus there remains the integral over the 
small sphere of radius a, upon which V has the value V_0, the value at 
P_0, in the limit. Denoting by n the unit vector directed toward the 
centre of the sphere, we have 

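$\oint_{\text{sphere}} = \int_0^{2\pi}\!\int_0^{\pi} \left( V \operatorname{grad}\frac{1}{r} - \frac{1}{r}\operatorname{grad} V \right) \cdot \mathbf{n}\, a^2 \sin\theta\, d\theta\, d\phi = 4\pi V_0$, 

since both V and grad V are finite at P_0, and hence the contribution 
from the second term vanishes for a → 0. Then by equation (20) we 
have, since ΔV = −4πρ and Δ(1/r) = 0, 

$V_0 = \int \frac{\rho\,d\tau}{r}$. 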

In a similar manner we obtain the contribution of the surface charges 
by enclosing the surfaces of discontinuity in closely fitting surfaces 
which separate them from the remainder of the region. 


(b) Potential due to a sphere of radius a uniformly filled with 
space charge of density ρ 

We must distinguish here between the external space, where the 
equation ΔV = 0 holds, and the interior, where ΔV = −4πρ. For 
the exterior, the spherically symmetrical solution of ΔV = 0, viz. 

$V = \frac{A}{r}$, 

is valid. Using $\oint \mathbf{E} \cdot d\mathbf{S} = 4\pi e$, A is found to be $\frac{4\pi}{3}a^3\rho$, so that 

$V_e = \frac{4\pi a^3 \rho}{3r}$ [or, in M.K.S. units, $V_e = \frac{a^3 \rho}{3K_0 r}$]. (23) 

For an external point, the effect of the sphere is the same as if the 
entire charge were concentrated at the centre. This must also 
hold for r = a: 

$V_a = \frac{4\pi a^2 \rho}{3}$. (23') 
In seeking a solution for the interior, which again can depend only 
upon r, it must be remembered that the additive constant in the 
potential has already been determined by the solution for the exterior; 
the two solutions must fit at r = a, since a discontinuity in V would 
mean infinitely great field strength. The field strength in the interior 
is again determined by the flux of force, by equation (4a), p. 267, to be 

$E = \frac{4\pi}{3}\rho r$. 

The potential is obtained with the proper constant if we start with 
the equation 

$V_r - V_a = \int_r^a \mathbf{E} \cdot d\mathbf{s} = \int_r^a \frac{4\pi\rho}{3}\, r\, dr = \frac{2\pi\rho}{3}(a^2 - r^2)$, (r < a). 

By (23'), this gives 

$V_i = 2\pi a^2 \rho - \frac{2\pi}{3}\rho r^2$. 

It may be noticed that the part of V_i which depends on r has the 
same form, except for sign, as the potential of the force occurring in 
simple harmonic motion (Ex. 35, p. 88). If we assume a negative 
charge at P, then the algebraic sign is also the same, and this charge 
will execute harmonic vibrations in the interior of the sphere. 
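The interior and exterior solutions can be combined into a single function of r and checked for the two properties used in the derivation: the potential is continuous at r = a, and the field strength is everywhere the negative derivative of the potential. The sketch below works in C.G.S. units; the function names and the numerical values of a and ρ are chosen here for illustration.

```python
import math

def sphere_potential(r, a, rho):
    """Potential of a sphere of radius a with uniform space charge rho (C.G.S.)."""
    if r >= a:
        # exterior: as if the whole charge were concentrated at the centre
        return 4.0 * math.pi * a ** 3 * rho / (3.0 * r)
    # interior: 2 pi a^2 rho - (2 pi / 3) rho r^2
    return 2.0 * math.pi * a * a * rho - (2.0 * math.pi / 3.0) * rho * r * r

def sphere_field(r, a, rho):
    """Radial field strength obtained from the flux through a concentric sphere."""
    if r >= a:
        return 4.0 * math.pi * a ** 3 * rho / (3.0 * r * r)
    return 4.0 * math.pi * rho * r / 3.0

a, rho = 2.0, 0.5
for r in (0.5, 1.5, 2.0, 3.0):
    print(r, sphere_potential(r, a, rho), sphere_field(r, a, rho))
```

Inside the sphere the field strength grows in proportion to r, which is the linear restoring law responsible for the harmonic vibrations mentioned above.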


(c) Potential due to a dipole 

A dipole or doublet is a system consisting of a negative charge 
−e and an equal positive charge +e separated by the elementary 
segment ds. The product e ds is assumed to have a finite value m 
called the moment or strength of the dipole, i.e. if the distance apart 
ds is made smaller, the charge e is to increase in such way that the 
product e ds remains finite. As follows from its definition, the dipole 
moment is a vector. If we denote the distance of any point P from 
the negative charge by r_− and that from the positive charge by r_+, 
we have 

$V = e\left(\frac{1}{r_+} - \frac{1}{r_-}\right)$. 

The change in the point function 1/r is obtained by taking the gradient. 
It must be noted, however, that in taking the gradient the “field point” 
P is moved, while the equipotential surfaces remain fixed. Thus we are not 
permitted to move the charge. Writing simply r for r_− in the difference, we 
obtain the following expression by moving the field point (cf. fig. 3): 

$\frac{1}{r_+} - \frac{1}{r_-} = -d\mathbf{s} \cdot \operatorname{grad}\frac{1}{r} = \frac{d\mathbf{s} \cdot \mathbf{r}}{r^3}$, (27) 

whence 

$V = \frac{e\,d\mathbf{s} \cdot \mathbf{r}}{r^3} = \frac{\mathbf{m} \cdot \mathbf{r}}{r^3} = \frac{|\mathbf{m}|}{r^2}\cos(\mathbf{m},\mathbf{r})$, or $V = \frac{\mathbf{m} \cdot \mathbf{r}}{4\pi K_0 r^3}$. (28) 

Thus the potential of a dipole decreases as 1/r² and depends on the 
angle which the radius vector makes with the axis of the doublet. 
The field is therefore no longer spherically symmetrical, but is rota- 
tionally symmetrical about the dipole axis. As in all the formulae 
occurring here, r is directed toward the field point. 
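The passage from two separate charges to the dipole formula can be illustrated numerically: for a small separation ds and a correspondingly large charge e, the exact potential of the pair should agree closely with m·r/r³. The sketch below, in C.G.S. units, places the negative charge at the origin and the positive charge at a distance ds along the z-axis; these placements, the function names and the sample values are assumptions made for the illustration.

```python
import math

def two_charge_potential(e, ds, point):
    """Exact potential of -e at the origin and +e at (0, 0, ds)."""
    r_minus = math.dist(point, (0.0, 0.0, 0.0))
    r_plus = math.dist(point, (0.0, 0.0, ds))
    return e * (1.0 / r_plus - 1.0 / r_minus)

def dipole_potential(m, point):
    """Dipole potential m . r / r^3 for a moment of magnitude m along +z."""
    x, y, z = point
    r = math.sqrt(x * x + y * y + z * z)
    return m * z / r ** 3

e, ds = 1000.0, 1.0e-3  # moment m = e * ds = 1
P = (3.0, 0.0, 4.0)     # field point at r = 5

exact = two_charge_potential(e, ds, P)
approx = dipole_potential(e * ds, P)
print(exact, approx)

# the potential falls off as 1/r^2: doubling r reduces V by a factor of four
print(dipole_potential(1.0, (6.0, 0.0, 8.0)) / approx)
```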


(d) Potential of a double layer 

A double layer is a surface covered with dipoles, each having its 
axis in the direction of the normal to the surface. All dipoles are 
assumed to be oriented so that one kind of pole is on one side of the 
surface. Let the dipole moment per square centimetre be m. The 
potential at a field point P is, by (28), 

$V = \int \frac{|\mathbf{m}|\cos(\mathbf{n},\mathbf{r})}{r^2}\,dS$. (29) 

If the layer is of uniform density (|m| constant), 

$V = |\mathbf{m}| \int \frac{\cos(\mathbf{n},\mathbf{r})\,dS}{r^2}$. (29') 

The solid angle dΩ subtended by a surface element dS as seen from P 
is defined as the area cut out on a unit sphere drawn about P by the 
cone determined by dS, and this has exactly the same magnitude as 
the integrand of (29'). Hence 

$V = |\mathbf{m}|\,\Omega$. (29'') 

By the convention regarding the direction of r, V is positive 
if P faces the positive side of the surface. Up to now we have con- 
sidered only discontinuities in the field strength, but for a double layer the 
potential itself suffers a discontinuity, and so the field would become 
infinite in the layer if the latter really were infinitely thin. This, of 
course, never happens in practice. 

If next we consider a closed double layer, say in the form of a sphere, 
and if elementary cones are drawn from an external point, each one must 
cut the surface twice, and cos (nr) will 
have opposite sign at the two places (n, as usual, is the outward normal). 
If |m| is constant, the contributions nullify each other in pairs, so 
that outside the surface the potential is zero. For an internal point 
each nappe of an elementary cone cuts the surface only once; on in- 
tegrating, the whole surface of the unit sphere is covered. The sign of 
the contribution is negative, however, since n and r are opposite in 
direction. Then 

$V_e - V_i = 4\pi|\mathbf{m}|$, (30) 

or using the surface gradient, this may be written 

$\operatorname{Grad} V = 4\pi\mathbf{m}$, or $K_0 \operatorname{Grad} V = \mathbf{m}$. (30') 

This equation holds also for unclosed surfaces. Any unclosed surface 
may be considered changed to a closed one by supplying an additional 
portion. The potential of the added part suffers no discontinuity in 
passing through the original surface, and so contributes nothing to 
(30'). Double layers arise, for example, when a metallic plate is dipped 
into a salt solution. The term “ electromotive force ” as used to denote 
the corresponding discontinuity of potential is obsolescent. 
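The relation between the potential of a uniform double layer and the solid angle, and the jump of 4π|m| on passing through the layer, can both be seen for a plane circular disc. The sketch below (C.G.S. units) integrates cos(nr) dS/r² numerically for a field point on the axis of a disc of radius R and compares the result with the elementary closed form for the solid angle of a disc seen from its axis, 2π(1 − |z|/√(R² + z²)). The choice of a disc, the function names and the closed form are introduced here for illustration and are not part of the text.

```python
import math

def double_layer_potential_on_axis(m, R, z, n=20000):
    """Midpoint-rule estimate of |m| * integral of cos(n, r) dS / r^2 over a
    disc of radius R in the xy-plane (normal along +z), at height z on the axis."""
    ds = R / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * ds                 # radius of the ring element
        dist = math.sqrt(s * s + z * z)    # distance from ring to field point
        cos_nr = z / dist                  # r is directed toward the field point
        total += cos_nr / (dist * dist) * 2.0 * math.pi * s * ds
    return m * total

def solid_angle_formula(m, R, z):
    """|m| times the signed solid angle subtended by the disc at height z."""
    omega = 2.0 * math.pi * (1.0 - abs(z) / math.sqrt(R * R + z * z))
    return m * math.copysign(omega, z)

m, R = 1.0, 1.0
print(double_layer_potential_on_axis(m, R, 0.5), solid_angle_formula(m, R, 0.5))

# discontinuity of the potential on passing through the layer
jump = solid_angle_formula(m, R, 1e-6) - solid_angle_formula(m, R, -1e-6)
print(jump, 4.0 * math.pi * m)
```

The jump is the same for an unclosed surface such as the disc as for a closed one, in agreement with the remark following (30').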

B. Conductors present in the Field. 

(a) Spherical condenser 

Let there be given two concentric conducting spheres of radii 
a and b (b > a). Assume a charge e to be placed upon the inner sphere. 
The outer sphere is to be held at a potential V_b, say by means of a 
conductor connecting it to earth. We may call this potential zero. 
Naturally, the spherically symmetrical solution of the potential equa- 
tion is again valid here, but the constants have values different from 
the previous ones. We have 

$V = \frac{A}{r} + B$. 

By means of the total flux, the value of A is again found to be e; the 
value of B is found by imposing the condition that V = V_b for r = b, 
whence 

$V = \frac{e}{r} - \frac{e}{b} + V_b$, or $V = \frac{e}{4\pi K_0}\left(\frac{1}{r} - \frac{1}{b}\right) + V_b$. 

Hence for the inner sphere, 

$V_a = V_b + e\left(\frac{1}{a} - \frac{1}{b}\right)$, or $V_a = V_b + \frac{e}{4\pi K_0}\left(\frac{1}{a} - \frac{1}{b}\right)$. 

Now the electrical capacity of a system is defined as the ratio of the 
charge to the potential difference of the bounding conductors. Hence, 
for the spherical condenser, 

$C = \frac{e}{V_a - V_b} = \frac{ab}{b - a}$, or $C = \frac{4\pi K_0\,ab}{b - a}$. (33) 

If the radius of the outer sphere is allowed to become infinite we obtain 

$C = a$, or $C = 4\pi K_0 a$. (34) 

That is, the capacity of a single isolated sphere, expressed in electro- 
static units, is equal to its radius in centimetres. In the M.K.S. system, 
the capacity of a sphere of 1 cm. radius turns out to be 1.11 × 10^-12 
farad, the farad being the practical unit of capacity. 
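The figures quoted for the capacity are easily reproduced. The sketch below evaluates (33) and (34) in both systems of units, using the value of K_0 given in Chap. XI; the function names are illustrative.

```python
import math

K0 = 8.859e-12  # permittivity of free space as quoted in the text

def spherical_capacity_cgs(a, b):
    """Capacity of a spherical condenser in e.s.u.; radii in centimetres, b > a."""
    return a * b / (b - a)

def spherical_capacity_mks(a, b):
    """Capacity of a spherical condenser in farads; radii in metres, b > a."""
    return 4.0 * math.pi * K0 * a * b / (b - a)

def isolated_sphere_capacity_mks(a):
    """Limit of an infinitely distant outer sphere; radius in metres."""
    return 4.0 * math.pi * K0 * a

print(spherical_capacity_cgs(1.0, 2.0))       # inner radius 1 cm, outer 2 cm
print(spherical_capacity_cgs(1.0, 1.0e9))     # approaches the radius a = 1 cm
print(isolated_sphere_capacity_mks(0.01))     # sphere of 1 cm radius, in farads
```

Bringing the outer sphere closer to the inner one increases the capacity above that of the isolated sphere.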


(b) Point charge near an infinite plane conductor at zero 
potential. Electrical images 

Let the conductor be the xy-plane (fig. 4), and let the point 
charge be at z = a. In the space above the plate, the potential satis- 
fies the condition that V vanishes on the xy-plane, and that for every 
surface enclosing the point charge, 

$\oint (\operatorname{grad} V) \cdot d\mathbf{S} = -4\pi e$. 

In the space below the plate, it is required that V vanish at the xy-plane, 
and also that the integral $\oint (\operatorname{grad} V) \cdot d\mathbf{S}$ vanish for every closed 
surface, since there is no charge in this region. These requirements 
are fulfilled if we take V = 0 at all points below the plane. In the 
upper space V must also vanish at the xy-plane, but $\oint \mathbf{E} \cdot d\mathbf{S}$ must be 
equal to 4πe for every closed surface round the charge. In this case 
the solution may be given readily; viz. if we place a charge −e at 
the mirror image of the original charge, the potential due to these two 
charges satisfies the required conditions in the upper space. Hence 

$V = e\left(\frac{1}{r} - \frac{1}{r'}\right)$. (35) 

This expression, which is a sum of potentials due to point charges, is 
certainly a solution of the differential equation for the potential. 
Moreover, for the xy-plane, r = r', and so V vanishes there. Finally, 
closed surfaces drawn in the upper space must either contain e or else 
contain no charge whatsoever, since the charge −e lies below the con- 
ducting plane. All lines of force originating on e terminate on the 
xy-plane. Since there is no field below this plane, the field strength 
is discontinuous at the xy-plane, and so a charge must be present on 
this plane. In order to determine this charge we first calculate the 
field strength at the surface of the conducting plane: 

$\mathbf{E} = -\operatorname{grad} V = \frac{e}{r^2}\mathbf{r}_0 - \frac{e}{r'^2}\mathbf{r}'_0$, (36) 

where r_0 and r'_0 are unit vectors directed toward the field point from 
e and from −e respectively. 
Since r = r' at the surface of separation, the resultant is a vector E_0, 
normal to the xy-plane, having the negative z-direction and of mag- 
nitude (see fig. 4) 

$|\mathbf{E}_0| = \frac{2ea}{r^3}$. 

Hence by equation (7a) (p. 267), the surface density of charge is 

$\sigma = -\frac{ea}{2\pi r^3}$. (37) 

By computing the integral $\int \sigma\,dS$ we may readily verify that the total 
charge induced on the conducting plane is equal to −e, corresponding 
to the fact that the entire flux emanating from +e terminates on the 
conductor. 
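The verification mentioned in the last sentence may be carried out numerically. In the sketch below the surface density (37) is integrated over the whole plane in rings of radius s, with r² = s² + a²; the infinite range of s is mapped onto a finite interval by the substitution s = a tan u. The substitution, the function names and the sample values are choices made for this illustration.

```python
import math

def surface_density(e, a, s):
    """Induced surface density (37) at distance s from the foot of the
    perpendicular dropped from the charge onto the plane (C.G.S. units)."""
    r = math.sqrt(s * s + a * a)
    return -e * a / (2.0 * math.pi * r ** 3)

def induced_charge(e, a, n=20000):
    """Integral of sigma over the infinite plane, taken in rings 2 pi s ds,
    using s = a tan(u) with u running from 0 to pi/2 (midpoint rule)."""
    du = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * du
        s = a * math.tan(u)
        ds_du = a / math.cos(u) ** 2
        total += surface_density(e, a, s) * 2.0 * math.pi * s * ds_du * du
    return total

print(induced_charge(2.0, 0.7))   # close to -2.0, i.e. equal to -e
print(induced_charge(2.0, 5.0))   # independent of the distance a
```

The total induced charge does not depend on the distance a of the point charge from the plane; only its distribution over the plane changes.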


Ex. 71. Starting from first principles, compute the force of attraction between 
the two plates of a parallel-plate condenser, each carrying a charge of magnitude 
e. (Make use of the lines-of-force picture and neglect disturbances due to the 
edges of the conductors.) 

Ex. 72. Following the method used for a spherical condenser, calculate the 
capacity of a cylindrical condenser consisting of two coaxial cylinders of length 
l and radii a and b (b > a). Find also the capacity of a parallel-plate condenser 
consisting of two equal plates of area S placed at a distance d apart. Neglect 
edge effects. 

Ex. 73. A point charge e external to a conducting sphere is at a distance 
R from the centre. The sphere is held at zero potential. Find the potential at 
any point in the resulting field. (As in the case of the plane, the effect of the 
sphere may be replaced by that of a virtual charge within it.) 

Ex. 74. What is the nature of the equipotential surfaces at a singular point 
in the field where E = 0? Take the origin of a co-ordinate system at this point 
and develop the potential in a power series in the co-ordinates. Consider especially 
the case of rotational symmetry. 



CHAPTER XII 


The Electrostatic Field in Dielectric Media 


1. Formal Introduction of the Concepts “Dielectric Displacement” and 
“Free Charge”. Boundary Conditions at the Surface of Separation 
of Two Dielectrics. 

We start with the following experimental arrangement. A parallel 
plate condenser is connected to a quadrant electrometer which indi- 
cates the potential difference between the plates. First, let the sub- 
stance between the plates be air. If we now introduce into the space 
between the conductors a slab of insulating material, such as a flat 
piece of hard rubber or a glass plate, the potential difference will be 
observed to decrease: upon removing the slab, the potential will 
return to its former value. According to our definition of capacity, 
this quantity has been increased by introducing the insulator. Since 
the introduction of the insulating material could not have altered the 
charge on the conductors, we are led to conclude that the connexion 
between field strength and charge, found in Chap. XI, § 2 (p. 265), must 
be modified if the space in which the field operates is filled with a 
medium other than air. Indeed, we find even a small effect due to 
the air if we evacuate the space between the plates, but this effect is 
relatively so small that it may generally be neglected. However, we 
shall take the results of Chap. XI to be strictly valid only for a vacuum.* 
The ratio of the field strength in vacuum to that in an insulator 
(or dielectric), for the same distribution of charge, is called the specific 
inductive capacity or dielectric constant of the medium, i.e. 


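$$ \mathbf{E}_v = K\,\mathbf{E}. \tag{1} $$

We now introduce a new vector D, the dielectric displacement vector.
The same relationships are to hold for this vector as for the vector E
in a vacuum, viz.

$$ \oint D_n\,dS = 4\pi e, \qquad \operatorname{div}\mathbf{D} = 4\pi\rho, \qquad \operatorname{Div}\mathbf{D} = 4\pi\sigma, \tag{2} $$

while in a vacuum,

$$ \oint E_{v,n}\,dS = 4\pi e, \qquad \operatorname{div}\mathbf{E}_v = 4\pi\rho, \qquad \operatorname{Div}\mathbf{E}_v = 4\pi\sigma. $$

In the M.K.S. system, the factor $1/K_0$ appears in place of $4\pi$ on the
right side of the equations.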

* The dielectric constants of a vacuum and of air differ by only a small fraction of 1 per cent.
On account of (1), the connexion between D and E is given by

D = KE (3) 

This simple relation between D and E is valid only for isotropic substances,
i.e. amorphous materials and regular crystals. For certain anisotropic dielectrics,
generally of crystalline structure, the relation (1) must be modified. In such
substances the field strength E is a linear function of the vacuum field strength,
i.e. the dielectric constant is no longer a scalar, but a tensor. We shall, however,
restrict our considerations at present to isotropic substances.

On the other hand, the field within a dielectric (which actually 
may be measured, say by the force on a small test body) may be con- 
sidered to be the field of certain fictitious charges which would produce 
the same field in a vacuum. Such charges are called “ free ” or 
“ apparent ” charges. These charges, which we shall denote by accents, 
are given by the equations 

$$ \oint E_n\,dS = 4\pi e', \qquad \operatorname{div}\mathbf{E} = 4\pi\rho', \qquad \operatorname{Div}\mathbf{E} = 4\pi\sigma'. \tag{4} $$


True charges occur either as space charges or as charges on the surface 
of conductors. On the other hand, free charges are found also on the 
surface of separation of two dielectrics, as we shall see at once. Since 
no true charges exist at the interface of two dielectrics, we have 

$$ \operatorname{Div}\mathbf{D} = K_1 E_n^{(1)} - K_2 E_n^{(2)} = 0. \tag{5} $$


But this means that Div E is different from zero, since it is only the
products of the normal components of E by the respective K's which
are equal. Hence the surface of separation of two dielectrics is the seat
of apparent field-producing charges. To secure information regarding
the tangential components of E we make use of the empirical fact that
the electrostatic field is everywhere irrotational, even if there are several
contiguous dielectrics in the region. If, then, we carry a unit
charge over a closed path drawn near the surface of separation of two
dielectrics, and with one branch on each side of this surface (fig. 1),
the line integral must vanish. This means, however, that the tangential
components of E, which alone contribute to the integral, must
be equal on the two sides:

$$ E_{\tan}^{(1)} = E_{\tan}^{(2)}. \tag{6} $$

Fig. 1


This equation may be written also in the form 

$$ [\mathbf{n}\,(\mathbf{E}_2 - \mathbf{E}_1)] = \operatorname{Curl}\mathbf{E} = 0. \tag{6'} $$

The continuity of the tangential component of the electric field strength
at the surface of separation of two dielectric media is not limited to
electrostatic fields, but is a general property of the electric field, and
will be justified later (p. 344) in a much more general way.

The two boundary conditions (5) and (6) furnish us with the law of
refraction of lines of force. If we consider a plane (fig. 2) determined
by the tangent to a line of force in the first medium and the normal
to the interface, it follows from (6) that $\mathbf{E}_2$ must also lie in this
plane. Further, if $\phi$ is the angle between the tangent to the line
of force and the normal to the interface, it follows from Div (KE) = 0
that

$$ K_1 E_1 \cos\phi_1 = K_2 E_2 \cos\phi_2, $$

and from Curl E = 0 that

$$ E_1 \sin\phi_1 = E_2 \sin\phi_2. $$

Fig. 2

Dividing one equation by the other gives the law of refraction for
lines of electric force:

$$ \frac{\tan\phi_1}{\tan\phi_2} = \frac{K_1}{K_2}. \tag{7} $$
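The law of refraction (7) can be checked numerically. The following Python sketch is an illustration only: the function names and the sample values (incidence at 30 degrees, $K_1 = 1$, $K_2 = 4$) are assumptions, not taken from the text. It computes $\phi_2$ from (7) and the ratio $E_2/E_1$ from the normal condition, so that the tangential condition can be verified independently.

```python
import math

def refracted_angle(phi1, k1, k2):
    """Angle phi2 (radians) from tan(phi1)/tan(phi2) = K1/K2, for 0 <= phi1 < pi/2."""
    return math.atan(math.tan(phi1) * k2 / k1)

def field_strength_ratio(phi1, k1, k2):
    """E2/E1 from the normal condition K1*E1*cos(phi1) = K2*E2*cos(phi2)."""
    phi2 = refracted_angle(phi1, k1, k2)
    return k1 * math.cos(phi1) / (k2 * math.cos(phi2))

# Illustrative values (assumed, not from the text)
phi1 = math.radians(30.0)
k1, k2 = 1.0, 4.0
phi2 = refracted_angle(phi1, k1, k2)
ratio = field_strength_ratio(phi1, k1, k2)

# Tangential condition E1*sin(phi1) = E2*sin(phi2), written per unit E1
tangential_mismatch = math.sin(phi1) - ratio * math.sin(phi2)
print(math.degrees(phi2), ratio, tangential_mismatch)
```

On entering the medium of larger dielectric constant the lines of force are bent away from the normal, and the tangential mismatch printed last should vanish to rounding error.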




2. Polarization of Dielectrics. 

We begin with a homogeneous dielectric medium in which a 

uniform field E exists. The density 
of the field lines of the vector D 
is K times that of the field lines 
of E. We now take a small cylin- 
der whose generators are parallel to 
E and remove the contained material 
of dielectric constant K, at the same 
time placing charges on the bases of 
the cylinder in such a way that the 
field is unaltered. Inside the cylin- 
der we now have D = E, while out- 
side this region we still have D = KE. 

Since E is to have the same value at all points, lines of displacement
originate or terminate on the ends of the cylinder. Thus the ends carry
true charges whose surface density may be calculated by equation (2)
(p. 277). On the upper surface in fig. 3

$$ \sigma_+ = \frac{1}{4\pi}\bigl(K|\mathbf{E}| - |\mathbf{E}|\bigr) = \frac{K-1}{4\pi}\,|\mathbf{E}|, $$

Fig. 3

while on the lower surface,

$$ \sigma_- = -\frac{K-1}{4\pi}\,|\mathbf{E}|. $$

Hence the volume element $d\tau = S\,dl$ which was removed has a dipole
moment amounting to

$$ d\mathbf{P} = \frac{K-1}{4\pi}\,\mathbf{E}\,S\,dl = \frac{K-1}{4\pi}\,\mathbf{E}\,d\tau. \tag{8} $$

The dipole moment per unit volume is called the polarization P:

$$ \mathbf{P} = \frac{K-1}{4\pi}\,\mathbf{E} = \kappa\,\mathbf{E}, \tag{9} $$

and we thus have

$$ \mathbf{D} = K\mathbf{E} = (1 + 4\pi\kappa)\,\mathbf{E} = \mathbf{E} + 4\pi\mathbf{P}. \tag{10} $$

The newly-introduced constant of the material, $\kappa$, which, by (9),
stands in simple relation to K, is called the electrical susceptibility of
the medium. In this manner we can replace all elements of volume of
the dielectric by dipoles of moment $\kappa\mathbf{E}\,d\tau$.
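Relations (9) and (10) can be written as a few lines of Python. This is a minimal sketch in Gaussian (C.G.S.) units; the function names and sample numbers are assumptions made for illustration. It confirms that $\mathbf{E} + 4\pi\mathbf{P}$ reproduces $K\mathbf{E}$.

```python
import math

def susceptibility(K):
    """Electrical susceptibility kappa = (K - 1) / (4*pi), equation (9)."""
    return (K - 1.0) / (4.0 * math.pi)

def polarization(K, E):
    """Dipole moment per unit volume, P = kappa * E."""
    return susceptibility(K) * E

def displacement(K, E):
    """D = E + 4*pi*P, equation (10)."""
    return E + 4.0 * math.pi * polarization(K, E)

# Illustrative values (assumed): K = 5, field strength 2 in e.s.u.
K, E = 5.0, 2.0
print(susceptibility(K), polarization(K, E), displacement(K, E))
```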

The atomistic interpretation of the dielectric constant indicates that the 
vector P is the truly physical quantity, while D appears merely as an auxiliary 
mathematical quantity. As will be explained in detail later (p. 450), a resultant
dipole moment arises in two ways in a volume element containing a large number 
of atoms or molecules. If the elementary particles are themselves dipoles (mole- 
cules with resultant positive and negative constituents), the random nature of 
their orientations makes the resultant moment zero, provided no external field 
acts. An applied field tends to bring the dipoles into its own direction, while 
thermal agitation tends to destroy this orderly arrangement. The greater the 
field, at a given temperature, the greater the resultant moment in the direction 
of the field. On the other hand, it is clear that for a given external field, this 
resultant will become smaller as the temperature is raised. This type of sus- 
ceptibility, in analogy with the more familiar corresponding magnetic quantity, 
is called “para-electric”. If, on the other hand, the elementary particles have
normally no dipole moment, the field will produce one by distorting the electron 
shells. Of course this effect, which is independent of the temperature, occurs 
also in the case of para-electric susceptibility, but is masked by it on account of 
its smallness. If the entire dipole moment of the particle is thus produced by the 
applied field, we speak of “ dia-electric ” susceptibility. 

If, in addition to the distribution of the true charges, we know
the polarization as a function of position, then the electrostatic potential
may be found by integration over the volume and surface elements.
Since according to equation (28) (p. 273), the potential of a dipole of
moment $\mathbf{P}\,d\tau$ is given by $V = (\mathbf{P}\cdot\mathbf{r}/r^3)\,d\tau$,
the complete potential when dielectrics are present is given by

$$ V = \int \frac{\rho}{r}\,d\tau + \int \frac{\sigma}{r}\,dS + \int \frac{\mathbf{P}\cdot\mathbf{r}}{r^3}\,d\tau. \tag{11} $$

* In the M.K.S. system $\kappa = K - 1$ and $\mathbf{D} = K_0\mathbf{E} + \mathbf{P}$, but all tables of $\kappa$ are given
in terms of the definition (9).



The volume integral contributed by the dipoles of the polarized 
dielectric may be transformed into a surface integral as long as we 
are dealing with homogeneous bodies of uniform dielectric constant. 
Here divP vanishes when no volume charges are present, according 
to equation (3) (p. 278) and equation (10) (p. 280). We therefore 
have 

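$$ \oint \frac{P_n}{r}\,dS = \int \frac{1}{r}\operatorname{div}\mathbf{P}\,d\tau + \int \mathbf{P}\cdot\operatorname{grad}_s\frac{1}{r}\,d\tau = \int \frac{\mathbf{P}\cdot\mathbf{r}}{r^3}\,d\tau. \tag{12} $$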


The field point is fixed during the integration, and the integration
is extended over the several volume elements of the region. Consequently,
in taking the gradient, the field point is to be held fixed,
while the source points are moved. This is indicated in (12)
by the subscript s. According to fig. 4, the gradient of r so obtained
is equal and opposite to that obtained by moving the field point.
We denote the former gradient by the subscript s, the latter by the
subscript f. As before, the vector extending from the charge to the
field point is denoted by r.*

Fig. 4 (the gradients $\operatorname{grad}_s r$ and $\operatorname{grad}_f r$)

The surface integral in equation (12) can also be given another
interpretation. If an insulator is adjacent to a vacuum (P = 0), then

$$ P_n\,dS = -\operatorname{Div}\mathbf{P}\,dS, $$

and by equation (10) (p. 280), since Div D = 0,

$$ P_n\,dS = \frac{\operatorname{Div}\mathbf{E}}{4\pi}\,dS = \sigma'\,dS. \tag{13} $$

Hence there are free charges on the surface of the dielectric as a result
of the polarization there, and we may imagine the added field to have
its origin in these charges.

* Thus, if the source point has the co-ordinates $\xi, \eta, \zeta$ and the field point
has the co-ordinates x, y, z, we have

$$ r^2 = (x - \xi)^2 + (y - \eta)^2 + (z - \zeta)^2, $$

so that $\partial r/\partial x = -\partial r/\partial\xi$, &c.

3. Simple Examples of the Electrostatic Field in Dielectrics.

Equation (11) gives the potential, from which the field may be 
derived, if the distributions of the charges and of the polarization are 
known. But usually the polarization is no more known as a function 
of position than is the distribution of the charges (cf. p. 268). Since 
P = kE, we must know the field strength E, which has been altered 
on account of the presence of the dielectric. The problem admits of 
exact solution in only a few cases. In the absence of volume charges,
we are concerned with finding a solution of $\Delta V = 0$ which satisfies
the conditions that (1) V has constant values on the surfaces of the
conductors, and (2) at the surface of separation of two dielectrics,
Curl E = 0 (continuity of the tangential components), and Div (KE) = 0
(prescribed discontinuity of the normal components of E). As an
example of the problems permitting of exact solution we consider the
distortion of an initially uniform field $\mathbf{E}_0$ caused by the introduction
of a dielectric sphere. We make the obvious assumption that the
polarization within the sphere is homogeneous and parallel to $\mathbf{E}_0$,
but we do not yet attempt to determine the magnitude of P. We
can show that, by suitable choice of the value of P, the added field
arising from the dipoles of the polarized sphere, together with the
original field, can be made to satisfy the boundary conditions. We
may imagine the existence of the field of the uniformly polarized
sphere to be due to a sphere homogeneously filled with charge of
density $-\rho$ and another filled with charge $+\rho$, the centres of the two
being separated by ds. In order that every element of volume may
have the dipole moment $\mathbf{P}\,d\tau$, we must have

$$ \mathbf{P}\,d\tau = \rho\,d\tau\,d\mathbf{s}, \qquad\text{i.e.}\qquad d\mathbf{s} = \frac{\mathbf{P}}{\rho}. \tag{14} $$

Outside the sphere (r > a) the added potential $V_s$ is, by p. 261,

$$ V_s^{(e)} = -\frac{4\pi a^3}{3}\,\frac{\rho}{r_-} + \frac{4\pi a^3}{3}\,\frac{\rho}{r_+} = \frac{4\pi a^3}{3}\,\rho\,\frac{d\mathbf{s}\cdot\mathbf{r}}{r^3} = \frac{4\pi a^3}{3}\,\frac{\mathbf{P}\cdot\mathbf{r}}{r^3}, \tag{15} $$

where $r_-$ and $r_+$ denote the distances of the field point from the
centres of the negative and positive spheres respectively.


Since grad (P·r) is equal to P when the vector P is constant,* the total
field in the external space is

$$ \mathbf{E}^{(e)} = \mathbf{E}_0 - \operatorname{grad} V_s^{(e)} = \mathbf{E}_0 - \frac{4\pi a^3}{3r^3}\,\mathbf{P} + \frac{4\pi a^3\,(\mathbf{P}\cdot\mathbf{r})}{r^5}\,\mathbf{r}. \tag{16} $$

* This may be seen intuitively from the meaning of the gradient, or formally
from $\mathbf{P}\cdot\mathbf{r} = P_x x + P_y y + P_z z = u$ by calculating the components $\partial u/\partial x$, &c., or by
taking the x-axis parallel to P.

The field is therefore no longer constant in direction, for there is a
component directed outward along the radius vector from the centre
of the sphere (fig. 5).

Inside the sphere, the added potential is, by equation (25) (p. 272),

$$ V_s^{(i)} = -2\pi\rho\left(a^2 - \frac{r_-^2}{3}\right) + 2\pi\rho\left(a^2 - \frac{r_+^2}{3}\right) = \frac{4\pi}{3}\,\rho\,d\mathbf{s}\cdot\mathbf{r} = \frac{4\pi}{3}\,\mathbf{P}\cdot\mathbf{r}, \tag{17} $$

whence

$$ \mathbf{E}^{(i)} = \mathbf{E}_0 - \frac{4\pi}{3}\,\mathbf{P}. \tag{18} $$


Provided that our solution satisfies the boundary conditions, the internal
field remains homogeneous, since P is assumed to have the direction
of $\mathbf{E}_0$. Thus we have in addition to the applied field $\mathbf{E}_0$ another field
of strength $\tfrac{4\pi}{3}P$, oppositely directed, whose source is the system of
free surface charges connected with this field by equation (13) (p. 281):

$$ \sigma' = -\operatorname{Div}\mathbf{P}. \tag{19} $$

In the analogous magnetic case, one speaks of the demagnetizing effect
of the boundary surfaces, and the numerical factor in (18), $4\pi/3$ for
the sphere, is called the demagnetization factor for this particular
shape.

We have the magnitude of P at our disposal in order to satisfy
the boundary conditions. Denoting the angle between r and $\mathbf{E}_0$
(fig. 5) by $\theta$, we have for the tangential components at r = a

$$ E_{\tan}^{(e)} = E_0\sin\theta - \frac{4\pi}{3}\,P\sin\theta = E_{\tan}^{(i)} = E_0\sin\theta - \frac{4\pi}{3}\,P\sin\theta. \tag{20} $$

The condition for the tangential components is thus automatically
fulfilled. For the normal components, the equation to be satisfied is

$$ K_e E_n^{(e)} = K_e E_0\cos\theta - K_e\,\frac{4\pi}{3}\,P\cos\theta + K_e\,4\pi P\cos\theta = K_i E_n^{(i)} = K_i E_0\cos\theta - \frac{4\pi}{3}\,K_i P\cos\theta. \tag{21} $$

For the present case, where $K_e = 1$, $K_i = K$, this gives the following
relation, which may be written at once in vector form since $\mathbf{E}_0$ and P
have the same direction:

$$ \mathbf{P} = \frac{3}{4\pi}\,\frac{K-1}{K+2}\,\mathbf{E}_0. \tag{22} $$

By substituting this value in equations (16) and (18), we obtain the
field strengths within and exterior to the sphere. The former is

$$ \mathbf{E}^{(i)} = \frac{3}{K+2}\,\mathbf{E}_0. $$

For infinitely large dielectric constant K, $\mathbf{E}^{(i)}$ vanishes. In electrostatics
this is equivalent to the case of a conducting sphere. Hence
putting $K = \infty$ in equation (22) gives the polarization of an insulated
conducting sphere placed in a uniform field $\mathbf{E}_0$:

$$ \mathbf{P} = \frac{3}{4\pi}\,\mathbf{E}_0. $$

The total dipole moment of such a sphere is then

$$ \frac{4\pi a^3}{3}\,\mathbf{P} = a^3\,\mathbf{E}_0. $$

The polarizability, i.e. the ratio of dipole moment to field strength, is
thus equal to the cube of the radius.
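The boundary conditions for the dielectric sphere lend themselves to a direct numerical check. The sketch below is illustrative only: the function names and sample values are assumptions, and Gaussian units are used throughout. It evaluates the polarization (22), the internal field (18), and the surface components of the external field (16) at r = a, then compares both sides of conditions (20) and (21) for a sphere in vacuum.

```python
import math

def sphere_polarization(K, E0):
    """Polarization of a dielectric sphere in a uniform field, equation (22)."""
    return 3.0 / (4.0 * math.pi) * (K - 1.0) / (K + 2.0) * E0

def internal_field(K, E0):
    """Homogeneous internal field, equation (18) with P taken from (22)."""
    return E0 - 4.0 * math.pi / 3.0 * sphere_polarization(K, E0)

def external_surface_components(K, E0, theta):
    """Normal and tangential components of (16) just outside r = a."""
    P = sphere_polarization(K, E0)
    normal = (E0 - 4.0 * math.pi / 3.0 * P + 4.0 * math.pi * P) * math.cos(theta)
    tangential = (E0 - 4.0 * math.pi / 3.0 * P) * math.sin(theta)
    return normal, tangential

def internal_surface_components(K, E0, theta):
    """Normal and tangential components of the internal field at r = a."""
    Ei = internal_field(K, E0)
    return Ei * math.cos(theta), Ei * math.sin(theta)

# Illustrative values (assumed): K = 3, unit applied field, theta = 40 degrees
K, E0, theta = 3.0, 1.0, math.radians(40.0)
n_e, t_e = external_surface_components(K, E0, theta)
n_i, t_i = internal_surface_components(K, E0, theta)
print(t_e - t_i, n_e - K * n_i)

# Conducting-sphere limit: dipole moment per unit field tends to a**3
a = 2.0
polarizability = 4.0 * math.pi * a**3 / 3.0 * sphere_polarization(1e12, 1.0)
print(polarizability)
```

Both printed differences should vanish to rounding error, and the last figure should approach $a^3$ as K is made very large.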

Ex. 75. In a uniformly polarized dielectric let the material within a sphere
of radius a be removed without disturbing the polarization at any other place in
the remaining medium. Determine the field produced at the centre of the sphere
by the surrounding medium. (Observe that the supplementary field may be
ascribed to free charges on the surface of the sphere.)

Ex. 76. Calculate the capacity of a spherical condenser whose dielectric has
a specific inductive capacity $K_1$ from r = a to r = b and one of magnitude $K_2$
from r = b to r = c (a < b < c).

Ex. 77. A point charge is near the infinite plane face of a dielectric medium.
Calculate the field at points outside the dielectric and within it. (Here, as at
p. 275, we can find a solution satisfying all the boundary conditions by placing a
charge of suitable magnitude at the “image point” of the given charge.)



CHAPTER XIII 


Energy and Ponderomotive Forces in the 

Electrostatic Field 


1. Potential Energy of Systems of Charges in a given Field. 

In contrast with the procedure in the preceding section, we consider 
the electrostatic field E to be given here, and introduce into it charges 
which are so small that they do not influence the distribution of the 
charges which produce the field. 

The potential energy of a simple point charge at a place where 
the potential of the field is V is 

$$ u_e = eV. \tag{25} $$


If we have a dipole, and if the potential at the negative charge is V, 
the potential energy of the doublet is 

$$ u_d = -eV + e\,[V + d\mathbf{s}\cdot\operatorname{grad}V] = -e\,d\mathbf{s}\cdot\mathbf{E} = -\mathbf{m}\cdot\mathbf{E}. \tag{26} $$

In a homogeneous field the potential energy can change only by
changing the angle between m and E, and for pure translation the
energy remains constant, i.e. a parallel displacement of a dipole in
a uniform field involves no work. But since work is represented by
the scalar product of force by displacement, this means that in a
homogeneous field no force acts upon a dipole, tending to displace it.
On the other hand, a rotation of amount $\delta\theta$ requires an amount of
work $M\,\delta\theta$, where M represents the magnitude of the torque acting
upon the dipole. Hence

$$ M\,\delta\theta = -\delta u_d = -E\,|\mathbf{m}|\sin\theta\,\delta\theta, $$

and the turning moment acting on the dipole is therefore

$$ M = -|\mathbf{m}|\,E\sin\theta. \tag{27} $$

The negative sign means that the direction of the torque is that tend- 
ing to decrease 0. 

So far we have considered the dipole moment to be constant, but
the case where the dipole moment is proportional to the field (i.e.
$\mathbf{m} = \alpha\mathbf{E}$) is equally important. A force eE acts upon the positive
charge, tending to pull it away from the negative charge, considered
fixed. At equilibrium, this force is balanced by an equal and opposite
internal force. If the field increases by dE, the change in energy is
equal to the change of energy of the dipole [equation (26)], plus the
work done in displacing the charges by dr against the internal forces.
That is, since $e\,dr = dm$,

$$ du_d = -m\,dE - E\,dm + eE\,dr = -m\,dE = -\alpha E\,dE. $$

The total energy of the dipole induced by the field is obtained by
integration from 0 to E:

$$ u_d = -\int_0^E \alpha E\,dE = -\frac{\alpha}{2}\,E^2 = -\frac{1}{2}\,mE. $$
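The integration can be reproduced numerically. The following Python sketch sums $du_d = -\alpha E\,dE$ by the midpoint rule and compares the result with the closed form $-\tfrac{1}{2}\alpha E^2$; the function name, the step count and the sample values are assumptions made for illustration.

```python
def induced_dipole_energy(alpha, E, steps=1000):
    """Midpoint-rule integral of du = -alpha * E' dE' from E' = 0 to E' = E."""
    dE = E / steps
    total = 0.0
    for i in range(steps):
        e_mid = (i + 0.5) * dE   # field strength at the middle of the step
        total += -alpha * e_mid * dE
    return total

# Illustrative values (assumed)
alpha, E = 2.0, 3.0
u_numeric = induced_dipole_energy(alpha, E)
u_closed = -0.5 * alpha * E**2
print(u_numeric, u_closed)
```

The factor 1/2 relative to the energy $-mE$ of a permanent dipole reflects the work stored against the internal forces.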




Ex. 78. Find the translational force acting upon a dipole of given orientation
in a non-homogeneous electrostatic field.


2. Total Energy of the Electrostatic Field. 

In electrostatics there are two different but essentially equivalent 
ways of considering the total energy of the field. The older method, 
which we shall call the theory of “ action at a distance ”, considers 
only the charges and the forces operating between them, without 
attempting to explain how such action is brought about. On the 
other hand, the theory of direct action locates the energy in the intervening
space also, and implies that wherever an electric field is present
there is also a certain continuous distribution of energy. (The picture
suggested is that of an elastic body, say the ether, in a state of stress.
In spite of its great heuristic value, we can no longer ascribe physical
reality to such a picture.) The superiority of the theory of direct
action becomes apparent in the case of rapidly-varying fields, for it
leads simply and naturally to the existence of electromagnetic
waves.

We begin, however, with the theory of action at a distance, and
consider the potential energy of two point charges $e_1$ and $e_2$ in a vacuum.
This is given by the potential of the Coulomb force, and so may be
written down at once:

$$ u = \frac{e_1 e_2}{r_{12}}, \qquad\text{or (M.K.S.)}\qquad u = \frac{e_1 e_2}{4\pi K_0\,r_{12}}. \tag{29} $$

If there are other point charges, we can calculate the total energy
by starting with the charge 1, bringing in charge 2 from infinity to
the distance $r_{12}$, then bringing up charge 3 from infinity until it is at
prescribed distances $r_{13}$ and $r_{23}$ from these charges, and so on. The
total work done in thus building up the field, i.e. the potential energy,
is

$$ U = e_1\left(\frac{e_2}{r_{12}} + \frac{e_3}{r_{13}} + \frac{e_4}{r_{14}} + \dots\right) + e_2\left(\frac{e_3}{r_{23}} + \frac{e_4}{r_{24}} + \dots\right) + e_3\left(\frac{e_4}{r_{34}} + \dots\right) + \dots = \frac{1}{2}\,{\sum_i}'{\sum_k}'\,\frac{e_i e_k}{r_{ik}}, \qquad\text{or}\qquad \frac{1}{2}\,{\sum_i}'{\sum_k}'\,\frac{e_i e_k}{4\pi K_0\,r_{ik}}. \tag{30} $$


The factor 1/2 occurs before the double sum on account of the fact
that every combination of i and k occurs twice in taking the double
sum. The accents on the summation signs indicate that the combinations
i = k are to be omitted. Now

$$ {\sum_i}'\,\frac{e_i}{r_{ik}}, \qquad\text{or, in M.K.S. units,}\qquad {\sum_i}'\,\frac{e_i}{4\pi K_0\,r_{ik}} $$

signifies the electrostatic potential, at the position occupied by the
charge k, caused by all the other charges. Thus we may write

$$ U = \frac{1}{2}\sum_k e_k V_k \qquad\text{(same in M.K.S. units).} \tag{31} $$
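The equivalence of the pairwise sum (30) and the potential form (31) is easily confirmed for a handful of charges. The Python sketch below is illustrative: the function names, charges and positions are assumed values in Gaussian units, and `math.dist` requires Python 3.8 or later.

```python
import math
from itertools import combinations

def energy_pairwise(charges, positions):
    """Sum of e_i*e_k/r_ik over each distinct pair, counted once."""
    return sum(
        charges[i] * charges[k] / math.dist(positions[i], positions[k])
        for i, k in combinations(range(len(charges)), 2)
    )

def potential_at(k, charges, positions):
    """Potential at the position of charge k due to all the other charges."""
    return sum(
        charges[i] / math.dist(positions[i], positions[k])
        for i in range(len(charges))
        if i != k
    )

def energy_from_potentials(charges, positions):
    """U = (1/2) * sum_k e_k * V_k, equation (31)."""
    return 0.5 * sum(
        charges[k] * potential_at(k, charges, positions)
        for k in range(len(charges))
    )

# Illustrative configuration (assumed values)
charges = [1.0, -2.0, 3.0, 0.5]
positions = [(0, 0, 0), (1, 0, 0), (0, 2, 0), (1, 1, 1)]
U_pairs = energy_pairwise(charges, positions)
U_potentials = energy_from_potentials(charges, positions)
print(U_pairs, U_potentials)
```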


The idea may be extended to include continuously distributed charges. 
For a volume charge, $e = \rho\,d\tau$; for a surface charge, $e = \sigma\,dS$. Hence
the energy when volume and surface charges are present is

$$ U = \frac{1}{2}\int V\rho\,d\tau + \frac{1}{2}\int V\sigma\,dS \qquad\text{(same in M.K.S. units).} \tag{31'} $$


In the theory of direct action we must ascribe a given energy con- 
tent to every element of volume of the region where an electric field 
exists. The most natural assumption is that the energy density u (the
energy per unit volume of the field) is proportional to the square of
the electric field strength, for $E^2$ is the simplest scalar with which the
scalar u may be connected. It is easy to verify, also, that $E^2$ has the
dimension of energy per cubic centimetre in the electrostatic system.
The factor of proportionality is to be determined in such a way that
numerical agreement with the theory of action at a distance results.
The equivalence of the expression so determined to (31') is easily
shown. Thus, if we apply Gauss's theorem to the product EV, and if
we remember that E vanishes as $1/r^2$ and V vanishes as 1/r at infinity,
the integral over an infinite sphere is zero, so that

$$ 0 = \oint (\mathbf{E}V)_n\,dS = \int \mathbf{E}\cdot\operatorname{grad}V\,d\tau + \int V\operatorname{div}\mathbf{E}\,d\tau. \tag{32} $$

Since $\mathbf{E} = -\operatorname{grad}V$ and $\operatorname{div}\mathbf{E} = 4\pi\rho$, or $= \rho/K_0$, we have

$$ \int E^2\,d\tau = 4\pi\int V\rho\,d\tau, \qquad\text{or}\qquad = \frac{1}{K_0}\int V\rho\,d\tau. $$

On the other hand, if only volume charges are present, the total energy
of the field is, by (31'),

$$ U = \frac{1}{2}\int V\rho\,d\tau. $$

We obtain numerical agreement if we take the energy density in a
vacuum to be

$$ u = \frac{1}{8\pi}\,E^2 \ \text{(C.G.S.)}, \qquad\text{or}\qquad u = \frac{K_0}{2}\,E^2 \ \text{(M.K.S.)}. $$

If surface charges, and hence surfaces of discontinuity of E, are
present, these are to be excluded from the field by closely-fitting
closed surfaces, when applying Gauss's theorem. This is the origin
of the second integral in equation (31').
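As an illustration of this agreement, consider a conducting sphere of radius a carrying a charge Q (an example chosen here, not taken from the text). Its field is $Q/r^2$ outside and zero inside, and the surface integral of (31') gives $\tfrac{1}{2}QV$ with $V = Q/a$. The Python sketch below integrates the energy density $E^2/8\pi$ over all space outside the sphere, using the substitution $r = a/t$ to map the infinite range onto $0 < t \le 1$; names and values are assumptions.

```python
import math

def field_energy_outside_sphere(Q, a, steps=1000):
    """Integrate u = E**2/(8*pi) over r >= a, with r = a/t and 0 < t <= 1."""
    total = 0.0
    dt = 1.0 / steps
    for i in range(steps):
        t = (i + 0.5) * dt                  # midpoint of the t-interval
        r = a / t
        E = Q / r**2                        # Coulomb field outside the sphere
        u = E**2 / (8.0 * math.pi)          # energy density in Gaussian units
        shell_area = 4.0 * math.pi * r**2   # area of the spherical shell
        jacobian = a / t**2                 # |dr/dt| for r = a/t
        total += u * shell_area * jacobian * dt
    return total

# Illustrative values (assumed)
Q, a = 3.0, 2.0
U_field = field_energy_outside_sphere(Q, a)
U_charges = 0.5 * Q * (Q / a)   # (1/2) Q V with V = Q/a
print(U_field, U_charges)
```

Both routes give $Q^2/2a$, the first from the field alone and the second from the charge and its potential.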

In a medium where the dielectric constant is K, the force between
two point charges is decreased in the ratio 1 : K, and the theory of
action at a distance gives

$$ U = \frac{1}{2}\,{\sum_i}'{\sum_k}'\,\frac{e_i e_k}{K\,r_{ik}} = \frac{K}{2}\,{\sum_i}'{\sum_k}'\,\frac{(e_i/K)(e_k/K)}{r_{ik}}. \tag{35} $$

But $e_k/K$ is the apparent charge $e_k'$, from which the field in the medium
of dielectric constant K may be derived, just as the field in a vacuum
may be derived from the true charge e. Passing to a continuous
distribution of charge, we write

$$ U = \frac{K}{2}\int V\rho'\,d\tau + \frac{K}{2}\int V\sigma'\,dS. \tag{35'} $$

If we again take u proportional to $E^2$ in the direct action method, the
same transformation as was used in equation (32) gives

$$ u = \frac{1}{8\pi}\,KE^2 = \frac{1}{8\pi}\,\mathbf{E}\cdot\mathbf{D}, \qquad\text{or (M.K.S.)}\qquad u = \frac{K K_0}{2}\,E^2. \tag{36} $$

In the second form, the equation is also valid for anisotropic substances,
where E and D have different directions.

We shall next consider the change in the energy of a field which 
results from introducing a dielectric body. The body will be assumed 
to be so small that its introduction does not alter the distribution of 
the charges producing the field. First let there be given a vacuum
field $\mathbf{E}_0 = \mathbf{D}_0$. By bringing in the dielectric body, we produce a new
field E and a new displacement D at every point. Hence the energy
change is given by

$$ U - U_0 = \frac{1}{8\pi}\int (\mathbf{E}\cdot\mathbf{D} - \mathbf{E}_0\cdot\mathbf{D}_0)\,d\tau. \tag{37} $$

This expression may be rewritten as follows:

$$ \mathbf{E}\cdot\mathbf{D} - \mathbf{E}_0\cdot\mathbf{D}_0 = \mathbf{E}\cdot(\mathbf{D} - \mathbf{D}_0) + (\mathbf{E} - \mathbf{E}_0)\cdot\mathbf{D}_0. $$

It is a fundamental property of the electrostatic field that E is irrotational
throughout the entire region (p. 267). Therefore we may
set E equal to $-\operatorname{grad}V$. Again, $\operatorname{div}(\mathbf{D} - \mathbf{D}_0)$ is everywhere zero, and
hence $\mathbf{D} - \mathbf{D}_0$ is free of sources (“solenoidal”), since the true charges,
the sources of D, were not changed by the introduction of the
dielectric substance. Moreover, E and D vanish at infinity as $1/r^2$,
and V vanishes as 1/r. Hence Gauss's theorem gives for integration
over an infinite sphere:

$$ 0 = \oint V\,(\mathbf{D} - \mathbf{D}_0)_n\,dS = -\int \mathbf{E}\cdot(\mathbf{D} - \mathbf{D}_0)\,d\tau + \int V\operatorname{div}(\mathbf{D} - \mathbf{D}_0)\,d\tau. $$

Since $\operatorname{div}(\mathbf{D} - \mathbf{D}_0)$ vanishes everywhere, it follows that

$$ \int \mathbf{E}\cdot(\mathbf{D} - \mathbf{D}_0)\,d\tau = 0 $$

and

$$ U - U_0 = \frac{1}{8\pi}\int (\mathbf{E} - \mathbf{E}_0)\cdot\mathbf{D}_0\,d\tau. \tag{38} $$

But for the same reason, the integral $\int \mathbf{E}_0\cdot(\mathbf{D} - \mathbf{D}_0)\,d\tau$ vanishes also.
If we multiply this integral by $-1/8\pi$ and add it to (38), we
obtain

$$ U - U_0 = \frac{1}{8\pi}\int (\mathbf{E}\cdot\mathbf{D}_0 - \mathbf{E}_0\cdot\mathbf{D})\,d\tau = \frac{1}{8\pi}\int (\mathbf{E}\cdot\mathbf{E}_0 - \mathbf{E}_0\cdot\mathbf{D})\,d\tau = -\int \frac{K-1}{8\pi}\,\mathbf{E}\cdot\mathbf{E}_0\,d\tau, \tag{39} $$

since $\mathbf{D}_0 = \mathbf{E}_0$ by hypothesis. From the definitions of electrical
susceptibility and polarization (p. 280),

$$ U - U_0 = -\frac{1}{2}\int \kappa\,\mathbf{E}\cdot\mathbf{E}_0\,d\tau = -\frac{1}{2}\int \mathbf{P}\cdot\mathbf{E}_0\,d\tau. \tag{40} $$

If the susceptibility is small, $\mathbf{E}\cdot\mathbf{E}_0$ may be replaced by $E_0^2$ without
introducing appreciable error; i.e. the alteration of the field may be
neglected. If we make this substitution in the first integral of (40),
we recognize at once the meaning of this expression. In a field $\mathbf{E}_0$
every cubic centimetre of the dielectric has a dipole moment $\kappa\mathbf{E}_0$,
and by § 1 (p. 286) the energy of a dipole induced by the field is given
by

$$ -\frac{\kappa}{2}\,E_0^2. $$

The change in the field within the dielectric, as calculated for the
sphere on p. 282, causes the scalar product $\mathbf{E}\cdot\mathbf{E}_0$ to appear in place
of $E_0^2$.

This decrease of the field energy for constant field-producing charges, 
easily verified in the case of a parallel-plate condenser with and with- 
out dielectric, must not be confused with the increase in the energy 
content of a dielectric when the field is applied. In the latter situation 
the field-producing charges increase. According to (36) the change in 
energy density is 


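$$ du = \frac{1}{4\pi}\,\mathbf{E}\cdot d\mathbf{D} = \frac{K}{4\pi}\,\mathbf{E}\cdot d\mathbf{E} = \frac{1}{4\pi}\,\mathbf{E}\cdot d\mathbf{E} + \mathbf{E}\cdot d\mathbf{P}. $$

The last form shows that the second term represents the change in
energy of a material body, in which $P \neq 0$, while the first term represents
the change in energy of a vacuum.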

Ex. 79. Show that the energy of a charged condenser is $\tfrac{1}{2}CV^2$ and compute
the change in energy when a dielectric is introduced — in one case with the charge 
on the plates held fixed, in the other case with the potential difference held con- 
stant. 

Ex. 80. Compute the work required to put together infinitely dispersed 
charges to form a sphere having a uniform charge density p and a radius a. 


3. Forces and Equilibrium in the Electrostatic Field; Theory of the 
Manometer Method for the Electrical Susceptibility of a Liquid. 

In mechanics, the position of equilibrium of a system may be
determined by the condition that the potential energy must be a
minimum. Now, according to § 1 (p. 285), the energy of the electrostatic
field is equivalent to the potential energy of the charges, and
so equilibrium is determined by the condition that the sum of the
electrical field energy and the mechanical potential energy
(e.g. that of gravity) is to be a minimum.
We illustrate this principle with the theory of the manometer method
for determining the electrical susceptibility of a liquid, which
is especially important in the analogous magnetic case. A U-tube of
cross-section S is filled with a dielectric liquid. Let
an electric field, normal to the axis of the tube, be applied in the
region of the meniscus in one arm of the tube (fig. 1). The lines of force are
normal to the plane of the diagram, their trace in that plane being
indicated by dots. Application of the field causes the meniscus
in this side of the tube to rise. If the density of the liquid is $\rho$,
and if the equilibrium height of the meniscus in the absence of the
field is $h_0$, the work done against gravity amounts to $S\rho\,(h - h_0)^2 g$,
for the mass $S\rho\,(h - h_0)$ has been moved from the right side to the left,
whereby the centre of gravity has been raised a distance $h - h_0$. If
the susceptibility is small, so that E may be set equal to $\mathbf{E}_0$, the total
energy becomes [cf. equation (40) (p. 289)]:

$$ U = U_0 - \frac{1}{2}\int_0^h \kappa E_0^2\,S\,dh + S\rho\,(h - h_0)^2 g. \tag{41} $$

This means that the total energy depends only upon h; hence the
minimum condition gives

$$ \frac{dU}{dh} = -\frac{1}{2}\,[\kappa E_0^2]_h\,S + 2S\rho\,(h - h_0)\,g = 0, \tag{42} $$

and the difference in level of the menisci is

$$ 2(h - h_0) = \frac{\kappa E_0^2}{2\rho g}. \tag{43} $$

As is evident from the analysis, E 0 signifies the strength of the field 

applied to the meniscus on the left; the field need not be measured 
at other places. 
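The minimum condition can be illustrated numerically. The Python sketch below assumes, for simplicity, that $E_0$ is uniform over the region swept by the meniscus, so that the integral in (41) reduces to $\tfrac{1}{2}\kappa E_0^2 S h$. All names and numerical values are illustrative assumptions in C.G.S. units.

```python
def energy_change(h, h0, kappa, E0, S, rho, g):
    """U - U0 from (41), assuming E0 uniform over the region of the meniscus."""
    return -0.5 * kappa * E0**2 * S * h + S * rho * (h - h0)**2 * g

def equilibrium_height(h0, kappa, E0, rho, g):
    """Height h at which dU/dh = 0, from (42)."""
    return h0 + kappa * E0**2 / (4.0 * rho * g)

def level_difference(kappa, E0, rho, g):
    """Difference in level of the two menisci, equation (43)."""
    return kappa * E0**2 / (2.0 * rho * g)

# Illustrative values (assumed): kappa = 0.08, E0 = 100 e.s.u., water-like liquid
h0, kappa, E0, S, rho, g = 10.0, 0.08, 100.0, 0.5, 1.0, 981.0
h_eq = equilibrium_height(h0, kappa, E0, rho, g)
U_eq = energy_change(h_eq, h0, kappa, E0, S, rho, g)
U_above = energy_change(h_eq + 1e-3, h0, kappa, E0, S, rho, g)
U_below = energy_change(h_eq - 1e-3, h0, kappa, E0, S, rho, g)
print(h_eq - h0, level_difference(kappa, E0, rho, g))
```

The energy at the computed height is lower than at neighbouring heights on either side, and twice the rise in one arm reproduces (43).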

In many cases it is not so important to know the position of equilibrium
as it is to find the force actually operating on the system. For
example, we may wish to find the motion of an uncharged oil drop in
a non-homogeneous field, and must therefore know the forces acting.
The droplet is assumed to be so small that the field may be considered
homogeneous within the spherical volume (radius a) which
it occupies. Then, by equation (22) (p. 284), the polarization within
the sphere is

$$ \mathbf{P} = \frac{3}{4\pi}\,\frac{K-1}{K+2}\,\mathbf{E}_0, $$

and by the exact formula (40) (p. 289), the electrostatic energy is

$$ U = U_0 - \frac{1}{2}\int \mathbf{P}\cdot\mathbf{E}_0\,d\tau = U_0 - \frac{1}{2}\,\frac{4\pi a^3}{3}\,P E_0 = U_0 - \frac{a^3}{2}\,\frac{K-1}{K+2}\,E_0^2. $$

Since we assume the droplet to be so small that we may reckon with
a constant value of $E_0$ within a volume of this magnitude, the
energy will depend on the position of the centre of the sphere.
If the field $\mathbf{E}_0$ is given, there is a definite value of the energy
corresponding to each position of the centre of the droplet. Points at which
this value is the same may be connected by level surfaces. If the
droplet is displaced by ds, the energy changes by

$$ dU = \operatorname{grad}U\cdot d\mathbf{s} = -\frac{a^3}{2}\,\frac{K-1}{K+2}\,\operatorname{grad}E_0^2\cdot d\mathbf{s}. $$

On the other hand, the force acting on the sphere does work

$$ dW = \mathbf{F}\cdot d\mathbf{s}. $$

Since this work must be equal to the decrease in potential energy,
we have

$$ \mathbf{F} = \frac{a^3}{2}\,\frac{K-1}{K+2}\,\operatorname{grad}E_0^2. \tag{46} $$

Since K > 1, the direction of the force is that in which $E_0^2$ increases;
hence the droplet is urged toward regions of greater field strength.
In magnetism, where we shall find far-reaching analogies with electrostatic
phenomena (Chap. XV), the magnetic permeability $\mu$ is the
counterpart of K. We encounter values of $\mu$ less than 1 as well as
greater than 1, depending upon the nature of the substance. Spheres
of those substances for which $\mu < 1$ will be urged out of the
magnetic field, and this difference in behaviour has led to the
classification of magnetic materials into paramagnetic and diamagnetic
substances.
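Formula (46) may be checked against a numerical gradient of the energy. In the Python sketch below the non-homogeneous field is taken, purely as an assumed example, to be that of a point charge Q at the origin, so that $E_0^2 = Q^2/x^4$ along the x-axis. The function names and numerical values are illustrative.

```python
def droplet_energy(E0_squared, a, K):
    """U - U0 for a small dielectric sphere of radius a in a local field E0."""
    return -0.5 * a**3 * (K - 1.0) / (K + 2.0) * E0_squared

def point_charge_field_squared(x, Q):
    """E0**2 on the x-axis at distance x from a point charge Q (Gaussian units)."""
    return (Q / x**2) ** 2

def force_x(x, a, K, Q, step=1e-5):
    """Force component F_x = -dU/dx by a central difference."""
    u_plus = droplet_energy(point_charge_field_squared(x + step, Q), a, K)
    u_minus = droplet_energy(point_charge_field_squared(x - step, Q), a, K)
    return -(u_plus - u_minus) / (2.0 * step)

# Illustrative values (assumed)
a, K, Q, x = 0.01, 2.2, 5.0, 2.0
F_numeric = force_x(x, a, K, Q)
# Equation (46) with grad(E0**2) = -4*Q**2/x**5 along x
F_formula = 0.5 * a**3 * (K - 1.0) / (K + 2.0) * (-4.0 * Q**2 / x**5)
print(F_numeric, F_formula)
```

The force is negative, i.e. directed toward the point charge, where the field is stronger, in agreement with the statement that the droplet is urged toward regions of greater field strength.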

Ex. 81. The Absolute Electrometer. — A flat metal plate of area S is suspended 
from one of the two equal arms of a balance so that it is at a distance a from 
another horizontal plate of the same size. What weight must be added to the 
other side of the balance to maintain equilibrium if the two plates are charged 
to a difference of potential V when the medium between the plates is (a) air, 
and ( b ) a material of dielectric constant K ? 


CHAPTER XIV 

Stationary Electric Fields and Steady Currents 


1. Ohm’s Law. 

We know from what has gone before that it is impossible to main- 
tain an electrostatic field in a conductor. However, by continuously 
supplying energy, a field which is constant in time can be main- 
tained. This field differs from the electrostatic field in that, in the 
first place, it is accompanied by a continuous evolution of heat. 
But there is another difference: an actual transport of electric charge 
takes place in a field of this kind, as may be verified directly, for 
example, in the case of electrolytes. This continued motion of electric 
charges, which we shall refer to as an electric current , is also made 

evident by the magnetostatic field which accompanies it; this will be 
discussed in the following chapter. 

The current strength, or simply the current I, is defined as the amount
of electricity passing through a cross-section of the conductor in unit
time:

$$ I = \frac{de}{dt}. \tag{1} $$

The current density i is a vector in the direction of motion of the
electricity, whose magnitude is equal to the amount of electricity
passing per unit time through unit area of cross-section normal to i.
Then the current through any section is given by

$$ I = \int i_n\,dS = \int |\mathbf{i}|\cos(\mathbf{i}, \mathbf{n})\,dS. \tag{2} $$

Ohm's Law states that the current in a conductor is proportional to the
potential difference V between its ends,

$$ I = \frac{V}{R}, \tag{3} $$

where the resistance R is a constant of the conductor. Since the
resistance depends upon the temperature, the latter must be kept constant
in spite of the evolution of heat accompanying the passage of the
current. In many instances the form

$$ V = IR \tag{4} $$

has more direct physical meaning. This way of writing the equation
states that if we have any conductor carrying a current, the potential 
difference between two points between which there is a resistance R 
is IR. If we have a closed, unbranched circuit consisting of various 
resistances, the first of Kirchhoff’s laws of electric circuits says that 
the algebraic sum of the potential differences across the conductors 
is equal to the sum of the electromotive forces (batteries, generators, 
&c.) applied in that circuit. The other law of Kirchhoff states that at 
any junction, the sum of all inflowing currents must be equal to the 
sum of all outflowing currents. This is merely a special case of the 
absence of sources or sinks (solenoidal property) for electric currents, 
and is almost self-evident. 
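
The first of Kirchhoff's laws can be checked numerically for a single closed, unbranched circuit. The following Python sketch is an illustration only: the function name `loop_current` and the numerical values (volts and ohms) are assumptions chosen for the example and do not come from the text.

```python
# Illustrative sketch (not from the text): Kirchhoff's law for one
# closed, unbranched circuit. All names and values are assumptions.

def loop_current(emfs, resistances):
    """Current I for which the sum of the drops I*R equals the net e.m.f."""
    return sum(emfs) / sum(resistances)

emfs = [12.0, -3.0]            # volts; the sign gives the sense round the loop
resistances = [2.0, 3.0, 4.0]  # ohms, all in series

current = loop_current(emfs, resistances)
drops = [current * r for r in resistances]  # V = IR across each resistance

print(current)      # current in amperes
print(sum(drops))   # total potential drop, equal to the net e.m.f.
```

Because the circuit is unbranched, the same current passes through every resistance, so the second law (equality of inflowing and outflowing currents at a junction) is satisfied trivially here.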

For the theory of direct action we require a formulation of Ohm’s 
Law which connects only the quantities localized at one particular 
point. For a cylindrical conductor the resistance R is proportional 
to the length and inversely proportional to the area of cross-section. 
The resistance offered by a centimetre cube of material to a current
passing between opposite faces is called the specific resistance or
resistivity ρ of the substance; its reciprocal, σ, is called the
conductivity. Hence the expression for the current in a cylindrical
conductor of length l and cross-section S may be written

$$I = \frac{V}{R} = \frac{S}{\rho l}\, V = \sigma S\, \frac{V}{l} \qquad (5)$$

Now V/l is equal to the field in the direction of the generators of the
cylinder and I/S is the current density. Since the current is in the
direction of the field for isotropic bodies, these considerations hold for
any small cylinder drawn within a conductor carrying a current. Hence
(5) may be written in the vector form

$$\mathbf{i} = \sigma \mathbf{E} \qquad (6)$$

The fundamental implication of Ohm’s Law is that, for conductors
such as metals and electrolytes to which it applies, σ is independent
of the field strength.

The electric field strength is an irrotational vector in current- 
carrying conductors, just as it is in electrostatics. Also, there are no 
surface vortices on the surface of separation of two conductors: 

$$\operatorname{Curl} \mathbf{E} = [\mathbf{n}(\mathbf{E}_2 - \mathbf{E}_1)] = 0 \quad\text{or}\quad E_{\tan}^{(2)} = E_{\tan}^{(1)}. \qquad (7)$$

But the normal component of E experiences a discontinuity at the
interface; the surface divergence of i must vanish, since any charge
which enters the surface from one side must emerge from the other,
so that

$$\operatorname{Div} \mathbf{i} = \mathbf{n}(\mathbf{i}_2 - \mathbf{i}_1) = \mathbf{n}(\sigma_2 \mathbf{E}_2 - \sigma_1 \mathbf{E}_1) = 0 \quad\text{or}\quad \sigma_2 E_n^{(2)} = \sigma_1 E_n^{(1)}. \qquad (8)$$

Thus the change in direction of the lines of flow at the surface of 
separation of two conductors is the same as that of the electrostatic 
lines of force at the surface separating two dielectrics (p. 279): 


$$\frac{\tan \alpha_1}{\tan \alpha_2} = \frac{\sigma_1}{\sigma_2}. \qquad (9)$$
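
The law of refraction (9) for the lines of flow, tan α₁ / tan α₂ = σ₁/σ₂ with the angles measured from the normal to the interface, lends itself to a short numerical illustration. The sketch below is only an example under stated assumptions: the function name `refracted_angle` and the sample conductivities are hypothetical.

```python
import math

# Illustrative sketch (names and values are assumptions): refraction of
# the lines of flow at the interface between two conductors, eq. (9).

def refracted_angle(alpha1_deg, sigma1, sigma2):
    """Angle alpha2 (degrees, from the normal) in medium 2 for a line of
    flow that makes the angle alpha1 with the normal in medium 1."""
    tan_alpha2 = math.tan(math.radians(alpha1_deg)) * sigma2 / sigma1
    return math.degrees(math.atan(tan_alpha2))

# Equal conductivities: no refraction.
print(refracted_angle(30.0, 1.0, 1.0))

# Passage from a good conductor into a very poor one: the lines of flow
# emerge almost along the normal.
print(refracted_angle(45.0, 1.0, 1.0e-6))
```

The second case corresponds to the situation of a metal body immersed in an electrolyte: in the poorly conducting medium the lines of flow leave the metal surface very nearly at right angles.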


Likewise, no lines of flow can originate within the conductor, so that 
we have 

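$$\operatorname{div} \mathbf{i} = \operatorname{div}(\sigma \mathbf{E}) = -\sigma \operatorname{div} \operatorname{grad} V = -\sigma \Delta V = 0. \qquad (10)$$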

This means that inside homogeneous conductors V satisfies the same 
differential equation as it does in the electrostatic field in free space. 
Moreover, the boundary conditions are also the same, except that the 
constants have a different meaning. This property of steady fields may 
be used to map the field experimentally. If it is desired to determine
the static field about a body of arbitrary form, this can be done by 
immersing a metallic model in a large metal basin filled with an elec- 
trolyte. The body is connected to one pole of a source of potential, 
and the basin to the other. With the help of a Wheatstone bridge the 
equipotential surfaces may be determined with ease. Since the con- 
ductivity of the solution is small compared with that of the metal, 
the condition that the surface of the body be an equipotential surface 
is practically realized. 

Ex. 82. A condenser of capacity C is at the potential V₀ at time t = 0. If
it is discharged through a resistance R, calculate the resulting current.

Ex. 83. A condenser C is charged through a large resistance R by means of
a source of direct potential V₀. A glow lamp having a striking potential V is
connected in parallel with the condenser. What is the frequency of the resulting
pulsating current?


2. Generation of Heat in a Steady Electric Field.

According to p. 267, the work done when an electric charge e is
moved from a point where the potential is V₁ to a point at potential
V₂ is

$$W = e(V_1 - V_2). \qquad (11)$$

If the charge is free to move, this work appears as kinetic
energy. In the steady field in metallic conductors, the work is
transformed into heat. The atomistic view of conduction interprets this
heat as arising from the frictional motion of the charges. Hence, in a
section of a conductor bounded by two equipotential surfaces at
potentials V₁ and V₂, the amount of heat generated per second by a
current I is

$$W = I(V_1 - V_2), \qquad (12)$$

measured in energy units, since I is the quantity of electricity which
passes through a potential difference V₁ − V₂ each second. Using
Ohm’s law to eliminate the potential difference from (12), we have the
usual form of the equation for the so-called Joule heat loss,

$$W = I^2 R. \qquad (12a)$$

From the view-point of the theory of direct action, a differential 
formulation is desirable. Let us take a volume element in the form 
of a small cylinder whose lateral surface is formed by lines of flow and 
whose ends are portions of equipotential surfaces. If the altitude of 
the cylinder is h and the cross-section is S, then the potential difference 
between the end surfaces is 

$$V_1 - V_2 = h\,|\operatorname{grad} V| = h\,|\mathbf{E}| \qquad (V_1 > V_2).$$

The current is I = S |i|, so that the quantity of energy dissipated as
heat in the volume element dτ each second is

$$dW = |\mathbf{i}|\,|\mathbf{E}|\,S h = \sigma \mathbf{E}^2\, d\tau = \frac{\mathbf{i}^2}{\sigma}\, d\tau. \qquad (13)$$

It may be shown without difficulty that this equation is true for an
element of volume of any kind.
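
As a check on the agreement between the integral form (12a) and the differential form (13), the heat developed in a uniform cylindrical conductor may be computed in both ways. The Python sketch below is illustrative only; the function name `joule_heat_two_ways` and the sample values are assumptions, and any consistent system of units may be used.

```python
# Illustrative sketch (names and values are assumptions): Joule heat in a
# uniform cylinder from W = I^2 R and from sigma * E^2 times the volume.

def joule_heat_two_ways(sigma, length, area, potential_difference):
    resistance = length / (sigma * area)        # R = l / (sigma S)
    current = potential_difference / resistance # Ohm's law
    field = potential_difference / length       # uniform field |E| = V / l
    from_circuit = current ** 2 * resistance             # eq. (12a)
    from_density = sigma * field ** 2 * (area * length)  # eq. (13) summed over the volume
    return from_circuit, from_density

w_circuit, w_density = joule_heat_two_ways(
    sigma=5.0, length=2.0, area=0.1, potential_difference=3.0
)
print(w_circuit, w_density)  # the two values agree
```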


CHAPTER XV 

The Magnetostatic Field 


1. Comparison of Electrostatic and Magnetostatic Fields. 

Besides electrical phenomena, there is a second type of non- 
mechanical effects whose close connexion with electricity was recog- 
nized at an early period, viz. magnetic phenomena. Just as an electro- 
static field can be maintained without energy supply, so there exist 
also magnetic fields which persist without the necessity of supplying 
energy. The field between the poles of a permanent magnet is of this 
type, also the magnetic field which, as we shall see, accompanies a 
steady current. These are included in the classification magnetostatic 
field. It is true that energy is dissipated continuously in the latter 
case, but this is merely a consequence of the finite resistance of the 
conductor, and has nothing to do with the magnetic field, whose 
strength depends solely upon the current. If the resistance of a con- 
ductor could be made arbitrarily small— a condition which is almost 
realized at very low temperatures (cf. p. 449) — these magnetostatic 
fields could also be maintained without consuming energy. 

If, in analogy with the electrostatic field, we try to map the field 
by using a magnetic test body, we encounter the first difference be- 
tween electrostatic and magnetostatic fields: no test body exists 
having an independent magnetic charge. Even the smallest fragment 
of a steel magnet is a dipole which behaves in a magnetic field much 
as an electrical dipole does in an electrostatic field. Hence we can 
explore the field only by observing the torque exerted upon a small 
magnet of known magnetic moment m. From the calculation for the
corresponding electrical case (p. 285), this torque is

$$|\mathbf{M}| = |\mathbf{m}|\,|\mathbf{H}| \sin\theta, \quad\text{or in vector form,}\quad \mathbf{M} = [\mathbf{m}\mathbf{H}], \qquad (1)$$

where H is the magnetic field strength. The position of equilibrium is
that in which m is parallel to H. For small deflections from this
position the restoring couple is proportional to |m| |H|. If the moment
of inertia of the small bar magnet is known, this product can be
determined experimentally by observing the period of vibration in
the field [use equation (20) (p. 148)]. [...]
In order to obtain the field strength H from the value of the torque
M, we must also know the moment m of the test magnet. Since the
fields of magnetic systems follow the same laws as those of electrical
charges, we can calculate the field of a magnetic doublet by the results
of p. 272; then, by observing the forces experienced by other dipoles
in this field, we can obtain a second equation connecting the two
unknown quantities m and H. A detailed description of such an
experiment may be found in any standard treatise on electricity and
magnetism.* The absence of isolated magnetic charges, or poles, implies

$$\operatorname{div} \mathbf{H} = 0 \qquad (2)$$


at all points in a vacuum. The magnetostatic field in a vacuum is every- 
where solenoidal (free of sources, or non-divergent). The electrostatic 
field, which may have a finite divergence even in a vacuum, is never- 
theless characterized by complete absence of curl. This is by no means 
always the case for a magnetostatic field, as may be seen from the 
simple example of the field about a straight wire carrying a current. 
As we know from elementary experiments, the lines of force in this 
case are circles whose centres are at the wire, their planes being normal 

to the wire. If we take the line integral ∮ H ds along one of these

circles it cannot vanish, since we are continually adding quantities of 
the same algebraic sign. Hence the irrotational condition is no longer 
fulfilled by the magnetic field, for paths encircling lines of flow. Thus, 
while the electrostatic field in a vacuum must be everywhere irrotational
but may be divergent, the magnetostatic field in a vacuum is non-divergent
at all points, but may have a finite curl.

* If the force is measured in C.G.S. units, m and H will thus be given in this system.
The unit of magnetic field strength so defined was formerly called the gauss but is
now designated the oersted. In the M.K.S. system the field is specified in terms of the
current producing it (cf. § 2).

In the same way as the electrostatic field between the plates of a
condenser is reduced by a factor K when a medium of dielectric
constant K replaces the vacuum between the plates, so the field between
the poles of a magnet is changed by the introduction of a medium
having different magnetic properties. The quantity corresponding to
the dielectric constant K, which is analogously defined as the ratio
H_vac : H, is called the magnetic permeability μ. The quantity μH,
corresponding to the dielectric displacement D, is called the magnetic
induction B. As in the electrical case, the effect of the material may
be represented by a polarization of the medium. The magnetic quantity
corresponding to the electrical magnitude P (the polarization) is called
the intensity of magnetization

$$\mathbf{M} = k\mathbf{H} = \frac{\mu - 1}{4\pi}\, \mathbf{H}, \qquad (3)$$

where k is the magnetic susceptibility. The intensity of magnetization
is also equal to the magnetic moment per unit volume. The magnetic
susceptibility differs in an important way from the electric
susceptibility: there are substances for which k is negative as well 
as substances for which it is positive. In contrast with electric polari- 
zation, which is always in the same direction as the field, the mag- 
netization may be either in the direction of the field or in the opposite 
direction. But even in the electrical case, we found (p. 280) two dif- 
ferent ways in which the polarization may come about. The same 
two possibilities exist in the magnetic case also, except that the 
difference is far more apparent here, inasmuch as the one kind (align- 
ment of the already existing dipoles) yields positive values of k (para- 
magnetic substances), while the other kind (creation of dipoles by 
induction) gives negative values of k (diamagnetic substances). While 
the electrical susceptibility of liquids and solids lies between 0.1 and
10, the magnetic susceptibility, with few exceptions, amounts only to
about 10⁻⁴ or even less. But for the so-called ferromagnetic metals,
which are mostly metals of the iron group and certain alloys, k assumes
enormously greater values (of the order of 10³); however, k is not
constant, but depends upon the strength of the applied field. This
fact may be understood from the way in which paramagnetic sus- 
ceptibility is believed to arise: if the resultant dipole moment may 
be ascribed to the alignment of existing dipoles, then this moment 
cannot remain proportional to H while H increases indefinitely, since 
an upper limit is attained when all the elementary dipoles finally have 
been brought into line with the field. 

After this digression on the subject of the atomistic interpretation
of susceptibility we return to the description of the experimental
properties of the magnetostatic field, which is wholly analogous to that
for the electrostatic field. On account of the absence of true magnetic
charges, we have always

$$\operatorname{div} \mathbf{B} = \operatorname{div}(\mu \mathbf{H}) = 0 \quad\text{and}\quad \operatorname{Div} \mathbf{B} = \mu_1 H_n^{(1)} - \mu_2 H_n^{(2)} = 0. \qquad (4)$$

On the other hand, H is derived from apparent (free) charges residing
in non-homogeneous parts of the media:

$$\operatorname{div} \mathbf{H} = 4\pi\rho_m, \qquad \operatorname{Div} \mathbf{H} = 4\pi\sigma_m. \qquad (5)$$

If no finite (“surface”) current flows in the surface of separation of
two media (a case which we exclude), then Curl H = 0. Hence the
refraction of the lines of force at the interface is governed by the
tangent law

$$\frac{\tan \alpha_1}{\tan \alpha_2} = \frac{\mu_1}{\mu_2}. \qquad (6)$$

Just as there are no true single magnetic charges, so there exist
no conductors of magnetism. On account of the high value of μ for
ferromagnetic substances, the lines of force reaching these are very
nearly normal to the surface, as may be seen from (6). This means
that such substances behave very much like conductors, with regard
to the static arrangement of the lines of force.

The expression for the magnetic energy density also corresponds 
to that for the electrostatic case:

$$u_{\text{mag}} = \frac{\mu}{8\pi} H^2 = \frac{1}{8\pi} BH, \quad\text{or}\quad u_{\text{mag}} = \frac{\mu\mu_0}{2} H^2 \ \text{in M.K.S. units;*} \qquad (7)$$


hence the calculations of § 3, Chap. XIII (p. 290), may be applied to 
magnetism, as already mentioned there. 

Ex. 84. As a first approximation, the earth may be regarded as a sphere 
magnetized with uniform intensity. Calculate the angle of dip as a function of 
the magnetic latitude. 

Ex. 85. In a certain terrestrial magnetic survey, an area of 13-1 x 10 s km. 2 
was enclosed by a path from New York to England and back. The path was 
described in the counter-clockwise direction, as seen from above. The result was
that ∮ H ds = −64 × 10⁴ oersted cm. What does this mean?


2. Calculation of the Magnetostatic Field accompanying a Given Dis- 
tribution of Electric Currents in a Vacuum. 


In elementary physics the magnetic field due to steady currents 
is calculated by means of a formula from the theory of action at a 
distance. This law, which gives the magnetic field due to an element 
of a conductor, is the familiar rule of Biot and Savart (p. 303). How- 
ever, this law is not a suitable basis for obtaining a more general re- 
lationship corresponding to the direct-action point of view ; moreover, 
the Biot-Savart rule cannot be verified experimentally, since there 
are no isolated elements of conductors — at least for steady fields — 
but only closed circuits. Hence we start from a different empirical 
fact. If we map the magnetic field of a conducting wire, as outlined
on p. 297, and calculate the line integral ∮ H ds along any arbitrary
curve which encircles the conductor, we obtain the same value,
proportional to the current in the circuit, no matter how we choose the
path of integration, as long as the circuit threads it:

$$\oint \mathbf{H}\, d\mathbf{s} = \gamma I \qquad (8)$$


If we measure H in Gaussian magnetic units [the connexion is given
directly by the Gauss method of determining the field (p. 298)], and
if I is measured in the electrostatic system, then, in order to make the
equation numerically correct, we must take the value

$$\gamma = 4\pi/c,$$

where c is numerically equal to the velocity of light in a vacuum,
expressed in cm./sec. If, on the other hand, we take γ = 4π, we
thereby determine a new unit for I, the electromagnetic C.G.S. unit,
which, however, we wish in principle to avoid using in spite of the fact
that the practical units such as the ampere and the volt differ from the
electromagnetic C.G.S. units only by powers of ten. In the M.K.S.
system there is no specific unit of magnetic field strength, but the field
is defined by setting γ = 1 in equation (8). Thus the field is specified
by the current producing it, and has the dimension amp./metre.

* The quantity μ₀ will be defined on p. 312.

We wish to bring equation (8) into a form consonant with the theory
of direct action. Imagine that we have given an extended conductor
in which the current density vector i is known as a function of
position. If we draw a small closed curve G inside this conductor, the
current passing through it is

$$I = \mathbf{i}\, d\mathbf{S},$$

where dS denotes the directed element of area bounded by G. From
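the definition of curl,

$$\oint \mathbf{H}\, d\mathbf{s} = (\operatorname{curl} \mathbf{H})\, d\mathbf{S}$$

for this element of area. According to equation (8), this must be
equal to

$$\frac{4\pi}{c}\, I = \frac{4\pi}{c}\, \mathbf{i}\, d\mathbf{S}.$$

Since this relationship must hold for any element of area whatsoever,
for slowly varying fields

$$\operatorname{curl} \mathbf{H} = \frac{4\pi \mathbf{i}}{c} = \frac{4\pi}{c}\, \sigma \mathbf{E}, \qquad (9)$$

or in M.K.S. units,

$$\operatorname{curl} \mathbf{H} = \sigma \mathbf{E}. \qquad (9')$$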

This is the differential form of the law corresponding to the theory 
of direct action. The vector equation states that the field lines of 
the current density vector are the same as those of the vector curl H. 
Since the divergence of a curl is identically zero (p. 28), the lines of
flow can neither originate nor terminate in the finite region. They
must either form closed curves or go to infinity.

The problem for the electrostatic field was to deduce an irrotational
field from a given distribution of sources, or charges. This task was
simplified by the introduction of the electrostatic potential, which
could be computed readily by integration over all the charges. In
the complementary problem now before us, the determination of a
solenoidal or non-divergent field from the distribution of the vortices
(currents), we likewise simplify our task by introducing an auxiliary
quantity. This quantity is no longer a scalar, but a vector, which we
call the vector potential. We take

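$$\mathbf{H} = \operatorname{curl} \mathbf{V} \qquad (10)$$

The non-divergence of H is thus provided for. Then by equation (98)
(p. 40),

$$\operatorname{curl} \mathbf{H} = \operatorname{curl} \operatorname{curl} \mathbf{V} = \operatorname{grad} \operatorname{div} \mathbf{V} - \Delta \mathbf{V} = \frac{4\pi \mathbf{i}}{c}. \qquad (11)$$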


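Without loss of generality, we can assume that div V vanishes; for if
it did not do so, we might add to V an irrotational vector having the
same sources as V, but with opposite signs, without changing H in
any way. Thus we obtain the same type of equation for the vector
potential as for the scalar potential in electrostatics:

$$\Delta \mathbf{V} = -\frac{4\pi \mathbf{i}}{c}. \qquad (12)$$

In component form (cf. p. 40):

$$\Delta V_x = -\frac{4\pi i_x}{c}, \qquad \Delta V_y = -\frac{4\pi i_y}{c}, \qquad \Delta V_z = -\frac{4\pi i_z}{c}. \qquad (13)$$

The solution of this set of equations is known from electrostatics:
if we are dealing only with currents distributed in space (surface
currents, corresponding to surface charges in electrostatics, are
temporarily excluded), the solution is

$$V_x = \frac{1}{c} \int \frac{i_x\, d\tau}{r}, \qquad V_y = \frac{1}{c} \int \frac{i_y\, d\tau}{r}, \qquad V_z = \frac{1}{c} \int \frac{i_z\, d\tau}{r}. \qquad (14)$$

The component forms may be combined again into a vector equation:

$$\mathbf{V} = \frac{1}{c} \int \frac{\mathbf{i}\, d\tau}{r}. \qquad (15)$$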

By means of this equation the magnetic field of a linear conductor
(wire) may be determined at once. Here the vector i is in the
direction of the axis. Let the element of length ds be in this direction
also, and let the cross-sectional area of the wire be S. Then

$$\mathbf{i}\, d\tau = \frac{I}{S}\, S\, d\mathbf{s} \qquad (16)$$

and thus, since I has the same value at all points of the conductor,

$$\mathbf{V} = \frac{I}{c} \oint \frac{d\mathbf{s}}{r}. \qquad (17)$$

The integration is to be extended over all the elements of the
conductor and has nothing to do with taking the curl of V, in which
operation the co-ordinates of the field point are differentiated.
Hence

$$\mathbf{H} = \operatorname{curl} \mathbf{V} = \frac{I}{c} \oint \operatorname{curl} \frac{d\mathbf{s}}{r} = \frac{I}{c} \oint \Bigl[\operatorname{grad} \frac{1}{r},\, d\mathbf{s}\Bigr] = \frac{I}{c} \oint \frac{[d\mathbf{s}, \mathbf{r}_0]}{r^2}. \qquad (18)$$

The factor 4π/c on the right side of (12) is absent in the M.K.S. system,
so that in these units

$$\mathbf{H} = \oint \frac{I}{4\pi}\, \frac{[d\mathbf{s}, \mathbf{r}_0]}{r^2}. \qquad (18')$$

Here r₀ is a unit vector drawn from the segment ds toward the field
point. We can interpret this equation to mean that each element of
length of the wire contributes an amount

$$\frac{I}{c}\, \frac{[d\mathbf{s}, \mathbf{r}_0]}{r^2}$$

to the field. But this is precisely the Law of Biot and Savart, according
to which the field at P due to the current in the element ds is normal
to the plane determined by ds and the radius vector from ds to P,
and is of amount I ds sin θ/(cr²).
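
The summation over the elements of a closed circuit prescribed by (18') can be carried out numerically. The following Python sketch is an illustration under stated assumptions: it treats a circular loop of radius a lying in the xy-plane, uses the M.K.S. form (18'), and compares the result on the axis with the well-known closed expression I a² / [2 (a² + z²)^(3/2)]. The function names `cross` and `loop_field` and the numerical values are hypothetical.

```python
import math

# Illustrative sketch (names and values are assumptions): numerical
# summation of the Biot-Savart contributions, eq. (18'), for a circular
# loop of radius a in the xy-plane carrying a current I (M.K.S. units).

def cross(u, v):
    """Vector product [u, v] of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def loop_field(current, radius, point, n=2000):
    """Field H (amp./metre) at `point` from n straight elements ds."""
    hx = hy = hz = 0.0
    dphi = 2.0 * math.pi / n
    for k in range(n):
        phi = (k + 0.5) * dphi
        source = (radius * math.cos(phi), radius * math.sin(phi), 0.0)
        ds = (-radius * math.sin(phi) * dphi, radius * math.cos(phi) * dphi, 0.0)
        r = tuple(p - s for p, s in zip(point, source))  # from ds to field point
        dist = math.sqrt(sum(c * c for c in r))
        c = cross(ds, r)                    # [ds, r] = [ds, r0] * dist
        factor = current / (4.0 * math.pi * dist ** 3)
        hx += factor * c[0]
        hy += factor * c[1]
        hz += factor * c[2]
    return hx, hy, hz

I, a, z = 2.0, 0.5, 0.3
H = loop_field(I, a, (0.0, 0.0, z))
H_axis_exact = I * a ** 2 / (2.0 * (a ** 2 + z ** 2) ** 1.5)
print(H[2], H_axis_exact)  # numerical and closed-form axial field
```

On the axis the transverse components cancel by symmetry, so only the axial component survives; at points off the axis the same routine may be used, though no closed expression is then available for comparison in this sketch.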

It may be shown without much difficulty that a closed
linear current (circuit) gives rise to the same magnetic field as an
arbitrary surface covered by magnetic dipoles which have the
directions of the normals to this surface, and which have a magnetic moment
of I/c per unit area, the surface being bounded by the circuit. A
surface thus uniformly covered by normally set magnetic doublets is
called a uniform, normally magnetized shell. It is, of course, further
understood that the poles of one kind are all on one side of the shell.
This equivalence of a magnetic shell and a current may easily be
proved with the help of certain transformations of vector analysis.
We must be careful to note whether the field point or the element of
the conductor is the element which varies. All differentiation referring
to a displacement of the field point will be characterized by the
subscript f; derivatives relating to variation of the element of the
conductor will be denoted by the subscript s. We again note, as on p. 281,

$$\operatorname{grad}_f \frac{1}{r} = -\operatorname{grad}_s \frac{1}{r}.$$

We transform the expression for the vector potential at the
field point by a generalization of Stokes’s theorem. Let u be a scalar
point function, and let it be required to transform the line integral
∮ u ds into a surface integral. We multiply scalarly by an arbitrary
constant vector a and obtain the following result by using Stokes’s
theorem and equation (89) (p. 38):

$$\mathbf{a} \oint u\, d\mathbf{s} = \oint u\mathbf{a}\, d\mathbf{s} = \int [\operatorname{curl}(u\mathbf{a})]\, d\mathbf{S} = \int [\operatorname{grad} u, \mathbf{a}]\, d\mathbf{S} = \mathbf{a} \int [d\mathbf{S}, \operatorname{grad} u].$$

Since a is perfectly arbitrary, we must have in general

$$\oint u\, d\mathbf{s} = \int [d\mathbf{S}, \operatorname{grad} u].$$

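If we apply this equation to the vector potential as given by equation
(17) (p. 302), we must remember that the field point is fixed, and
hence all derivatives relate to change of the elements of the conductor.
Hence

$$\mathbf{V} = \frac{I}{c} \int \Bigl[d\mathbf{S}, \operatorname{grad}_s \frac{1}{r}\Bigr] = \frac{I}{c} \int \Bigl[\mathbf{n}, \operatorname{grad}_s \frac{1}{r}\Bigr] dS. \qquad (19)$$

If now we compute the field strength H by taking the curl, the field
point is to vary, so that

$$\mathbf{H} = \frac{I}{c} \operatorname{curl}_f \int \Bigl[\mathbf{n}, \operatorname{grad}_s \frac{1}{r}\Bigr] dS = \frac{I}{c} \int \operatorname{curl}_f \Bigl[\mathbf{n}, \operatorname{grad}_s \frac{1}{r}\Bigr] dS = -\frac{I}{c} \int \operatorname{curl}_f \Bigl[\mathbf{n}, \operatorname{grad}_f \frac{1}{r}\Bigr] dS.$$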

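The integrand may be transformed by introducing the ∇-operator:

$$\operatorname{curl}_f \Bigl[\mathbf{n}, \operatorname{grad}_f \frac{1}{r}\Bigr] = \Bigl[\nabla_f \Bigl[\mathbf{n}\, \nabla_f \frac{1}{r}\Bigr]\Bigr] = \mathbf{n} \cdot \nabla_f \nabla_f \frac{1}{r} - (\mathbf{n} \nabla_f)\, \nabla_f \frac{1}{r}. \qquad (20)$$

Since, however, ∇_f ∇_f (1/r) = Δ_f (1/r) = 0, we obtain

$$\mathbf{H} = +\frac{I}{c} \int (\mathbf{n} \operatorname{grad}_f) \operatorname{grad}_f \frac{1}{r}\, dS = -\operatorname{grad}_f \int \frac{I}{c}\, \frac{\mathbf{n}\mathbf{r}}{r^3}\, dS. \qquad (21)$$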


Hence the magnetic field is derivable from the scalar potential

$$V_m = \int \frac{I}{c}\, \frac{\mathbf{n}\mathbf{r}}{r^3}\, dS = \frac{I}{c} \int \frac{\cos(\mathbf{n}, \mathbf{r})}{r^2}\, dS, \qquad (22)$$

where r is the radius vector drawn to the field point from the element
dS of any surface bounded by the conductor. Comparison with the
expression for the potential of a double layer [(29'), p. 273] shows
that the dipole moment per unit area is I/c and that the direction of
the dipoles is that of the normal n to the surface, the sense being
such that the current flows in the clockwise direction when sighting
along n. In M.K.S. units the dipole moment per cm.² is μ₀I.

But the potential has different values on the two sides of the
surface, since the integral ∮ H ds must amount to 4πI/c for a complete
circuit, and the contribution of the arbitrarily small space occupied
by the shell must vanish. This confirms the result found on p. 274,
that the potential has a discontinuity amounting to 4π|m| at a
double layer. The result of (21) may be stated: At points outside any
surface bounded by a closed circuit, the magnetic field is that which would be
produced by a double layer covering this surface uniformly.* The surface
may have any form. It represents a surface of discontinuity such that
on passing through it the potential must be increased by 4π|m|.


Ex. 86. Calculate the magnetic field at interior and exterior points of an 
infinitely long solid cylindrical conductor of radius B. 


3. Calculation of the Magnetic Field accompanying Electric Currents 
when Ferromagnetic Materials are present. 

If we fill the region about a current-carrying conductor with bodies
of various permeabilities, experiment shows that the line integral
∮ H ds taken over any closed curve continues to have the value 4πI/c,
where I is the total current enclosed by the path of integration. It
follows that equation (9) (p. 301) is independent of the permeability of
the medium. Hence, in contrast with electrostatics, where the field due
to charged bodies is reduced by a factor K by filling the space with
a medium of dielectric constant K, the magnetic field of currents
remains unaltered if the entire space be filled with a medium of
permeability μ. This is true, however, only provided the entire region is
thus filled (see below).

Since, on the other hand, the formal application of the theory
developed for electrostatics requires that the magnetic field of
fictitious single charges or dipoles be reduced by a factor μ in a medium
of permeability μ, we must increase the strength of the equivalent
magnetic shell of the circuit by this factor if H is to remain the same
when the permeability is changed to μ. Thus we must give the shell
a moment μI/c per unit area, or in M.K.S. units, μμ₀I.

* [...] a simple magnetic shell, the bounding edge of which coincides with
[...] the strength of the shell.

If, however, we place ferromagnetic materials in certain portions
of the region only, a change in the field occurs in the magnetic case
also, and this change is considerable on account of the high value of
the permeability. Since equation (9) (p. 301) is independent of the
medium, the field H and the field H₀ in a vacuum must have the same
vortex lines, viz. the lines of flow of the current, which are the field
lines of the vector i. Hence the field to be added to H₀, viz. H − H₀,
must be irrotational and hence derivable from a scalar potential. It
naturally suggests itself that this scalar potential may be simply that
of the dipoles of the magnetized bodies. This is really the case, and
may be demonstrated as follows: Let us assume the usual case where
we deal with individual ferromagnetic bodies, each of uniform
permeability. Then by § 1 (p. 299), there exists no volume divergence of
H, but there is a surface divergence at the bounding surfaces. This
divergence is connected with the apparent surface charges whose
surface density is σₘ′ by the relation

$$\operatorname{Div} \mathbf{H} = \operatorname{Div}(\mathbf{H} - \mathbf{H}_0) = 4\pi\sigma_m'.$$

On the other hand, on account of the absence of spatial sources
(charges),

$$\operatorname{div} \mathbf{H} = \operatorname{div}(\mathbf{H} - \mathbf{H}_0) = -\operatorname{div} \operatorname{grad} V = -\Delta V = 0.$$

The solution of this equation is known from electrostatics. If the
only sources are those distributed on surfaces, this solution is

$$V = \int \frac{\sigma_m'}{r}\, dS.$$

Now since Div B is always zero (absence of true magnetic charges),
we have, from (3), p. 298,

$$\operatorname{Div} \mathbf{M} = -\frac{1}{4\pi} \operatorname{Div} \mathbf{H} = -\sigma_m'.$$

Hence the potential, from which the supplementary field is derived,
becomes

$$V = -\int \frac{\operatorname{Div} \mathbf{M}}{r}\, dS. \qquad (23)$$

The integration is to be extended over the surfaces of all the
ferromagnetic bodies. Since these must be separate closed surfaces, we
have the added potential in the form given for the electrostatic case
at the end of § 2, Chap. XII (p. 281). Using Gauss’s theorem, and
reversing the argument used there, we can represent the potential as
an integral over the volumes occupied by ferromagnetic substances:

$$V = -\oint \frac{\operatorname{Div} \mathbf{M}}{r}\, dS = \oint \frac{\mathbf{M}\, d\mathbf{S}}{r} = \int \frac{\mathbf{M}\mathbf{r}}{r^3}\, d\tau. \qquad (24)$$

But the last expression is exactly the potential of the dipoles of the
magnetized bodies; this is the fact we proposed to demonstrate.

If the field which produces the magnetization M were not itself 
altered by the presence of the ferromagnetic substances, calculation 
of the field would be a simple matter. In that case, one could first 
compute the vacuum field strength H₀ from the distribution of the
currents, then determine from this the magnetization of the bodies
introduced, and thus calculate the supplementary field of the dipoles
from this result. How far from the truth this result would be is shown
by the example of the sphere in an originally homogeneous field,
a problem which we already treated for the electrostatic case.
According to equation (23) on p. 284, the interior field which determines
the magnetization is no longer H₀, but

$$\mathbf{H}^{(i)} = \frac{3}{\mu + 2}\, \mathbf{H}_0, \qquad (25)$$

so that for μ = 2000, H^(i) is only about 0.15 per cent of H₀.
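
The figure of 0.15 per cent follows directly from the factor 3/(μ + 2) of equation (25). A minimal Python check is given below; the function name `interior_field_ratio` is an assumption introduced only for this illustration.

```python
# Illustrative sketch (the function name is an assumption): ratio of the
# interior field of a sphere to the applied uniform field, eq. (25).

def interior_field_ratio(mu):
    """H(i) / H0 for a sphere of permeability mu in a uniform field."""
    return 3.0 / (mu + 2.0)

ratio = interior_field_ratio(2000.0)
print(100.0 * ratio)                # per cent of H0, about 0.15
print(interior_field_ratio(1.0))    # mu = 1: the field is unchanged
```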

Fig. 1

As we saw above in the electrostatic case, the exact calculation of the
field is possible only for bodies of simple form. A case of practical
importance, viz. the field between the poles of a ring-shaped
electromagnet, will be calculated approximately in another way. We
choose a special kind of ring-magnet consisting of a ring or tore of
ferromagnetic material spirally wound uniformly over its whole length
with a wire carrying a current (fig. 1). From the symmetry of the
arrangement we see that the magnetic lines of force must be circles
whose centres are on the axis of symmetry of the ring, and which lie
in planes perpendicular to that axis. If we take the integral ∮ H ds
along a circle K₁ or K₂ lying entirely outside the ring, we see that this
integral is zero, since no lines of flow are threaded by the path of
integration. On the other hand, for such a circle within the ring, each
of the N turns is threaded once, so that the integral over the entire
path is 4πNI/c. Thus the field is confined to the space within the
windings and is of amount

$$H = \frac{4\pi NI}{2\pi a c} = \frac{2NI}{ac}, \quad\text{or}\quad H = \frac{NI}{2\pi a}, \qquad (26)$$

where a is an average radius. The M.K.S. measure of the field is
especially vivid, for NI/(2πa) is merely the number of ampere-turns per
metre. If, as we shall assume, the cross-section of the ring is small
compared with its diameter, the field within the ring will be
approximately uniform. We now cut a section of thickness lₐ out of the ring.
How large is the field Hₐ in the gap thus made? If we assume that the
lines of force retain their circular form, we can compute Hₐ very easily.
Actually, a considerable dispersion of lines of force takes place; the
field in the neighbourhood of the gap is no longer confined to the ring.
If we neglect this scattering, then, Hᵢ being the field within the iron,
we have

$$\mu H_i = H_a \qquad (27)$$

on account of the continuity of the normal component of B at the 
iron-air surface. Again, if the portion of the circle within the iron is 
of length l it the line integral of the magnetic force is 

l,u, + UK = ^ + UK = — , 

/x c 


i.e. 


jj 4:7tNI/C 


(28) 


The actually measured field H a is smaller than this, on account of the 

spreading of the lines of force. The ratio H a : U a is called the coeffi- 
cient of scattering. 


For a cross-section S, the flux in the air gap is 


<D a = H a S = 


4tt Nile 
IJS + IJtiS' 


In electrical engineering, this formula is brought into formal correspondence
with Ohm’s law by calling $4\pi NI/c$ the “magnetomotive force”* and calling the
denominator the “magnetic reluctance”. Although this analogy is purely
formal, it furnishes a practical rule for computing “magnetic circuits”. If, for
example, we have several different cross-sections, we obtain the flux roughly by
using the expression for the magnetic reluctance.
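The reluctance rule can be tried numerically. The following is a minimal sketch in M.K.S. units, assuming a ring of uniform cross-section and neglecting the scattering of the lines of force; the function names and the sample dimensions are illustrative assumptions, not values from the text.

```python
import math

MU_0 = 4e-7 * math.pi  # permeability of free space, volt sec./(amp. metre)


def reluctance(length_m, area_m2, mu_r=1.0):
    """Magnetic reluctance of one section: l / (mu * mu_0 * S)."""
    return length_m / (mu_r * MU_0 * area_m2)


def gap_flux(n_turns, current_a, l_iron_m, l_gap_m, area_m2, mu_r):
    """Flux = magnetomotive force (ampere-turns) / total reluctance."""
    mmf = n_turns * current_a
    total = reluctance(l_iron_m, area_m2, mu_r) + reluctance(l_gap_m, area_m2)
    return mmf / total


# Hypothetical ring: 500 turns, 2 A, 50 cm of iron (mu = 1000), 2 mm gap.
area = 1e-4
flux = gap_flux(500, 2.0, 0.50, 0.002, area, 1000.0)
b_gap = flux / area  # induction in the gap, in volt sec./m^2
print(flux, b_gap)
```

Sections of different cross-section would simply contribute further terms to the sum of reluctances.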


4. Ponderomotive Forces on Conductors in a Magnetic Field. 

The calculation of the forces acting upon current-bearing con- 
ductors in a magnetic field is considerably 
simplified by making use of Ampere’s 
theorem concerning the equivalence of a 
magnetic shell and a current (p. 305). 
Using the formulae derived in our treat- 
ment of electrostatics, we then can give 
immediately the potential energy of the 
shell. The work done by the force is ob- 
tained by calculating the energy change 
accompanying a slight displacement, and from this we arrive at the 
force itself. We apply this method to two important cases : 

* In M.K.S. units this is merely $NI$, i.e. the number of ampere-turns.

1. The moving-coil galvanometer. A cylindrical coil of $N$ turns
is suspended in a uniform magnetic field by a torsion fibre which is a
continuation of a diameter of the coil (fig. 2). If a current $I$ flows in
the coil, the magnetic moment of the coil in air is $NIS/c$, where $S$ is
the area enclosed by each turn. The magnetic moment vector is normal
to the plane of the coil. If this moment makes an angle $\theta$ with the
field $\mathbf{H}$, then by equation (27) (p. 285) the torque acting on the
suspended system is

$$ M = \frac{NIS}{c}\,H\sin\theta. \tag{29} $$


This moment tends to turn the coil into the stable position where the
magnetic moment is parallel to $\mathbf{H}$. The torsion of the suspension
opposes this turning. If $\theta = \pi/2$ is the rest position when no
current flows, this torque is a maximum. If the displacements are small,
we can take $\sin\theta = 1$. Since the restoring moment of the fibre is
proportional to the displacement $\phi = \pi/2 - \theta$, the equilibrium
position is determined by

$$ \tau\phi = \frac{NSH}{c}\,I, \tag{29'} $$

where $\tau$ is the moment of torsion of the fibre.

Fig. 3

2. Force on a segment of a current-carrying conductor in a magnetic
field. Imagine a segment $d\mathbf{s}$ of a circuit which may be slid
back and forth (fig. 3). We seek to determine the force which the uniform
field $\mathbf{H}$ exerts on this segment.

By Ampere’s theorem and (26), p. 285, the energy of the entire circuit
is, in air ($\mu \approx 1$):

$$ U = -\frac{I}{c}\int \mathbf{H}\,d\mathbf{S} = -\frac{I}{c}\,\mathbf{H}\int d\mathbf{S} = -\frac{I}{c}\,\mathbf{H}\mathbf{S}. \tag{30} $$

If the area is slightly changed by giving $d\mathbf{s}$ a displacement
$\delta\mathbf{a}$, the energy change is

$$ \delta U = -\frac{I}{c}\,\mathbf{H}\,\delta\mathbf{S}. $$

The vector $\delta\mathbf{S}$ may be represented by the vector product
$[\delta\mathbf{a}\,d\mathbf{s}]$. If the factors are taken in this order,
the normal is obtained in the right direction.* This displacement is
accompanied by an amount of work $\mathbf{F}\,\delta\mathbf{a}$ done by the
force. This work must be equal to the decrease in energy of the field:

$$ \mathbf{F}\,\delta\mathbf{a} = -\delta U = +\frac{I}{c}\,\mathbf{H}[\delta\mathbf{a}\,d\mathbf{s}] = \frac{I}{c}\,[d\mathbf{s}\,\mathbf{H}]\,\delta\mathbf{a}, $$

so that the force itself is

$$ \mathbf{F} = \frac{I}{c}\,[d\mathbf{s}\,\mathbf{H}] = \frac{1}{c}\,[\mathbf{i}\,\mathbf{H}]\,d\tau, \tag{31} $$

where $d\tau$ is the volume of the part of the conductor of length $ds$.
This force, tending to enlarge or decrease the area of the circuit
(depending upon the direction of the current), is always present, whether
it causes the conductor to move or not. If the current in a coil is very
large, the magnetic field of the coil itself gives rise to large mechanical
forces acting on the various portions of the winding.

* The directions of the current, the field, and the ponderomotive force must
form a right-handed orthogonal system, in this order (Fleming’s left-hand rule).

By p. 305, the magnetic moment per unit area is increased by a
factor $\mu$ if a medium having a permeability of this amount replaces
the air. In this case formulae (30) and (31) must be multiplied by $\mu$,
and we have for the force

$$ \mathbf{F} = \frac{\mu}{c}\,[\mathbf{i}\,\mathbf{H}]\,d\tau, \quad\text{or}\quad \mathbf{F}_{\text{M.K.S.}} = \mu\mu_0\,[\mathbf{i}\,\mathbf{H}]\,d\tau. \tag{31'} $$
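As a small numerical illustration of the M.K.S. form of the force law, the sketch below evaluates the force on a straight segment, writing $\mathbf{i}\,d\tau = I\,d\mathbf{s}$ and $\mathbf{B} = \mu\mu_0\mathbf{H}$. The helper names and the sample values are assumptions made for the example only.

```python
def cross(a, b):
    """Vector product [a b] of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def force_on_segment(current_a, ds_m, b_tesla):
    """F = I [ds B], with B = mu * mu_0 * H (M.K.S. units, newtons)."""
    return tuple(current_a * component for component in cross(ds_m, b_tesla))


# Hypothetical case: 2 A along x through 10 cm of wire, induction 0.5 along y.
force = force_on_segment(2.0, (0.1, 0.0, 0.0), (0.0, 0.5, 0.0))
print(force)  # the force points along z, normal to both current and field
```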

Although in computing the force we may replace a circuit by a mag- 
netic shell, we must avoid setting the potential energy used here equal 
to the field energy. The reason is that something new appears when 
the conductor is moved through the field, viz. an induced electro- 
motive force (see next chapter) whose effect is to weaken the current. 
In order to maintain the former current, energy must be supplied. 
The net effect is that for a fixed current the total field energy is found 
to increase by exactly the amount by which the magnetostatic poten- 
tial energy would— according to the above calculation — decrease. 

Ex. 87. The Ballistic Galvanometer . — A current of very short duration is 
sent through the coil of a suspended-coil galvanometer at the time t = 0, the 
total quantity of electricity which flows being e. What is the maximum displace- 
ment? (Take the time integral of the force, and compute the amplitude of the 
vibration from the initial velocity thus found.)

Slowly Varying (Quasi-stationary) Fields
Definition of quasi-stationary fields

All electric and magnetic fields so far considered have been inde- 
pendent of the time. We now go a step further and consider fields which 
vary with the time, but restrict ourselves for the present to fields 
varying relatively slowly. These are of great importance. We shall 
find later that electrical fields are propagated with the velocity of 
light. If, then, we restrict ourselves to not too rapid changes, and to 
systems whose dimensions are not too great, we may assume that a 
state corresponding to instantaneous propagation holds at every point 
of the field. The relationships derived on this basis hold under all 
conditions for the range of alternating current frequencies in technical 
use (10 to 1000 cycles per second). 
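The restriction can be made concrete by comparing the size of an apparatus with the wavelength $c/f$ belonging to the frequency $f$. The sketch below is only an illustration: the criterion "dimensions at least 100 times smaller than the wavelength" is an assumed safety margin, not a figure from the text.

```python
C_LIGHT = 2.998e8  # velocity of light in metres per second


def wavelength_m(frequency_hz):
    """Distance travelled by the field during one period."""
    return C_LIGHT / frequency_hz


def is_quasi_stationary(size_m, frequency_hz, margin=100.0):
    """True if the system is small compared with the wavelength.

    `margin` is an illustrative assumption for "small".
    """
    return size_m * margin <= wavelength_m(frequency_hz)


for f in (10.0, 1000.0):
    print(f, "cycles per second ->", wavelength_m(f) / 1000.0, "km")
```

For the technical frequencies quoted above the wavelengths are hundreds to tens of thousands of kilometres, so laboratory apparatus is well within the quasi-stationary range.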

1. The Law of Induction. Maxwell’s Equations. 

Imagine a closed linear conductor in a magnetic field. Let the 
surrounding medium have a permeability $\mu$. We know from elementary
experimental physics that if the field H changes, a current is “ induced ” 
in the conductor. This means that there must be an electric field E 
within the conductor. Since we assume the conductor to be linear, 
$\mathbf{E}$ has the direction of the linear elements $d\mathbf{s}$. If we
compute the integral $\oint \mathbf{E}\,d\mathbf{s}$, say in the direction
of $\mathbf{E}$, this integral can no longer vanish, since all the
contributions to the sum are of one sign. We thus encounter a rotational
electric field for the first time. The value of
$\oint \mathbf{E}\,d\mathbf{s}$ is called the induced electromotive force,
for the following reason. If we imagine the conductor severed between two
neighbouring points $P_1$ and $P_2$, the work which must be done to move unit
charge from $P_2$ to $P_1$ against the field is

$$ -W = \oint \mathbf{E}\,d\mathbf{s}, $$

since the contribution of the open portion $P_1P_2$ is vanishingly small.
Hence there is a difference of electrical potential * of this amount
between $P_1$ and $P_2$.

* ... for example, the form of a spark passing between ...

The Law of Induction states that the induced electromotive force is equal
to $1/c$ times the decrease per second of the number of lines of magnetic
induction enclosed by the circuit.* The usual orientation of surface normal
and direction of describing the circuit is assumed. The mathematical
formulation of the Law of Induction is then

$$ \oint \mathbf{E}\,d\mathbf{s} = -\frac{1}{c}\frac{d}{dt}\int \mu\mathbf{H}\,d\mathbf{S}. \tag{1} $$

In the M.K.S. form there is a quantity on the left having the dimension
volts; on the right, amp. metre/sec. Hence there appears a new dimensional
constant, called the permeability of free space, whose numerical
value is

$$ \mu_0 = 1.256 \times 10^{-6}\ \text{volt sec./(amp. metre)}. $$

The M.K.S. form of the law is thus

$$ \oint \mathbf{E}\,d\mathbf{s} = -\frac{d}{dt}\int \mu_0\mu\mathbf{H}\,d\mathbf{S}. \tag{1a} $$


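The line integral $\oint \mathbf{E}\,d\mathbf{s}$ may be changed to a
surface integral by using Stokes’s theorem:

$$ \oint \mathbf{E}\,d\mathbf{s} = \int \operatorname{curl}\mathbf{E}\,d\mathbf{S} = -\frac{1}{c}\frac{d}{dt}\int \mu\mathbf{H}\,d\mathbf{S}. \tag{2} $$

For stationary objects, integration with respect to position and
differentiation with respect to time are independent, so that we may write

$$ \int \operatorname{curl}\mathbf{E}\,d\mathbf{S} = -\frac{1}{c}\int \frac{\partial(\mu\mathbf{H})}{\partial t}\,d\mathbf{S}. \tag{3} $$

Since this equation must be satisfied for any surface, the integrands
of the surface integrals must be equal, and we obtain the first triple
set of Maxwell's Equations of the electromagnetic field:

$$ \operatorname{curl}\mathbf{E} = -\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}. \tag{3a} $$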


In the M.K.S. system,

$$ \operatorname{curl}\mathbf{E} = -\mu\mu_0\frac{\partial\mathbf{H}}{\partial t}. \tag{3b} $$


The Maxwellian theory assumes that this induced electric field is
present even if not detectable in the form of a potential difference
between the ends of a material conductor; in other words, this set
of equations is generally valid for expressing the magnitude of the
electric field induced by a changing magnetic field.

* This statement, sometimes referred to as Neumann’s Law, is a fact of experience.
The part giving the direction of the induced electromotive force is an example of what
is known as Lenz’s Law.

Ex. 88. Compute the current induced in the coil of a moving-coil galvano- 
meter as it swings, and find the damping of the motion of the coil produced by 
this current. 


2. Self and Mutual Induction. 

(a) Calculation of the coefficient of mutual induction 

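Let there be given two circuits 1 and 2. If a current flows in circuit
1, it generates a magnetic field $\mathbf{H}_1$ whose lines of force pass
through circuit 2. If the current $I_1$ in circuit 1 changes, an
electromotive force is induced in circuit 2. Its magnitude is given by

$$ V_2 = \oint \mathbf{E}_2\,d\mathbf{s}_2 = -\frac{\mu}{c}\int \frac{\partial\mathbf{H}_1}{\partial t}\,d\mathbf{S}_2 = -\frac{\mu}{c}\frac{d}{dt}\int \mathbf{H}_1\,d\mathbf{S}_2. \tag{4} $$

If the whole of space is filled with a medium of uniform permeability
$\mu$, then according to Chap. XV, § 2 (p. 302), $\mathbf{H}_1$ is derivable
from a vector potential:

$$ \mathbf{H}_1 = \operatorname{curl}\mathbf{V}_1, \qquad \mathbf{V}_1 = \frac{1}{c}\int \frac{\mathbf{i}\,d\tau}{r} = \frac{I_1}{c}\oint \frac{d\mathbf{s}_1}{r}. $$

Hence, by Stokes’s Theorem,

$$ \int \mathbf{H}_1\,d\mathbf{S}_2 = \int (\operatorname{curl}\mathbf{V}_1)\,d\mathbf{S}_2 = \oint \mathbf{V}_1\,d\mathbf{s}_2 = \frac{I_1}{c}\oint\!\!\oint \frac{d\mathbf{s}_1\,d\mathbf{s}_2}{r_{12}}, \tag{5} $$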

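i.e. the electromotive force induced in circuit 2 becomes

$$ V_2 = -\frac{\mu}{c^2}\,\frac{dI_1}{dt}\oint\!\!\oint \frac{d\mathbf{s}_1\,d\mathbf{s}_2}{r_{12}}. \tag{6} $$

The quantity

$$ \frac{\mu}{c^2}\oint\!\!\oint \frac{d\mathbf{s}_1\,d\mathbf{s}_2}{r_{12}}, \quad\text{or}\quad \frac{\mu\mu_0}{4\pi}\oint\!\!\oint \frac{d\mathbf{s}_1\,d\mathbf{s}_2}{r_{12}} \ \text{in M.K.S. units (henry)}, $$

which depends only upon the geometric relationships of the two circuits
and upon the permeability of the intervening medium, is called
the coefficient of mutual induction, or the mutual inductance $L_{12}$. With
this notation (6) may be written

$$ V_2 = \oint \mathbf{E}_2\,d\mathbf{s}_2 = -L_{12}\,\frac{dI_1}{dt}. \tag{6'} $$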
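The double line integral in the definition of the mutual inductance lends itself to direct numerical evaluation. The sketch below does this for two coaxial circles, a geometry chosen here purely as an example; the function names, the grid size and the far-field comparison formula $2\pi^2 a_1^2 a_2^2/(a_1^2+z^2)^{3/2}$ (valid only when the second circle is small) are assumptions of the sketch.

```python
import math

MU_0 = 4e-7 * math.pi  # permeability of free space, M.K.S. units


def neumann_factor(a1, a2, z, n=200):
    """Approximate the double integral of ds1.ds2 / r12 for coaxial circles.

    a1, a2 are the radii, z the distance between the centres.
    """
    dphi = 2.0 * math.pi / n
    total = 0.0
    for j in range(n):
        p1 = j * dphi
        for k in range(n):
            p2 = k * dphi
            # scalar product of the two line elements
            dot = a1 * a2 * math.cos(p1 - p2) * dphi * dphi
            r12 = math.sqrt(a1 * a1 + a2 * a2
                            - 2.0 * a1 * a2 * math.cos(p1 - p2) + z * z)
            total += dot / r12
    return total


def mutual_inductance_mks(a1, a2, z, mu_r=1.0):
    """L12 in henry: (mu * mu_0 / 4 pi) times the geometric factor."""
    return mu_r * MU_0 / (4.0 * math.pi) * neumann_factor(a1, a2, z)


print(mutual_inductance_mks(1.0, 0.3, 0.5))
print(mutual_inductance_mks(0.3, 1.0, 0.5))  # same value with circuits exchanged
```

Exchanging the two circles leaves the result unchanged, as the symmetric form of the integrand requires.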

Since both circuits enter into the expression for $L_{12}$ symmetrically,
we obtain the same expression if we calculate the electromotive force
induced in circuit 1 by a change in the current flowing in circuit 2, i.e.

$$ L_{12} = L_{21}. \tag{7} $$

(b) Self-induction

Even if there is but one circuit, a change in current causes a change
in the magnetic flux through the circuit which, by the Law of Induction,
results in an induced electromotive force. In the same way as we
defined the coefficient of mutual induction above, we define the
coefficient of self-induction, or the self-inductance, to be a certain
positive magnitude $L_{11}$ depending only upon the geometric form of the
circuit and the permeability of the surrounding medium. The defining
equation is

$$ \oint \mathbf{E}\,d\mathbf{s} = -L_{11}\,\frac{dI}{dt}. \tag{8} $$

Here, too, we must use the negative sign, since the direction of the 
induced electrical field is such as to oppose the original current (Lenz’s 
Law). Were it the opposite way, the original current would be in- 
creased by the self-induction, hence a still greater electromotive force 
would be induced, and so on without limit. 

If we attempt the calculation of the self-inductance of a circuit 
in a way similar to the above computation of mutual inductance — i.e. 
by calculating the field at ds caused by all the other current elements, 
then computing the total induced electromotive force by integration 
over all conducting elements — the resulting formulae are found to have 
only limited application. In calculating mutual inductance we are 
permitted to assume that the conductors are linear, since the separation 
of the two circuits is taken to be large compared with their cross- 
sections. For the case of self-inductance, however, this is no longer 
the case, and we must divide the current into individual filaments 
and take into account their mutual influence. Thus the calcula- 
tion of self-inductance is generally more troublesome than that 
of mutual inductance, so that we shall omit the detailed treatment 
here. 

In the case of coils, we are mainly concerned with the magnetic
field at the position of a selected turn caused by the current in the
remaining turns; hence a division of the current into filaments is not
required. In this instance the coefficient of self-induction may be
calculated simply and to a good approximation by another method.
We start again with a ring-magnet of $N$ turns, the cross-section $S$
being small compared with the radius $a$ of the ring. The field inside
the windings is [equation (26) (p. 307)]

$$ H = \frac{4\pi N I}{2\pi a c} = \frac{4\pi N I}{l c}, $$

where $l = 2\pi a$ is the length of the ring. If the strength of the current
changes, the total magnetic flux through the $N$ turns will also change,
and this change, per second, is *

$$ \frac{d}{dt}\int \mu H\,dS = \frac{d\Phi}{dt} = \mu N S\,\frac{dH}{dt} = \mu N S\,\frac{4\pi N}{l c}\,\frac{dI}{dt}, $$

so that the induced electromotive force is

$$ V = \oint \mathbf{E}\,d\mathbf{s} = -\frac{4\pi\mu N^2 S}{l c^2}\,\frac{dI}{dt}. $$

The self-inductance of the ring-shaped coil is therefore

$$ L_{11} = \frac{4\pi\mu N^2 S}{c^2 l}, \quad\text{or}\quad \frac{\mu\mu_0 N^2 S}{l} \ \text{in M.K.S. units}. \tag{9} $$

This result holds approximately for a very long unclosed coil also. 
The approximation consists in neglecting the disturbance of the uni- 
form field caused by the ends of the coil. 
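For orientation, the M.K.S. form of the result for the ring-shaped coil can be evaluated directly. This is a minimal sketch; the function name and the sample coil (1000 turns, 1 cm² cross-section, 50 cm length, air core) are assumptions chosen for illustration.

```python
import math

MU_0 = 4e-7 * math.pi  # permeability of free space, volt sec./(amp. metre)


def ring_coil_inductance(n_turns, area_m2, length_m, mu_r=1.0):
    """Self-inductance mu * mu_0 * N^2 * S / l of a ring-shaped (or long) coil, in henry."""
    return mu_r * MU_0 * n_turns ** 2 * area_m2 / length_m


inductance = ring_coil_inductance(1000, 1e-4, 0.5)
print(inductance)  # about 2.5e-4 henry for these sample values
```

The quadratic dependence on the number of turns is the point to notice: doubling $N$ at fixed length quadruples the inductance.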


(c) Transformation of the expression for the magnetic field energy 

The energy of a system of current-bearing conductors is mainly 
that of the magnetic fields caused by the several currents. Since the 
electric fields present in such systems are small, the electrostatic energy 
is relatively negligible. With the help of the concepts of self and 
mutual induction, the expression for the magnetic energy may be 
brought into a form which is very useful for many purposes. According
to equation (7) (p. 300),

$$ U_m = \frac{1}{8\pi}\int \mu H^2\,d\tau. \tag{10} $$

We assume that space is filled with a uniform medium of permeability
$\mu$, although the final expression obtained will be valid even if the
permeability varies from place to place. $\mu\mathbf{H}$ is non-divergent
everywhere, but $\mathbf{H}$ is non-divergent only in a homogeneous medium.
We may thus write

$$ \mathbf{H} = \operatorname{curl}\mathbf{V}. $$

Now by equation (92) (p. 38),
$\mathbf{H}(\operatorname{curl}\mathbf{V}) = \mathbf{V}(\operatorname{curl}\mathbf{H}) + \operatorname{div}[\mathbf{V}\mathbf{H}]$.
Substituting this expression in equation (10), we can transform the
second integral into a surface integral over an infinite sphere. However,
by p. 302 we see that $\mathbf{V}$ decreases as $1/r$, so that
$\operatorname{curl}\mathbf{V}$ decreases as $1/r^2$, and hence this integral
vanishes. As a result, we have

$$ U_m = \frac{\mu}{8\pi}\int \mathbf{V}(\operatorname{curl}\mathbf{H})\,d\tau = \frac{\mu}{2c}\int \mathbf{V}\mathbf{i}\,d\tau. \tag{11} $$


* Here, and in the formulae following, integration implies summation over all $N$
turns.

If we have only linear conductors, which, of course, must be closed
if currents are to flow at all, (11) yields

$$ U_m = \frac{\mu}{2c}\sum_k I_k \oint \mathbf{V}\,d\mathbf{s}_k, \tag{11'} $$

where $I_k$ is the current in the $k$th conductor. If we now decompose
the vector potential at the position of circuit $k$ into the parts
$\mathbf{V}_1, \mathbf{V}_2, \ldots$ contributed by the various conductors
(including circuit $k$), then we have by equation (15) (p. 300) and the
definitions of the coefficients of induction,

$$ \oint \mathbf{V}_i\,d\mathbf{s}_k = \frac{c}{\mu}\,L_{ik} I_i \quad\text{and}\quad \oint \mathbf{V}_k\,d\mathbf{s}_k = \frac{c}{\mu}\,L_{kk} I_k. \tag{12} $$

Hence we have

$$ U_m = \tfrac{1}{2}\left(L_{11}I_1^2 + 2L_{12}I_1I_2 + \cdots + 2L_{1n}I_1I_n + L_{22}I_2^2 + 2L_{23}I_2I_3 + \cdots + L_{nn}I_n^2\right). \tag{13} $$
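The energy expression is a quadratic form in the currents, which makes it easy to evaluate for any number of circuits. The following sketch assumes the coefficients are supplied as a symmetric matrix (nested lists); the function name and the numerical values are illustrative only.

```python
def magnetic_energy(inductances, currents):
    """U_m = 1/2 * sum over i, k of L_ik * I_i * I_k.

    `inductances` is a symmetric n x n nested list, `currents` a list of n values.
    """
    n = len(currents)
    total = 0.0
    for i in range(n):
        for k in range(n):
            total += inductances[i][k] * currents[i] * currents[k]
    return 0.5 * total


# Hypothetical pair of circuits: L11 = 2, L22 = 3, L12 = L21 = 1 (arbitrary units).
L = [[2.0, 1.0],
     [1.0, 3.0]]
energy = magnetic_energy(L, [1.0, 2.0])
print(energy)  # 1/2 * (2*1 + 2*1*1*2 + 3*4) = 9.0
```

Summing over both index orders reproduces the factor 2 in front of each mixed term of the written-out expression.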


Ex. 89. A charge $-e$ revolves about a fixed charge $+e$ in a circle of radius
$a$. How much energy is absorbed by this system when a magnetic field $H$ is
applied, the angle between the field and the normal to the orbit being $\theta$?
(To simplify the calculation, assume that $\partial H/\partial t = \text{const.}$)


Ex. 90. (a) Calculate the mutual inductance of two coaxial circles of wire,
radii $a_1$ and $a_2$, whose centres are at a distance $z$ apart.

(b) Show that for two conductors $L_{11}L_{22} > L_{12}^2$ always.


3. Stationary Alternating Current Circuits. 

With the help of complex numbers we can derive relationships for
the stationary state of uniform periodicity attained by circuits to
which periodically alternating potentials are applied. The steady
state referred to is that arrived at some time after the initial
application of the electromotive force. These relationships permit us to
extend formally to alternating current circuits the laws of Ohm and
of Kirchhoff derived above for direct currents.

Let us consider a circuit consisting of a source of periodically
alternating electromotive force $V$, say a dynamo, connected across
a conductor of ohmic (steady current) resistance $R$ and self-inductance
$L$. In addition to the applied E.M.F., there is also that due to the
self-induction of the circuit, so that

$$ V - L\,\frac{dI}{dt} = RI. \tag{14} $$

If $R$ may be neglected, we have

$$ V = L\,\frac{dI}{dt}. \tag{14'} $$

Assuming the phenomenon to be strictly sinusoidal,

$$ V = V_0 e^{i\omega t}, \qquad I = I_0 e^{i\omega t}. $$

Hence

$$ \frac{dI}{dt} = i\omega I_0 e^{i\omega t} = i\omega I, $$

and so

$$ V = I\,i\omega L. \tag{16} $$

Comparing this expression with Ohm’s law, we see that a formal
correspondence is obtained by setting the alternating-current resistance
caused by self-induction (the so-called reactance of the circuit)
equal to $i\omega L$. What is the physical meaning of this imaginary
resistance of self-induction? It has been found of great advantage to
represent alternating current phenomena by means of the vector
diagram (cf. p. 52). If we place the “vector” of the current along
the positive real axis of such a diagram, (16) shows that the vector
for the electromotive force lies in the positive imaginary axis. That
is, in a circuit containing only an alternator and a pure inductance,
the electromotive force leads the current by a phase angle $\pi/2$. For
a given applied potential, the strength (amplitude) of the current
is given by (16) as

$$ I_0 = \frac{V_0}{\omega L}. $$



Suppose, now, that the inductive portion of the circuit is replaced
by a capacity (condenser) $C$. If the applied E.M.F. were constant,
then after the initial charging current no current would flow, but on
account of the periodic charging and discharging of the condenser,
an alternating current results. Here again we obtain an imaginary
resistance due to the presence of the condenser. Since the current $I$
is the quantity of electricity flowing through a cross-section of the
conductor in unit time, the charge on the condenser is given by the
time integral $\int I\,dt$, if we start to count time at the instant when the
condenser has zero charge. Hence the potential difference of the two
sides of the condenser which, neglecting the resistance of the
connexions, is equal to the electromotive force of the generator, has the
value

$$ V = \frac{1}{C}\int I\,dt, \quad\text{so that}\quad \frac{dV}{dt} = \frac{I}{C}. \tag{18} $$

For a purely periodic applied potential we have, since $dV/dt = i\omega V$,

$$ V = \frac{I}{i\omega C}. \tag{19} $$

Hence the alternating current resistance offered by a capacity $C$ is
given by $1/(i\omega C)$. In this case, since $i$ in the denominator is
equivalent to $-i$ in the numerator, the applied electromotive force lags
behind the current by a phase angle $\pi/2$.

By using these virtual resistances of self- 
induction and capacity we can treat any 
alternating current circuit like a direct cur- 
rent circuit. After completing the numerical 
computations, it is only necessary to draw 
the index diagram for current and electro- 
motive force; this gives the amplitude and 
phase relationships between these quantities. 

As an example, let us take a circuit containing ohmic resistance,
self-induction and capacity connected to an alternator. By Kirchhoff’s
first law (p. 294),

$$ V = V_R + V_L + V_C = I\left(R + i\omega L + \frac{1}{i\omega C}\right). \tag{20} $$

If we take $I$ along the positive real axis in the index diagram, $V_L$ is
along the positive imaginary axis and $V_C$ along the negative imaginary
axis. According to fig. 1, the resultant electromotive force is of
magnitude

$$ |V| = |I|\sqrt{R^2 + \left(\omega L - \frac{1}{\omega C}\right)^2}. \tag{21} $$

This electromotive force leads the current by an angle $\phi$, so that we
have $V = V_0 e^{i\omega t}$ and $I = I_0 e^{i(\omega t - \phi)}$, where

$$ \tan\phi = \frac{\omega L - \dfrac{1}{\omega C}}{R}. $$

For an applied electromotive force of given amplitude $V_0$, the
amplitude of the current is

$$ I_0 = \frac{V_0}{\sqrt{R^2 + \left(\omega L - \dfrac{1}{\omega C}\right)^2}}. $$

Taking only the amplitude into consideration, the system has an
apparent resistance of magnitude

$$ \sqrt{R^2 + \left(\omega L - \frac{1}{\omega C}\right)^2}. $$

This quantity is called the impedance of the circuit. It has a minimum
value when

$$ \omega L = \frac{1}{\omega C}, \quad\text{or}\quad \omega^2 = \frac{1}{LC}. $$

In this case there is no difference of phase between $V$ and $I$. This
condition corresponds to resonance, and we shall find, in the next
section, that the frequency of the applied potential in this case is the
same as the natural frequency of the circuit.
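The complex "resistances" translate directly into complex arithmetic. The sketch below evaluates the series combination $R + i\omega L + 1/(i\omega C)$ and reads off the impedance and the phase angle; the function names and the component values are assumptions chosen so that resonance falls at $\omega = 1000$ per second.

```python
import cmath
import math


def series_impedance(R, L, C, omega):
    """Complex resistance R + i*omega*L + 1/(i*omega*C) of a series circuit."""
    return R + 1j * omega * L + 1.0 / (1j * omega * C)


def resonance_frequency(L, C):
    """Angular frequency at which omega*L = 1/(omega*C)."""
    return 1.0 / math.sqrt(L * C)


R, L, C = 10.0, 0.1, 1e-5  # ohm, henry, farad (illustrative values)
omega_0 = resonance_frequency(L, C)
for omega in (0.5 * omega_0, omega_0, 2.0 * omega_0):
    Z = series_impedance(R, L, C, omega)
    # abs(Z) is the impedance, cmath.phase(Z) the angle by which V leads I
    print(omega, abs(Z), cmath.phase(Z))
```

Below resonance the phase is negative (capacitive behaviour, the electromotive force lags), above it positive (inductive behaviour), and at resonance the impedance reduces to $R$.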

In order to calculate the power, i.e. the amount of electrical work
done per second, it is convenient to turn to the real part of the equations
and write

$$ V = V_0\cos\omega t, \qquad I = I_0\cos(\omega t - \phi). \tag{25} $$

The power $W_t$ at a given instant $t$ is

$$ W_t = VI = V_0 I_0 \cos\omega t\,\cos(\omega t - \phi). \tag{26} $$

It is more important to know the time average of the power, $\overline{W}$.
We need only calculate the average over one period. Thus

$$ \frac{\omega}{2\pi}\int_0^{2\pi/\omega} \cos\omega t\,\cos(\omega t - \phi)\,dt = \frac{\omega}{2\pi}\int_0^{2\pi/\omega} (\cos\omega t\,\cos\omega t\,\cos\phi + \cos\omega t\,\sin\omega t\,\sin\phi)\,dt = \tfrac{1}{2}\cos\phi. $$

Hence the average power is

$$ \overline{W} = \tfrac{1}{2} V_0 I_0 \cos\phi. \tag{28} $$

The expression $\cos\phi$ is known as the power factor of the circuit. If the
impedance is purely inductive and capacitive ($R = 0$), $\phi = \pm\pi/2$, and
no power is consumed in the circuit. In this case the current is said
to be wattless. This shows why it is better to control alternating
currents by choke coils (inductances) rather than by ohmic resistances.

The analogy between equation (28) and the expression for mechanical 
work (force into distance into cosine of the included angle), which 
becomes especially apparent with the use of the vector diagram has 
no deeper significance and is of mnemonic value only. 
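The time average over one period can be checked numerically against the closed form $\tfrac{1}{2}V_0 I_0\cos\phi$. The following sketch samples the instantaneous power at equal intervals; the function names, the number of samples and the amplitudes are assumptions for the example.

```python
import math


def average_power_numeric(v0, i0, phi, omega=1.0, n=1000):
    """Average of V0*cos(wt) * I0*cos(wt - phi) over one period, by sampling."""
    period = 2.0 * math.pi / omega
    dt = period / n
    total = 0.0
    for k in range(n):
        t = k * dt
        total += v0 * math.cos(omega * t) * i0 * math.cos(omega * t - phi)
    return total / n


def average_power_formula(v0, i0, phi):
    """Closed form: 1/2 * V0 * I0 * cos(phi)."""
    return 0.5 * v0 * i0 * math.cos(phi)


print(average_power_numeric(230.0, 5.0, 0.7), average_power_formula(230.0, 5.0, 0.7))
print(average_power_numeric(230.0, 5.0, math.pi / 2))  # wattless current: practically zero
```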


Ex. 91. Calculate the impedance of an inductance and a capacity connected
in parallel.

Ex. 92. Show that an oscillating circuit containing an iron-cored coil as
inductance is a non-harmonic system to which the results of p. 101 may be
applied.


4. Non-stationary States (Transient Phenomena) in Alternating Current
Circuits.

The relationships between current and electromotive force in
alternating current circuits which we have derived hold only for
the steady state which sets in after some time. The results of § 3 are
not applicable to the transient conditions
immediately following the completion of the circuit, for these
phenomena depend upon the initial conditions. Since the results obtained
above place no constants of integration at our disposal, we have no
way of satisfying the initial conditions. Mathematically, the solutions
corresponding to the steady state are particular integrals of the
differential equation, and we have now to find the general solution.
We shall treat four different systems, beginning with the simplest and
proceeding to the more complicated.

(a) Free oscillations of a circuit containing only inductance
and capacity

Assume that the two plates of a charged condenser are joined by a
conductor of negligible resistance. We must insert the expressions
for the potentials due to self-induction and capacity (§ 3), in equation
(20), p. 318, in the form of derivatives and integrals:

$$ L\,\frac{dI}{dt} + \frac{1}{C}\int I\,dt = 0. \tag{29} $$

Differentiation yields the familiar equation of simple harmonic
vibrations

$$ L\,\frac{d^2 I}{dt^2} + \frac{I}{C} = 0, \tag{30} $$

whose integral is

$$ I = a e^{i\omega_0 t} + b e^{-i\omega_0 t}, \tag{31} $$

where

$$ \omega_0 = \frac{1}{\sqrt{LC}}. \tag{32} $$

Using real functions, (31) may be written

$$ I = A\cos\omega_0 t + B\sin\omega_0 t \quad\text{or}\quad I = I_0\cos(\omega_0 t - \phi). \tag{31'} $$


(b) Free oscillations of a circuit containing inductance, capacity
and resistance. Damping

Here we have a term $RI$ to be added to the left member of (29).
Differentiating as above, we find

$$ L\,\frac{d^2 I}{dt^2} + R\,\frac{dI}{dt} + \frac{1}{C}\,I = 0. \tag{33} $$

This equation, too, is known from mechanics [Chap. V, § 8 (p. 95)].
The real form of the general solution obtained there is, for
$\dfrac{1}{LC} > \dfrac{R^2}{4L^2}$,

$$ I = A e^{-Rt/2L}\cos(\omega_0' t - \phi). \tag{34} $$

The amplitude of the vibration decreases exponentially as the time
increases, and the vibration is therefore said to be damped. The
frequency of the characteristic oscillation of the damped system,
$\omega_0'$, is smaller than that of the undamped circuit:

$$ \omega_0' = \sqrt{\frac{1}{LC} - \frac{R^2}{4L^2}} < \omega_0 = \frac{1}{\sqrt{LC}}. $$


The ratio of two successive amplitudes, corresponding to one period,
is constant, and is called the decrement. One usually deals with the
natural logarithm of this quantity, the logarithmic decrement $\delta$
(cf. p. 96):

$$ \delta = \log\frac{e^{-\frac{R}{2L}t}}{e^{-\frac{R}{2L}(t+T)}} = \frac{RT}{2L} = \frac{R}{2L}\,\frac{2\pi}{\omega_0'}, $$

where $T$ is the period. If the damping is not too great we may here
use the frequency of the undamped vibration in place of the actual
frequency of the damped system, in which case the logarithmic
decrement becomes

$$ \delta = \frac{R}{2L}\,2\pi\sqrt{LC} = \pi R\sqrt{\frac{C}{L}}. $$

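On the other hand, if

$$ \frac{1}{LC} < \frac{R^2}{4L^2}, $$

there is no imaginary part, and the solution is a purely exponential
decreasing function

$$ I = A\exp\left[-\left(\frac{R}{2L} - \sqrt{\frac{R^2}{4L^2} - \frac{1}{LC}}\right)t\right] + B\exp\left[-\left(\frac{R}{2L} + \sqrt{\frac{R^2}{4L^2} - \frac{1}{LC}}\right)t\right], $$

an aperiodic system. The limiting case $1/LC = R^2/4L^2$ is also aperiodic;
it corresponds to the mechanical situation treated on p. 95.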
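The distinction between the damped oscillatory case and the aperiodic case, together with the logarithmic decrement, can be summarized in a few lines of code. This is a sketch under the assumptions of this section (a single series circuit); the function names and sample values are illustrative.

```python
import math


def damped_frequency(L, R, C):
    """Angular frequency of the damped circuit, or None if the circuit is aperiodic."""
    radicand = 1.0 / (L * C) - R * R / (4.0 * L * L)
    if radicand <= 0.0:
        return None  # aperiodic (including the limiting case)
    return math.sqrt(radicand)


def log_decrement(L, R, C):
    """Logarithmic decrement (R / 2L) * (2 pi / damped frequency)."""
    omega_d = damped_frequency(L, R, C)
    if omega_d is None:
        raise ValueError("circuit is aperiodic; no decrement is defined")
    return R / (2.0 * L) * 2.0 * math.pi / omega_d


L_h, R_ohm, C_f = 1.0, 0.2, 1.0  # illustrative values in consistent units
delta = log_decrement(L_h, R_ohm, C_f)
approx = math.pi * R_ohm * math.sqrt(C_f / L_h)  # weak-damping approximation
print(delta, approx)
print(damped_frequency(1.0, 3.0, 1.0))  # None: resistance too large, aperiodic
```

For weak damping the exact decrement and the approximation $\pi R\sqrt{C/L}$ differ only slightly, which is the justification for using the undamped frequency.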


(c) Damped circuit with periodic applied electromotive force

The differential equation is

$$ L\,\frac{dI}{dt} + RI + \frac{1}{C}\int I\,dt = V_0 e^{i\omega t}. \tag{39} $$

Differentiating with respect to the time,

$$ L\,\frac{d^2 I}{dt^2} + R\,\frac{dI}{dt} + \frac{1}{C}\,I = i\omega V_0 e^{i\omega t}. \tag{40} $$

According to the theorem on p. 92, the general solution of this
non-homogeneous equation is given by the sum of the general solution of
the corresponding homogeneous equation, i.e. equation (33) above,
and a particular integral of the non-homogeneous equation. But we
already know a particular integral, viz. that corresponding to the
steady state, which was derived in § 3 (p. 318). For

$$ \frac{1}{LC} > \frac{R^2}{4L^2}, $$

the complete general solution is

$$ I = e^{-\frac{R}{2L}t}\left(A\cos\omega_0' t + B\sin\omega_0' t\right) + \frac{V_0\cos(\omega t - \phi)}{\sqrt{R^2 + \left(L\omega - \dfrac{1}{\omega C}\right)^2}}, $$

where

$$ \tan\phi = \frac{L\omega - \dfrac{1}{\omega C}}{R}. \tag{41} $$


The part of this equation containing the exponential factor represents
an oscillation having the angular frequency $\omega_0'$ of the characteristic
oscillation of the free damped circuit; the other term is an undamped
periodic function having the angular frequency $\omega$ of the applied
electromotive force. At the beginning, the resultant vibration is a
complicated oscillation which, however, soon attains the steady state
given by the last term, the transient part being negligible after a
short time.
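The decay of the transient can be watched in a direct numerical integration. The sketch below integrates the real equation $L\,dI/dt + RI + q/C = V_0\cos\omega t$, $dq/dt = I$, starting from rest, with a classical fourth-order Runge-Kutta step; the integrator, the step size and the parameter values are assumptions of this example and not part of the original treatment.

```python
import math


def simulate_driven_circuit(L, R, C, V0, omega, t_end, dt=1e-3):
    """Integrate L dI/dt + R I + q/C = V0 cos(omega t), dq/dt = I, from q = I = 0."""
    def deriv(t, q, i):
        return i, (V0 * math.cos(omega * t) - R * i - q / C) / L

    t, q, i = 0.0, 0.0, 0.0
    history = [(t, i)]
    for _ in range(int(round(t_end / dt))):
        k1q, k1i = deriv(t, q, i)
        k2q, k2i = deriv(t + dt / 2, q + dt / 2 * k1q, i + dt / 2 * k1i)
        k3q, k3i = deriv(t + dt / 2, q + dt / 2 * k2q, i + dt / 2 * k2i)
        k4q, k4i = deriv(t + dt, q + dt * k3q, i + dt * k3i)
        q += dt / 6 * (k1q + 2 * k2q + 2 * k3q + k4q)
        i += dt / 6 * (k1i + 2 * k2i + 2 * k3i + k4i)
        t += dt
        history.append((t, i))
    return history


L_h, R_ohm, C_f, V0, omega = 1.0, 1.0, 1.0, 1.0, 2.0  # illustrative values
history = simulate_driven_circuit(L_h, R_ohm, C_f, V0, omega, t_end=40.0)
n_last = int(round((2.0 * math.pi / omega) / 1e-3))  # samples in the final period
peak = max(abs(i) for _, i in history[-n_last:])
steady = V0 / math.sqrt(R_ohm ** 2 + (omega * L_h - 1.0 / (omega * C_f)) ** 2)
print(peak, steady)  # after the transient has died out the two agree closely
```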

(d) Inductively coupled free circuits 

Of the many possible arrangements of coupled circuits which are 
of technical importance we select the simplest case for treatment here. 
Let us consider two circuits which are inductively coupled. Let the
self-inductances be $L_{11}$ and $L_{22}$ respectively, and let the mutual
inductance be $L_{12}$. Here, again, we shall carry the calculations only far
enough to reveal the new principles involved in this type of coupled system.

The differential equation for the first circuit becomes after
differentiating once with respect to the time:

$$ L_{11}\frac{d^2 I_1}{dt^2} + L_{12}\frac{d^2 I_2}{dt^2} + \frac{I_1}{C_1} = 0. \tag{42} $$

That of the second circuit becomes

$$ L_{12}\frac{d^2 I_1}{dt^2} + L_{22}\frac{d^2 I_2}{dt^2} + \frac{I_2}{C_2} = 0. \tag{43} $$

In order to eliminate $I_2$, we differentiate equation (43) twice with
respect to $t$, obtaining

$$ L_{12}\frac{d^4 I_1}{dt^4} + L_{22}\frac{d^4 I_2}{dt^4} + \frac{1}{C_2}\frac{d^2 I_2}{dt^2} = 0. \tag{44} $$

But from (42),

$$ \frac{d^2 I_2}{dt^2} = -\frac{1}{L_{12}}\left(L_{11}\frac{d^2 I_1}{dt^2} + \frac{I_1}{C_1}\right), \tag{45} $$

and after differentiating twice,

$$ \frac{d^4 I_2}{dt^4} = -\frac{1}{L_{12}}\left(L_{11}\frac{d^4 I_1}{dt^4} + \frac{1}{C_1}\frac{d^2 I_1}{dt^2}\right). \tag{46} $$

Substituting these values in equation (44), and using the abbreviations

$$ a = L_{11}L_{22} - L_{12}^2, \qquad b = \frac{L_{11}}{C_2} + \frac{L_{22}}{C_1}, \qquad c = \frac{1}{C_1 C_2}, \tag{47} $$

we obtain the fourth-order differential equation

$$ a\,\frac{d^4 I_1}{dt^4} + b\,\frac{d^2 I_1}{dt^2} + c\,I_1 = 0. \tag{48} $$

We assume a solution of the form

$$ I_1 = A e^{\lambda t} \tag{49} $$

and obtain the characteristic equation

$$ a\lambda^4 + b\lambda^2 + c = 0. \tag{50} $$

Of the four roots, two are essentially different:

$$ \lambda_1 = i\sqrt{\frac{b}{2a} + \sqrt{\frac{b^2}{4a^2} - \frac{c}{a}}} = i\omega_1, \tag{51} $$

$$ \lambda_2 = i\sqrt{\frac{b}{2a} - \sqrt{\frac{b^2}{4a^2} - \frac{c}{a}}} = i\omega_2. \tag{52} $$

The other two roots differ from these only in sign. According to
Ex. 90 (p. 316), $a$ is always positive, and $b$ and $c$ are essentially
positive, while $b^2 - 4ac$ is a sum of squares. Hence, since
$L_{12} \neq 0$, we always have

$$ 0 < \frac{b^2}{4a^2} - \frac{c}{a} < \frac{b^2}{4a^2}; \tag{53} $$

i.e. the frequencies $\omega_1$ and $\omega_2$ are always real. By
eliminating $d^2 I_1/dt^2$ between (42) and (43) we see that there exist two
simultaneous frequencies $\omega_1$ and $\omega_2$ for $I_2$ as well as for
$I_1$. Neither of these coincides with the natural frequencies of the
separate circuits. The solution for $I_1$ thus has the form

$$ I_1 = A e^{i\omega_1 t} + B e^{i\omega_2 t}. \tag{54} $$

The co-existence of two vibrations of different frequency implies the
occurrence of beats, i.e. the appearance of slow variations of amplitude
as may be seen at once from the following trigonometric identity:

$$ \cos\omega_1 t + \cos\omega_2 t = 2\cos\left(\frac{\omega_1 - \omega_2}{2}\,t\right)\cos\left(\frac{\omega_1 + \omega_2}{2}\,t\right). $$

The right member represents a vibration of angular frequency
$(\omega_1 + \omega_2)/2$ whose amplitude is “modulated” (cf. p. 57) with an
angular frequency $(\omega_1 - \omega_2)/2$. Since the circuits are assumed to be
undamped, the energy of circuit 1 cannot be destroyed when its 
amplitude is reduced to zero, but must be transmitted to the other 
circuit. The beats occurring in the two circuits must therefore have 
a phase difference of $\pi/2$, so that minimum amplitude in the first
circuit corresponds to maximum amplitude in the second, and the 
sum of the energies (squares of the amplitudes) remains constant. 
This is actually confirmed by the complete calculation, which will 
not be gone into here. 
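The two frequencies of the coupled system follow from the characteristic equation $a\lambda^4 + b\lambda^2 + c = 0$ with $\lambda = i\omega$. The sketch below evaluates them for given circuit constants; the function name and the sample values (two identical circuits with $L_{12}$ equal to half the self-inductance) are assumptions made for illustration.

```python
import math


def coupled_frequencies(L11, L22, L12, C1, C2):
    """Return (omega_1, omega_2), the two angular frequencies of the coupled circuits."""
    a = L11 * L22 - L12 ** 2
    b = L11 / C2 + L22 / C1
    c = 1.0 / (C1 * C2)
    root = math.sqrt(b * b - 4.0 * a * c)  # real, since b^2 - 4ac is a sum of squares
    omega_1 = math.sqrt((b + root) / (2.0 * a))
    omega_2 = math.sqrt((b - root) / (2.0 * a))
    return omega_1, omega_2


w1, w2 = coupled_frequencies(1.0, 1.0, 0.5, 1.0, 1.0)
print(w1, w2)                 # neither equals the uncoupled value 1/sqrt(LC) = 1
print((w1 - w2) / 2.0)        # angular frequency of the beat modulation
```

For two identical circuits the result reduces to $\omega^2 = 1/[(L \mp L_{12})C]$, which is a convenient check on the general expression.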


5. Resistance and Inductance of Wires for Alternating Currents. Skin 
Effect. 

All low-frequency electromagnetic phenomena in homogeneous 
media are determined by the Law of Induction (p. 312), together 
with equation (9) (p. 301 ), which was derived for steady currents, and 
expresses the connexion between current and magnetic field. If several 
different media are present, the boundary conditions of the continuity 
of the tangential components of E and H must also be taken into 
account. In view of the meaning of the curl, the above two equations 
really represent a system of six first-order partial differential equa- 
tions for the six components of E and H. Of the enormous number 
of solutions of such a system, we must select that which corresponds 
to the problem under consideration. This selection, and not the cal- 
culation of the integrals, is the difficult matter. We consider the 
following very simple case. Let there be given a very long, straight 
wire of finite cross-section to whose ends an alternating difference of 
potential is applied. For direct current, the current density would 
be the same at all points within the wire, provided the conductor is 
made of homogeneous material. For alternating current, the current 
density is greater near the surface of the wire, on account of inductive 
effects. When the frequency is very great, the current is almost entirely 
confined to the outer layers. This phenomenon is known as the skin 

effect.

The dependence of the current density upon the distance $\rho$ from
the axis may be calculated with little difficulty from the two
differential equations:

$$ \operatorname{curl}\mathbf{H} = \frac{4\pi}{c}\,\mathbf{i}; \qquad \operatorname{curl}\mathbf{E} = \frac{1}{\sigma}\operatorname{curl}\mathbf{i} = -\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t}. \tag{55} $$

H may be eliminated as follows. Take the curl of the second equation 
and differentiate the first with respect to t. Since we are dealing with 
bodies at rest, the derivatives with respect to the co-ordinates (in 
taking the curl) and those with respect to the time are entirely inde- 
pendent. Hence 

$$\frac{\partial}{\partial t}\operatorname{curl}\mathbf{H} = \operatorname{curl}\frac{\partial\mathbf{H}}{\partial t}.$$

If we eliminate this term from the two equations obtained, we find 

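$$\operatorname{curl}\operatorname{curl}\mathbf{i} = \operatorname{grad}\operatorname{div}\mathbf{i} - \Delta\mathbf{i} = -\frac{4\pi\sigma\mu}{c^{2}}\frac{\partial\mathbf{i}}{\partial t}.$$

Since div i = 0, this equation reduces to 

$$\Delta\mathbf{i} = \frac{4\pi\sigma\mu}{c^{2}}\frac{\partial\mathbf{i}}{\partial t}. \tag{56}$$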

On account of symmetry, the vector i in the infinitely long wire has 
the direction of the axis of the wire. We take this to be the z-axis. 
Then we may write 

$$\mathbf{i} = u(\rho, t)\,\mathbf{k}. \tag{57}$$

We thus obtain a scalar equation which has the following form in 
cylindrical co-ordinates: 

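$$\frac{\partial^{2}u}{\partial\rho^{2}} + \frac{1}{\rho}\frac{\partial u}{\partial\rho} = \frac{4\pi\sigma\mu}{c^{2}}\frac{\partial u}{\partial t}. \tag{58}$$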

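We are looking for a solution u which is a periodic function of the 
time, and so we put 

$$u = f(\rho)\,e^{i\omega t} \tag{59}$$

and obtain the following ordinary differential equation for f: 

$$\frac{d^{2}f}{d\rho^{2}} + \frac{1}{\rho}\frac{df}{d\rho} - \frac{4\pi\sigma\mu i\omega}{c^{2}}\,f = 0. \tag{60}$$

In this equation we set 

$$-\frac{4\pi\sigma\mu i\omega}{c^{2}} = \alpha^{2} \tag{61}$$

and introduce a new variable $x = \alpha\rho$. Then 

$$\frac{df}{d\rho} = \frac{df}{dx}\,\alpha, \quad\text{etc.,} \tag{62}$$

and we obtain 

$$\frac{d^{2}f}{dx^{2}} + \frac{1}{x}\frac{df}{dx} + f = 0. \tag{63}$$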

But this is the differential equation for the Bessel functions of order 
zero.* The only one of these functions of use here, the one continuous 
at the origin, is denoted by $J_0(x)$. Hence 

$$u = A\,J_0(\alpha\rho)\,e^{i\omega t} \tag{64}$$

$$\phantom{u} = A\,J_0\!\left(\sqrt{-i}\,\frac{\sqrt{4\pi\sigma\mu\omega}}{c}\,\rho\right)e^{i\omega t}, \tag{65}$$


where A is the amplitude factor. 

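The real part and the imaginary part of the function $J_0(x\sqrt{-i})$ 
occur in various problems and are tabulated as new functions of x. 
They are known as “ber (x)” and “bei (x)” respectively: 

$$\operatorname{Re}\left[J_0\!\left(x\sqrt{-i}\right)\right] = \operatorname{ber}x, \qquad \operatorname{Im}\left[J_0\!\left(x\sqrt{-i}\right)\right] = \operatorname{bei}x.$$

Using these functions, our solution may be written 

$$u = A\left[\operatorname{ber}\!\left(\frac{\sqrt{4\pi\sigma\mu\omega}}{c}\,\rho\right) + i\,\operatorname{bei}\!\left(\frac{\sqrt{4\pi\sigma\mu\omega}}{c}\,\rho\right)\right]e^{i\omega t} \tag{66}$$

or 

$$u = A\,\sqrt{\operatorname{ber}^{2}\!\left(\frac{\sqrt{4\pi\sigma\mu\omega}}{c}\,\rho\right) + \operatorname{bei}^{2}\!\left(\frac{\sqrt{4\pi\sigma\mu\omega}}{c}\,\rho\right)}\;e^{i(\omega t + \varphi)}, \tag{66'}$$

where 

$$\tan\varphi = \frac{\operatorname{bei}\!\left(\sqrt{4\pi\sigma\mu\omega}\,\rho/c\right)}{\operatorname{ber}\!\left(\sqrt{4\pi\sigma\mu\omega}\,\rho/c\right)}.$$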


That is, not only the amplitude, but also the phase depends upon the 
radius. For large values of μ (iron wire) both effects can be shown 
even for currents of very low frequency (1 cycle/sec.). One can obtain 
a picture of the process by drawing the vector for the current density 
corresponding to various values of ρ (fig. 2). In order to determine 
the amplitude factor A we have to integrate i over the cross-section, 
which gives the total current I as a function of A, so that A in turn 
may be evaluated in terms of I. 

Fig. 2 
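The radial dependence contained in (66) and (66') can be illustrated numerically. The following Python sketch is not part of the original text; the function names and the dimensionless surface argument `x_a` (standing for $a\sqrt{4\pi\sigma\mu\omega}/c$, with a the radius of the wire) are assumptions introduced for the example. It sums the power series of $J_0$ for a complex argument, takes ber and bei as its real and imaginary parts, and returns the amplitude and phase of the current density relative to their values at the surface.

```python
import cmath
import math


def bessel_j0(z, terms=60):
    """Power series J0(z) = sum_k (-1)^k (z^2/4)^k / (k!)^2 for complex z."""
    total = 0j
    term = 1 + 0j
    q = -(z * z) / 4
    for k in range(terms):
        total += term
        term *= q / ((k + 1) ** 2)
    return total


def ber_bei(x):
    """Return (ber x, bei x), the real and imaginary parts of J0(x*sqrt(-i))."""
    j0 = bessel_j0(x * cmath.exp(-1j * math.pi / 4))
    return j0.real, j0.imag


def relative_current_density(rho_over_a, x_a):
    """Amplitude and phase of u at radius rho, relative to the surface rho = a.

    x_a is the (assumed) dimensionless argument a*sqrt(4*pi*sigma*mu*omega)/c.
    """
    ber_s, bei_s = ber_bei(x_a)
    ber_r, bei_r = ber_bei(x_a * rho_over_a)
    amplitude = math.hypot(ber_r, bei_r) / math.hypot(ber_s, bei_s)
    phase = math.atan2(bei_r, ber_r) - math.atan2(bei_s, ber_s)
    return amplitude, phase


if __name__ == "__main__":
    for fraction in (0.0, 0.5, 1.0):
        print(fraction, relative_current_density(fraction, 5.0))
```

For a surface argument of 5 the amplitude on the axis comes out as a small fraction of its surface value, which is the crowding of the current toward the outer layers described above. The plain power series is adequate for moderate arguments only; for very large arguments an asymptotic form would be preferable.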


* See p. 810 seq. 


CHAPTER XVII 


Rapidly Alternating Electromagnetic Fields: 
I. Propagation in Homogeneous Isotropic Media 


1. The Electrical Analogue of the Law of Induction. 

The Law of Induction states that, in a vacuum, any change of the 
magnetic flux through a surface bounded by a curve results in the 
formation of an electrical “vortex”, i.e. a finite value of the line 
integral $\oint \mathbf{E}\,d\mathbf{s}$. The reciprocal relationships between electrical and 
magnetic phenomena already encountered many times would lead 
us to expect that a change in electrical flux similarly would cause a 
magnetic “vortex”. Thus far we have encountered magnetic curl 
only when conduction currents were present. We write as a tentative 
form of relationship, for a vacuum, 

$$\operatorname{curl}\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t}, \quad\text{or (M.K.S.)}\quad \operatorname{curl}\mathbf{H} = K_0\frac{\partial\mathbf{E}}{\partial t}. \tag{1}$$


This equation was derived by Maxwell from the notion that the volume 
elements of the “ ether ” are polarized when a field is applied, just as 
are those of a material dielectric (p. 279). It may be seen readily that 
electric charges are displaced when a volume element is polarized, and 
hence that a current flows while the polarization is being brought 
about. 

Imagine a small cylinder of unit cross-section and altitude dh in 
which a polarization P is generated parallel to the axis. The apparent 
charges on the transverse surfaces of the cylinder amount to 
$\sigma_+ = +|\mathbf{P}|$ on the upper surface and $\sigma_- = -|\mathbf{P}|$ on the lower, 
according to p. 282. If P changes by dP, the density of charge is 
altered by $+|d\mathbf{P}|$ and $-|d\mathbf{P}|$ respectively. If this charge is to reach 
the bounding surface it must pass through the cross-section 
immediately below or above. Hence a positive current dP/dt flows in 
the upper half and a negative current flows in the lower half, but in 
the opposite direction, so that a “displacement current” of 

$$\frac{1}{c}\frac{\partial\mathbf{P}}{\partial t}$$

electromagnetic units flows during the time the polarization is 
changing. But Maxwell's concept of a polarizable ether built up of 
electric charges is no longer tenable, and we are justified in assuming 
the existence of a true displacement current only in the case of 
material dielectrics. Hence we have thus far no rigorous basis for the 
term 

$$\frac{1}{c}\frac{\partial\mathbf{E}}{\partial t},$$


especially with regard to the sign. We shall therefore justify it for the 
present only by noting that all its consequences are confirmed by ex- 
perience. Later on we shall see how the field equations may be de- 
duced by a method which is based on very few hypotheses. Hence in 
a conducting dielectric medium we have three contributions to the 
current producing a magnetic field, so that we may write in general 


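$$\operatorname{curl}\mathbf{H} = \frac{4\pi}{c}\,\mathbf{i} + \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi}{c}\frac{\partial\mathbf{P}}{\partial t} = \frac{4\pi\sigma}{c}\,\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{K-1}{c}\frac{\partial\mathbf{E}}{\partial t}.$$

The last two terms are usually combined into $\frac{K}{c}\frac{\partial\mathbf{E}}{\partial t}$, so that we write 

$$\operatorname{curl}\mathbf{H} = \frac{K}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi\sigma}{c}\,\mathbf{E} \tag{2a}$$

or, in M.K.S. units, 

$$\operatorname{curl}\mathbf{H} = KK_0\frac{\partial\mathbf{E}}{\partial t} + \sigma\mathbf{E}. \tag{2b}$$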


Following Maxwell’s ideas, we refer to the term 

$$\frac{K}{c}\frac{\partial\mathbf{E}}{\partial t}$$

as the displacement current, to distinguish it from the actual 
conduction current. The above equation (2a), together with equation (3a) 
(p. 312) [or (2b) and (3b)], constitute Maxwell’s equations for the 
electromagnetic field. 


2. The Wave Equation for the Propagation of Fields in Dielectrics. 


For isotropic dielectrics at rest the following equations hold: 


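$$\operatorname{curl}\mathbf{H} = \frac{K}{c}\frac{\partial\mathbf{E}}{\partial t}, \quad\text{or}\quad \operatorname{curl}\mathbf{H} = KK_0\frac{\partial\mathbf{E}}{\partial t}, \tag{I}$$

$$\operatorname{curl}\mathbf{E} = -\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t}, \quad\text{or}\quad \operatorname{curl}\mathbf{E} = -\mu\mu_0\frac{\partial\mathbf{H}}{\partial t}. \tag{II}$$

If, as we shall assume, no volume charges are present, we have in 
addition the equations of continuity of the electrical and magnetic 
fields, viz. 

$$\operatorname{div}\mathbf{E} = 0 \quad\text{and}\quad \operatorname{div}\mathbf{H} = 0. \tag{III}$$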


Either E or H may be eliminated from the Maxwell equations by the 
method of p. 325 . If we choose to eliminate H, we take the curl of 
the second equation, while we differentiate the first with respect to t. 
Using (III) we then obtain 


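$$\Delta\mathbf{E} = \frac{K\mu}{c^{2}}\frac{\partial^{2}\mathbf{E}}{\partial t^{2}}, \quad\text{or}\quad \Delta\mathbf{E} = KK_0\,\mu\mu_0\frac{\partial^{2}\mathbf{E}}{\partial t^{2}}. \tag{3}$$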



This is the differential equation of a wave-like propagation of the 
components of E. By eliminating E, we obtain an exactly similar 
equation for H. Of the tremendous manifold of phenomena described 
by this differential equation we now consider the simplest case — that 
of a plane wave. We understand this to mean a wave for which all 
quantities depend only upon one co-ordinate, say x. The components 
of E have constant values in planes normal to the x-axis. 
Mathematically, this means the vanishing of all derivatives ∂/∂y and 
∂/∂z. Hence from (III), 

$$\operatorname{div}\mathbf{E} = \frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} + \frac{\partial E_z}{\partial z} = 0,$$

it follows that 

$$\frac{\partial E_x}{\partial x} = 0. \tag{4}$$

Hence the electric field can have no spatially varying component in 
the direction of the x-axis. Since we are dealing with a wave motion, 
a superimposed field which has the same value at all points does not 
interest us here; we may conclude that equation (4) implies that 
$E_x = 0$. In the same way we find that $H_x = 0$. Hence the vanishing of 
the divergence of E 
and H implies the transversality of the plane wave. 

We obtain the equations 


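$$\frac{\partial^{2}E_y}{\partial x^{2}} = \frac{K\mu}{c^{2}}\frac{\partial^{2}E_y}{\partial t^{2}} \quad\text{and}\quad \frac{\partial^{2}E_z}{\partial x^{2}} = \frac{K\mu}{c^{2}}\frac{\partial^{2}E_z}{\partial t^{2}} \tag{5}$$

for the components $E_y$ and $E_z$. We assume a simple periodic 
dependence on the time: 

$$E_y = f(x)\,e^{i\omega t} \quad\text{and}\quad E_z = g(x)\,e^{i(\omega t - \delta)}, \tag{6}$$

where the phase constant δ is included to ensure generality. Then 

$$\frac{d^{2}f}{dx^{2}} + \frac{\omega^{2}K\mu}{c^{2}}\,f = 0 \quad\text{and}\quad \frac{d^{2}g}{dx^{2}} + \frac{\omega^{2}K\mu}{c^{2}}\,g = 0. \tag{7}$$

The integrals of these well-known equations of simple harmonic 
vibrations are 

$$f = A\,e^{\pm i\omega\sqrt{K\mu}\,x/c} \quad\text{and}\quad g = B\,e^{\pm i\omega\sqrt{K\mu}\,x/c}. \tag{8}$$

Using the negative sign, we then have 

$$E_y = A\,e^{i\omega\left(t - \sqrt{K\mu}\,x/c\right)} \quad\text{and}\quad E_z = B\,e^{i\left[\omega\left(t - \sqrt{K\mu}\,x/c\right) - \delta\right]}. \tag{9}$$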

This is the equation of a plane wave propagated in the direction of 
the positive x-axis (the positive sign in (8) would signify the 
opposite direction) with a wave velocity 

$$v = \frac{c}{\sqrt{K\mu}}, \quad\text{or}\quad v = \frac{1}{\sqrt{KK_0\,\mu\mu_0}}. \tag{10}$$


In the C.G.S. system v = c for a vacuum, so that the factor c introduced 
on p. 300 has a highly significant physical meaning. It represents the 
speed of electromagnetic waves in a vacuum, as anticipated there. In the 
M.K.S. system $c = 1/\sqrt{K_0\mu_0}$, as is evident from the dimensions of $K_0\mu_0$. 
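Equation (10) is easily checked with numbers. The short Python sketch below is an illustration only; the function name and the rounded values of the constants are assumptions, not quantities quoted in the text. It evaluates the wave velocity in a medium of given K and μ, and recovers the vacuum velocity from the M.K.S. constants $K_0$ and $\mu_0$.

```python
import math

C_CGS = 2.998e10          # cm/sec, rounded vacuum velocity (assumed value)
K0_MKS = 8.854e-12        # farad/m, rounded (assumed value)
MU0_MKS = 4.0e-7 * math.pi  # henry/m


def wave_velocity(K, mu, c=C_CGS):
    """Phase velocity v = c / sqrt(K * mu) of equation (10), C.G.S. form."""
    return c / math.sqrt(K * mu)


def vacuum_velocity_mks(k0=K0_MKS, mu0=MU0_MKS):
    """Vacuum velocity c = 1 / sqrt(K0 * mu0) in m/sec."""
    return 1.0 / math.sqrt(k0 * mu0)


if __name__ == "__main__":
    print(wave_velocity(1.0, 1.0))   # vacuum: equals c
    print(wave_velocity(4.0, 1.0))   # K = 4: half the vacuum velocity
    print(vacuum_velocity_mks())     # close to 2.998e8 m/sec
```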

$E_y$ and $E_z$ are independent of each other, so that the phase difference 
is arbitrary. The most general form is thus the elliptically polarized 
wave in which the end point of the vector E describes an ellipse at any 
given place; if this motion is confined to a straight line we have a 
plane polarized wave. 

In order to calculate the components of H we are not permitted to 
start with the wave equation for this quantity; this would furnish us 
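with an excess of constants. We must observe that E and H are 
connected by the field equations. For $E_z = 0$, the second one gives 

$$\frac{\partial H_y}{\partial t} = 0, \qquad -\frac{\mu}{c}\frac{\partial H_z}{\partial t} = \frac{\partial E_y}{\partial x} = -\frac{i\omega\sqrt{K\mu}}{c}\,A\,e^{i\omega\left(t - \sqrt{K\mu}\,x/c\right)}. \tag{11}$$

Integrating, 

$$H_y = 0, \qquad H_z = \sqrt{\frac{K}{\mu}}\,A\,e^{i\omega\left(t - \sqrt{K\mu}\,x/c\right)} = \sqrt{\frac{K}{\mu}}\,E_y. \tag{12}$$

Since we are no more interested in a temporally constant field than 
in a spatially constant one, we may set the integration constants of 
(12), which are independent of time (functions of x) equal to zero. 

Thus we have $E_yH_y + E_zH_z = 0$, 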

so that E is perpendicular to H in a plane polarized wave. Let 
us designate the direction of propagation by the unit vector n. 
Then E, H and n form a right-handed orthogonal triad, in this order. 

Electromagnetic waves embrace an enormous range of physical 
phenomena, the essential unity of which remained unrecognized for 
many years, chiefly because the methods of production and the effects 
of the waves differ widely according to the value of the frequency ν 
(or the vacuum wave-length λ = c/ν), as the accompanying short 
table (fig. 1) shows. The entire range, with certain overlappings of the 
methods of generation and of investigation, is now known without 
a break. 

[Fig. 1: The electromagnetic spectrum on a scale of wave-length in cm., 
running from electric waves and wireless waves through short electric 
waves, the infra-red (including the infra-red Hg-vapour emission and 
the near infra-red), the visible region, the ultra-violet region, the 
Lyman-Millikan region, soft X-rays, X-rays and γ-rays to cosmic rays.] 

In optics, the ratio of the phase velocity c in vacuum to that in 
any other medium is known as the index of refraction n of the 
medium. Thus, by equation (10), 

$$K\mu = n^{2}. \tag{13}$$

This equation is known as Maxwell’s relation. But, as even the familiar 
example of water (K = 80, μ ≈ 1, n = 1.33) shows, this relation is 
not always satisfied if we take n to be the optical index of refraction. 
If (13) were satisfied down to arbitrarily short waves there could 
be no dispersion at all, i.e. no variation of n with wave-length, if 
K and μ are not dependent upon frequency. 

The discrepancy is explained by the fact that the polarization of a 
medium (which determines the value of K) becomes dependent upon 
the frequency for rapidly alternating fields. Down to wave-lengths of 
about 1 cm. the static value of the dielectric constant may be used 
for all substances. For shorter waves (higher frequencies) dependence 
on frequency comes in. This will be explained in the second part of 
the Theory of Electricity (p. 452). 
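The size of the discrepancy for water is seen at once from a few lines of arithmetic. The Python sketch below is illustrative only; the function name is an assumption, and the numbers are those quoted in the text (static K = 80, μ taken as 1, optical n = 1.33).

```python
import math


def maxwell_index(K, mu=1.0):
    """Index of refraction predicted by Maxwell's relation K * mu = n^2."""
    return math.sqrt(K * mu)


if __name__ == "__main__":
    n_static = maxwell_index(80.0)   # from the static dielectric constant
    n_optical = 1.33                 # optical value quoted in the text
    print(n_static, n_optical, n_static / n_optical)
```

The static dielectric constant gives an index near 9, several times the optical value, so that K cannot retain its static value at optical frequencies.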




3. The Poynting Vector of Energy Flow. 

The energy density in the electromagnetic field has the value 

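$$u = \frac{1}{8\pi}\left(K\mathbf{E}^{2} + \mu\mathbf{H}^{2}\right), \quad\text{or}\quad u = \tfrac{1}{2}\left(KK_0\mathbf{E}^{2} + \mu\mu_0\mathbf{H}^{2}\right). \tag{14}$$

Since by (12), 

$$|\mathbf{H}| = \sqrt{\frac{K}{\mu}}\,|\mathbf{E}|, \quad\text{or}\quad |\mathbf{H}| = \sqrt{\frac{KK_0}{\mu\mu_0}}\,|\mathbf{E}|$$

for a plane wave, 

$$K\mathbf{E}^{2} = \mu\mathbf{H}^{2}, \quad\text{or}\quad KK_0\mathbf{E}^{2} = \mu\mu_0\mathbf{H}^{2}. \tag{15}$$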

Thus the energy is half electrical and half magnetic in this case. Now 
we know that energy is transported by every electromagnetic wave. 
In order to verify this experimentally we need only convert the energy 
into heat at a given point and determine calorimetrically the rate at 
which energy is being delivered. If the wave advances with a velocity 
v, the amount of energy streaming through unit area of a surface 
normal to the direction of propagation in the time dt is that residing 
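in the space within a rectangular parallelepiped of 1 cm.² base and 
altitude v dt, i.e. 

$$\frac{1}{8\pi}\left(K\mathbf{E}^{2} + \mu\mathbf{H}^{2}\right)\frac{c\,dt}{\sqrt{K\mu}} = \frac{c}{4\pi}\sqrt{\frac{K}{\mu}}\,\mathbf{E}^{2}\,dt = \frac{c}{4\pi}\,|\mathbf{E}|\,|\mathbf{H}|\,dt. \tag{16}$$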


We may therefore represent the energy flow by a vector in the direc- 
tion of propagation n whose magnitude is equal to the energy flow 
per unit time. This is the Poynting Vector 

$$\mathbf{S} = \frac{c}{4\pi}[\mathbf{E}\mathbf{H}], \quad\text{or}\quad \mathbf{S}_{\text{M.K.S.}} = [\mathbf{E}\mathbf{H}], \tag{17}$$


so named after its discoverer. Since E and H are periodic functions 
of the time, the vector S at a given place will also experience 
periodic fluctuations between zero and a maximum value. A 
change of sign never occurs, however, since E and H are always 
in phase [cf. (12)]. The Poynting vector is always in the direction 
of propagation. 

The Poynting vector S has a much more general significance than might be 
expected from the way it was introduced. We never come into conflict with 
experience by assuming that this vector may be applied in all cases. It holds 
for all simultaneous electric and magnetic fields — even for steady fields. For 
example, consider a cylindrical wire bent into the form of a circle, the ends being 
joined to a source of direct electromotive force. Within the wire, the electric 
field is along the axis, while the magnetic lines of force are circles whose planes 
are normal to the wire. Hence the energy flux vector at the surface of the con- 
ductor is normal to the surface and, moreover, is directed inwards no matter 
which way the current flows. Thus there is a continuous flow of energy from the 
surrounding dielectric into the wire, and it is readily shown that the amount of 
energy entering unit length of wire each second is exactly equal to the Joule 
heat developed in that portion of the wire per unit time. The ultimate source 
of energy is the battery. From the atomic standpoint, however, this interpretation 
sounds artificial. 

Ex. 93. The amount of solar energy received by the earth is about 2 cal./ 
min./cm.². What are the amplitudes of the electric and magnetic fields of the 
radiation? 
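One possible numerical treatment of Ex. 93 is sketched below in Python. It is not the book's own solution. It assumes that the radiation may be treated as a single plane wave in vacuum, for which |E| = |H| in Gaussian units and the time average of the Poynting vector (17) is $(c/8\pi)E_0^{2}$; the function name and the rounded conversion factors are likewise assumptions.

```python
import math

C = 2.998e10                 # cm/sec (rounded, assumed value)
CAL_TO_ERG = 4.186e7         # erg per calorie (rounded, assumed value)
STATVOLT_TO_VOLT = 299.8     # volts per electrostatic unit of potential


def plane_wave_amplitudes(flux, c=C):
    """Amplitudes (E0, H0) in Gaussian units for a mean energy flux in erg/(cm^2 sec).

    Uses mean |S| = (c / 8 pi) * E0^2 and H0 = E0 for a plane wave in vacuum.
    """
    e0 = math.sqrt(8.0 * math.pi * flux / c)
    return e0, e0


if __name__ == "__main__":
    flux = 2.0 * CAL_TO_ERG / 60.0          # 2 cal/(min cm^2) in erg/(cm^2 sec)
    e0, h0 = plane_wave_amplitudes(flux)
    print(e0, "e.s.u.  =", e0 * STATVOLT_TO_VOLT, "volt/cm")
    print(h0, "gauss")
```

Under these assumptions the electric amplitude is of the order of 10 volt/cm. and the magnetic amplitude a few hundredths of a gauss.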
4. Propagation of Electromagnetic Waves in Conducting Media. 

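For media in which there exists a non-vanishing conductivity the 
field equations are 

$$\operatorname{curl}\mathbf{H} = \frac{K}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi\sigma}{c}\,\mathbf{E}, \tag{I}$$

$$\operatorname{curl}\mathbf{E} = -\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t}, \tag{II}$$

$$\operatorname{div}\mathbf{E} = 0, \qquad \operatorname{div}\mathbf{H} = 0. \tag{III}$$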

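If H is again eliminated by the method of § 2 (p. 329), we have the 
following equation for E: 

$$\Delta\mathbf{E} = \frac{K\mu}{c^{2}}\frac{\partial^{2}\mathbf{E}}{\partial t^{2}} + \frac{4\pi\sigma\mu}{c^{2}}\frac{\partial\mathbf{E}}{\partial t},$$

$$\text{or}\quad \Delta\mathbf{E} = KK_0\,\mu\mu_0\frac{\partial^{2}\mathbf{E}}{\partial t^{2}} + \mu\mu_0\sigma\frac{\partial\mathbf{E}}{\partial t}. \tag{18}$$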

We again consider plane waves propagated in the direction of the 
positive x-axis, and set 

$$\frac{\partial}{\partial y} = 0, \qquad \frac{\partial}{\partial z} = 0,$$

from which it follows that $E_x = H_x = 0$. Assume again that $E_y$ and 
$E_z$ are simple periodic functions of the time: 

$$E_y = f(x)\,e^{i\omega t} \quad\text{and}\quad E_z = g(x)\,e^{i\omega t}. \tag{19}$$

For simplicity of notation, we restrict our discussion to plane polarized 
waves whose electric vector is in the xy-plane; hence we put $E_z = 0$. 

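By substitution in the wave equation we obtain the following 
differential equation for the point function f: 

$$\frac{d^{2}f}{dx^{2}} + \left(\frac{K\mu\omega^{2}}{c^{2}} - i\,\frac{4\pi\sigma\mu\omega}{c^{2}}\right)f = 0, \tag{20}$$

the solution of which is 

$$f(x) = A\,e^{\pm i\sqrt{K\mu\omega^{2} - i\cdot 4\pi\sigma\mu\omega}\;x/c}. \tag{21}$$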

If σ = 0, we again have the case of an undamped wave. In this 
instance we must choose the negative sign for a wave propagated in the 
positive x-direction; hence the negative sign is to be used here also. 
We now introduce the following notation for convenience: 

$$\sqrt{K\mu\omega^{2} - i\cdot 4\pi\sigma\mu\omega} = \omega n(1 - i\kappa). \tag{22}$$
Squaring and equating reals and imaginaries separately, we obtain 
equations for n and κ: 

$$n^{2} - n^{2}\kappa^{2} = K\mu, \qquad n^{2}\kappa = \frac{2\pi\sigma\mu}{\omega} = \sigma\mu T, \tag{23}$$

where T is the period of vibration. Solving simultaneously we obtain 

$$n = \sqrt{\frac{\mu}{2}\left(\sqrt{K^{2} + 4\sigma^{2}T^{2}} + K\right)}, \qquad n\kappa = \sqrt{\frac{\mu}{2}\left(\sqrt{K^{2} + 4\sigma^{2}T^{2}} - K\right)}. \tag{24}$$


Using the notation of (22) in (21), equation (19) then gives 

$$E_y = A\,e^{i\omega t}\,e^{-i\omega nx/c}\,e^{-\omega n\kappa x/c} = A\,e^{-\omega n\kappa x/c}\,e^{i\omega(t - nx/c)}. \tag{25}$$

The imaginary term −inκ is thus seen to furnish a real negative 
exponential function, i.e. a damping factor, while the velocity of 
propagation of the phase is determined by n. Thus the use of this 
same n for the index of refraction in insulating media is justified. 
The meaning of κ is as follows. Since 

$$\frac{n\omega}{c} = \frac{2\pi\nu}{v} = \frac{2\pi}{\lambda},$$

the damping factor becomes $e^{-2\pi\kappa x/\lambda}$. 
For x = λ, the factor becomes 

$$e^{-2\pi\kappa}.$$

That is, in travelling a distance equal to one wave-length λ = v/ν 
in the medium, the amplitude diminishes by a factor $e^{-2\pi\kappa}$. 

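We now calculate the vector H, using equation (II) (p. 333). As 
in the case of insulating media, only a z-component of H remains. 
Using (25), this is given by 

$$\frac{\partial H_z}{\partial t} = -\frac{c}{\mu}\frac{\partial E_y}{\partial x} = \frac{\omega}{\mu}\,(n\kappa + in)\,A\,e^{-\omega\kappa nx/c}\,e^{i\omega(t - nx/c)}, \tag{26}$$

i.e. 

$$H_z = \frac{n(1 - i\kappa)}{\mu}\,A\,e^{-\omega\kappa nx/c}\,e^{i\omega(t - nx/c)} = \frac{n(1 - i\kappa)}{\mu}\,E_y. \tag{27}$$

Now from the relationships in the complex plane, 

$$1 - i\kappa = \sqrt{1 + \kappa^{2}}\;e^{-i\gamma}, \quad\text{and}\quad \tan\gamma = \kappa. \tag{28}$$

Hence 

$$H_z = \frac{n}{\mu}\sqrt{1 + \kappa^{2}}\;e^{-i\gamma}E_y = \frac{n}{\mu}\sqrt{1 + \kappa^{2}}\;A\,e^{-\omega\kappa nx/c}\,e^{i[\omega(t - nx/c) - \gamma]}. \tag{29}$$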

Thus we find that there is a difference in phase between the electric 
and magnetic vectors in conducting media: H lags behind E by the 
angle y. According to the discussion in this section, every conductor 
should absorb electromagnetic waves, while all insulators should 
transmit them. However, we can give at once examples which con- 
tradict this: A solution of NaCl, in spite of high electrical conductivity, 
is perfectly transparent, while vulcanite, which is a good insulator, 
is opaque. The explanation is again, as in the case of the index of 
refraction, variation with frequency — here, variation of the con- 
ductivity. In calculations we may use the static value of the conduc- 
tivity down to a wave-length of about 1 cm. for all substances. A 
possible solution of the difficulty will be discussed in the second part 
of the Theory of Electricity (§ 2, p. 451). 
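The relations (22) to (24) and (28) of this section lend themselves to a numerical check. The Python sketch below is an illustration under stated assumptions: Gaussian units with σ in sec⁻¹, function names chosen for the example, and static values of K, μ and σ. It computes n and κ from the closed form (24) and, independently, from the complex square root in (22), and then gives the damping per wave-length in the medium and the phase lag γ of H behind E.

```python
import cmath
import math


def optical_constants(K, mu, sigma, T):
    """Return (n, kappa) from the closed form (24); T is the period."""
    root = math.sqrt(K * K + 4.0 * sigma * sigma * T * T)
    n = math.sqrt(0.5 * mu * (root + K))
    n_kappa = math.sqrt(0.5 * mu * (root - K))
    return n, n_kappa / n


def optical_constants_from_root(K, mu, sigma, T):
    """Return (n, kappa) directly from the definition (22)."""
    omega = 2.0 * math.pi / T
    z = cmath.sqrt(K * mu * omega**2 - 1j * 4.0 * math.pi * sigma * mu * omega) / omega
    # z = n * (1 - i*kappa): the principal root has positive real part here.
    return z.real, -z.imag / z.real


def damping_per_wavelength(kappa):
    """Amplitude factor exp(-2*pi*kappa) over one wave-length in the medium."""
    return math.exp(-2.0 * math.pi * kappa)


def phase_lag(kappa):
    """Angle gamma of (28), with tan(gamma) = kappa."""
    return math.atan(kappa)


if __name__ == "__main__":
    n, kappa = optical_constants(4.0, 1.0, 1.0e10, 1.0e-10)
    print(n, kappa, damping_per_wavelength(kappa), phase_lag(kappa))
```

Agreement of the two routes confirms that (24) is the solution of (23) with n and κ both positive.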


6. Hertz’s Solution of the Field Equations. The Hertzian Oscillator. 

Up to now we have considered plane waves without inquiring into 
their origin. The waves actually come from spatially limited sources, 
whether they are wireless waves emitted from an antenna which is 
small compared with the distances spanned by the signals, or light 
waves emitted by the radiating atoms or molecules of a gaseous source 
of light. For this reason, the waves are approximately plane only at 
great distances from the source. H. Hertz found a solution of the 
field equations which provides a unified representation of source and 
radiation field in the whole of space. If we confine our attention to 
a vacuum (free space), the appropriate field equations are 


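$$\operatorname{curl}\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t}, \tag{I}$$

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{H}}{\partial t}, \tag{II}$$

$$\operatorname{div}\mathbf{E} = 0, \qquad \operatorname{div}\mathbf{H} = 0. \tag{III}$$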

In determining the magnetic fields accompanying steady currents 
it proved expedient to express H as the curl of an auxiliary vector, 
the vector potential, which bore a simple relation to the distribution 
of currents. We proceed here in a similar manner. The formulae 
become appreciably simpler if we choose as vector potential the time 
derivative of an auxiliary vector Z and apply also the factor 1/c: 

$$\mathbf{H} = \frac{1}{c}\operatorname{curl}\frac{\partial\mathbf{Z}}{\partial t}. \tag{30}$$

Since the order of the differentiations can be reversed, we may also 
write 

$$\mathbf{H} = \frac{1}{c}\frac{\partial}{\partial t}\left(\operatorname{curl}\mathbf{Z}\right).$$



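Substituting in (I), and integrating with respect to t, 

$$\mathbf{E} = \operatorname{curl}\operatorname{curl}\mathbf{Z} = \operatorname{grad}\operatorname{div}\mathbf{Z} - \Delta\mathbf{Z}. \tag{31}$$

In order that (II) may be satisfied we must have 

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c^{2}}\operatorname{curl}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}}. \tag{32}$$

Integration yields 

$$\mathbf{E} = -\frac{1}{c^{2}}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}} + \operatorname{grad}\varphi. \tag{33}$$

The arbitrary vector grad φ appears in place of a constant of 
integration, since the equality of the curls of two vectors implies only 
that these vectors differ by an irrotational vector grad φ. The two 
expressions (31) and (33) for E will be identical if we take 

$$\varphi = \operatorname{div}\mathbf{Z} \tag{34}$$

and 

$$\Delta\mathbf{Z} = \frac{1}{c^{2}}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}}. \tag{35}$$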



Hence, as may easily be verified independently, the values of E and 
H defined in terms of Z by (30), (33) and (34) satisfy (I), (II) and (III), 
provided Z is a solution of the wave equation (35). 

We no longer seek the solution corresponding to a plane wave, for 
this could give no singularity at the origin; rather, we attempt to 
find the solution corresponding to a spherical wave, in which the 
surfaces of equal phase are spheres. For this purpose we write ΔZ in 
spherical polar co-ordinates, and assume that Z is a function of r only: 


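$$\frac{\partial^{2}\mathbf{Z}}{\partial r^{2}} + \frac{2}{r}\frac{\partial\mathbf{Z}}{\partial r} = \frac{1}{c^{2}}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}}. \tag{36}$$

Assuming a simple periodic variation with time, we write 

$$\mathbf{Z} = \mathbf{F}(r)\,e^{i\omega t} \tag{37}$$

and obtain the equation 

$$\frac{d^{2}\mathbf{F}}{dr^{2}} + \frac{2}{r}\frac{d\mathbf{F}}{dr} + \frac{\omega^{2}}{c^{2}}\,\mathbf{F} = 0 \tag{38}$$

for the point function F. The integral is 

$$\mathbf{F} = \frac{\mathbf{A}}{r}\,e^{\pm i\omega r/c}, \tag{39}$$

as may be seen by substitution, or by writing the equation in the form 

$$\frac{d^{2}}{dr^{2}}(r\mathbf{F}) + \frac{\omega^{2}}{c^{2}}(r\mathbf{F}) = 0.$$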

The negative sign corresponds to a wave spreading outward from the 
centre r = 0. We then have 

$$\mathbf{Z} = \mathbf{A}\,\frac{e^{i\omega(t - r/c)}}{r} = \frac{\mathbf{p}(t - r/c)}{r}, \tag{40}$$

where p(t) replaces $\mathbf{A}e^{i\omega t}$. This spherical wave differs from a plane 
wave in that the amplitude decreases as 1/r, corresponding to the 
spreading of the energy over a sphere of increasing size. The field 
vectors E and H are obtained from Z by simple differentiations. 
Starting with H, the simpler of the two, we have 

$$\operatorname{curl}\mathbf{Z} = \operatorname{curl}\{\mathbf{A}f(r)\} = [\operatorname{grad}f(r),\ \mathbf{A}] = \frac{df}{dr}\,[\mathbf{r}_0\mathbf{A}], \tag{41}$$

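where $\mathbf{r}_0$ is a unit vector along r. From (40), with 
$f(r) = e^{i\omega(t - r/c)}/r$, 

$$\frac{df}{dr} = -\left(\frac{i\omega}{cr} + \frac{1}{r^{2}}\right)e^{i\omega(t - r/c)}. \tag{42}$$

Hence 

$$\mathbf{H} = \frac{1}{c}\frac{\partial}{\partial t}\operatorname{curl}\mathbf{Z} = \frac{i\omega}{c}\left(\frac{1}{r^{2}} + \frac{i\omega}{cr}\right)e^{i\omega(t - r/c)}\,[\mathbf{A}\mathbf{r}_0]. \tag{43}$$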

The vector H is thus seen to be composed of two parts, similarly directed 
but decreasing according to different powers of 1/r. We can locate 
the regions where one or the other term predominates by writing 


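$$\frac{1}{r^{2}} + \frac{i\omega}{cr} = \frac{1}{r}\left(\frac{1}{r} + \frac{2\pi i}{\lambda}\right). \tag{44}$$

The scale of comparison is thus given by the wave-length. In the 
near zone (r ≪ λ), the first term predominates; in the distant zone 
(r ≫ λ), the actual radiation field, the second term controls the 
situation. Between the two is a region where both terms are of the 
same order of magnitude. We investigate the near zone first. Here 
we have 

$$\mathbf{H}_n = \frac{i\omega}{cr^{2}}\,e^{i\omega(t - r/c)}\,[\mathbf{A}\mathbf{r}_0] = \frac{1}{cr^{2}}\,e^{-i\omega r/c}\,[\dot{\mathbf{p}}\,\mathbf{r}_0]. \tag{45}$$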

If we compare this expression with the Biot-Savart formula for the 
field due to an element of a current-carrying conductor 
(p. 303), we see that, apart from the factor $e^{-i\omega r/c}$, 
the field in the local region is that due to a current element 

$$I\,d\mathbf{s} = \frac{d\mathbf{p}}{dt} = \dot{\mathbf{p}}. \tag{46}$$

The factor $e^{-i\omega r/c}$ 
corresponds to the finite velocity of propagation of the magnetic field, 
which causes a phase displacement, increasing with distance from the 
centre, with respect to the instantaneous value of the current element 
(“ retarded field ”). In order to gain a conception of such an alter- 
nating current element, we consider an electrical dipole whose length 
and direction are given by l, and whose ends carry periodically 
oscillating charges 

$$q = \pm q_0\,e^{i\omega t}. \tag{47}$$

The moment of the dipole will also be a periodic function of the time: 

$$\mathbf{p} = q\,\mathbf{l} = q_0\,e^{i\omega t}\,\mathbf{l}, \tag{48}$$

and we have 

$$\frac{d\mathbf{p}}{dt} = \frac{dq}{dt}\,\mathbf{l}.$$

But dq/dt is the current flowing as the charge changes. Our solution 
thus gives the magnetic field of a dipole whose axis is fixed and whose 
moment p is periodic. Such a dipole is called an electrical oscillator or 
Hertzian vibrator. Later we shall find that the electrical field in the 
near zone also corresponds with the electrostatic field of a dipole of 
periodically oscillating moment. 

A dipole of this kind may be represented by a short linear antenna 
in which high-frequency oscillatory currents are induced, say by 
inductively coupling the antenna to an oscillating circuit. In con- 
trast with steady currents, where the impossibility of isolating indi- 
vidual conductor elements made us reject the Biot-Savart law as a 
fundamental basis, current elements are experimentally attainable 
when we deal with oscillations. However, the magnetic field of such 
a segment is given by the Biot-Savart rule only in the immediate 
neighbourhood of the conductor, and then only to a first approxi- 
mation. According to the theory of action at a distance, we should 
use this rule at arbitrarily great distances from the conductor; this 
would yield results which are entirely contrary to experience, for at 
great distances (i.e. in the “ radiation field ”), the second term of (44) 
controls the situation. By (43) this gives 


$$\mathbf{H}_f = -\frac{\omega^{2}}{c^{2}r}\,e^{i\omega(t - r/c)}\,[\mathbf{p}_0\mathbf{r}_0] = \frac{1}{c^{2}r}\,e^{-i\omega r/c}\,[\ddot{\mathbf{p}}\,\mathbf{r}_0], \tag{49}$$

where $\mathbf{p}_0$ denotes the amplitude A of the dipole moment p. This 
magnetic field which decreases as 1/r is no longer proportional to 
the first derivative of p (which by equation (48) is proportional to 
the antenna current) but to its second derivative, i.e. to the rate of 
change of the current. 
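The relative weight of the two terms of (44) is fixed by the single ratio r/λ, as the following Python sketch illustrates. The function names are assumptions made for the example; the quantities themselves follow directly from (44).

```python
import math


def far_to_near_ratio(r_over_lambda):
    """Ratio |2*pi*i/lambda| / |1/r| of the two terms in (44): 2*pi*r/lambda."""
    return 2.0 * math.pi * r_over_lambda


def crossover_radius(wavelength):
    """Distance at which the two terms of (44) are equal in magnitude."""
    return wavelength / (2.0 * math.pi)


if __name__ == "__main__":
    for x in (0.01, 0.1, 1.0, 10.0):
        print(x, far_to_near_ratio(x))
    print(crossover_radius(1.0))
```

The two terms are equal in magnitude at r = λ/2π, i.e. at about one-sixth of a wave-length; at a hundredth of a wave-length the radiation term is only some 6 per cent of the Biot-Savart term, while at ten wave-lengths it is over sixty times larger.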

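In order to calculate E it is best to start with equation (31) (p. 336): 

$$\mathbf{E} = \operatorname{grad}\operatorname{div}\mathbf{Z} - \Delta\mathbf{Z}. \tag{31}$$

We also use the fact that Z satisfies the wave equation, i.e. 

$$\mathbf{E} = \operatorname{grad}\operatorname{div}\mathbf{Z} - \frac{1}{c^{2}}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}}. \tag{31'}$$

Now by equation (40) (p. 337), 

$$\operatorname{div}\mathbf{Z} = \operatorname{div}\left(\mathbf{A}\,\frac{e^{i\omega(t - r/c)}}{r}\right) = \operatorname{div}\{\mathbf{A}f(r)\} = \mathbf{A}\operatorname{grad}f(r) = \frac{df}{dr}\,(\mathbf{A}\mathbf{r}_0), \tag{50}$$

whence 

$$\operatorname{grad}\operatorname{div}\mathbf{Z} = r\,(\mathbf{A}\mathbf{r}_0)\,\mathbf{r}_0\,\frac{d}{dr}\left(\frac{1}{r}\frac{df}{dr}\right) + \frac{1}{r}\frac{df}{dr}\,\mathbf{A}, \tag{51}$$

where (A r) in the first term has been replaced by $(\mathbf{A}\mathbf{r}_0)\,r$ in order to 
make clear which powers of r occur. We also have the expression for 
df/dr given by equation (42) (p. 337): 

$$\frac{df}{dr} = -\left(\frac{i\omega}{cr} + \frac{1}{r^{2}}\right)e^{i\omega(t - r/c)}, \tag{52}$$

and, further, 

$$\frac{d}{dr}\left(\frac{1}{r}\frac{df}{dr}\right) = \left(\frac{3}{r^{4}} + \frac{3i\omega}{cr^{3}} - \frac{\omega^{2}}{c^{2}r^{2}}\right)e^{i\omega(t - r/c)}. \tag{53}$$

We also have 

$$\frac{1}{c^{2}}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}} = -\frac{\omega^{2}}{c^{2}r}\,\mathbf{A}\,e^{i\omega(t - r/c)}. \tag{54}$$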

The field in the near zone is governed by the highest power of 1/r 
only. Since $\mathbf{r}_0$ is a unit vector, this is the third. But this power does 
not occur in the term 

$$\frac{1}{c^{2}}\frac{\partial^{2}\mathbf{Z}}{\partial t^{2}},$$

so that the electric field in the near region is derivable from a scalar 
potential 

$$V = -\operatorname{div}\mathbf{Z}. \tag{55}$$

Neglecting the lower powers of 1/r, this quantity becomes 

$$V = \frac{(\mathbf{A}\mathbf{r}_0)}{r^{2}}\,e^{i\omega(t - r/c)} = \frac{(\mathbf{p}\,\mathbf{r}_0)}{r^{2}}\,e^{-i\omega r/c}. \tag{56}$$

But according to equation (28) (p. 273), this is precisely the potential 
of a dipole of moment p, taking into consideration the variation of 
phase with increasing distance caused by the finite velocity of 
propagation. This is then the so-called “retarded potential”. 

To determine E in the radiation field we must collect the terms in 
1/r. Since $\mathbf{r}_0^{2} = 1$, this gives 

$$\mathbf{E}_f = \frac{\omega^{2}}{c^{2}r}\,e^{i\omega(t - r/c)}\,\{\mathbf{A} - \mathbf{r}_0(\mathbf{r}_0\mathbf{A})\} = \frac{\omega^{2}}{c^{2}r}\,e^{i\omega(t - r/c)}\,[\mathbf{r}_0[\mathbf{A}\mathbf{r}_0]]. \tag{57}$$


Taking the axis of the dipole to be the 
polar axis, H has the direction of the 
latitude circles [cf. equation (49)], and E 
is along the meridians [equation (57)] in 
the far region. E, H and the unit vector 
r 0 in the direction of propagation again 
form a right-handed system (fig. 2). In 
contrast with the phase, the amplitudes 
of E and H are not constant over the 
surface of the sphere, but decrease toward 
the pole (θ = 0) as sin θ, on account of 
the factor $[\mathbf{p}_0\mathbf{r}_0]$. This means that a 
linear oscillator emits no radiation in the 
direction of its axis. This further implies that on account of the 
vanishing of the radiation term near θ = 0, the contribution of the term 
for the near zone may not be neglected entirely, in spite of its decrease 
with 1/r³, since it does not vanish for θ = 0. We are therefore not free 
to draw the meridional direction of E quite to the pole. The contribution 
of the first term causes the lines of force to become rounded at the 
poles, giving Hertz’s celebrated instantaneous picture of the lines of 
force (fig. 3).* 

Fig. 3 

* Consult H. Hertz, Ann. Phys. Chem., 30, p. 1 (1889), or his Gesammelte Werke, 
II, p. 146. 
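If we now consider the energy flux vector S, which is in the 
direction of $\mathbf{r}_0$, its instantaneous value is given by c/4π times the 
product of the magnitudes of the mutually perpendicular vectors 
$\mathbf{E}_f$ and $\mathbf{H}_f$. Passing to the real functions, these are 

$$|\mathbf{E}_f| = \frac{\omega^{2}}{c^{2}r}\,p_0\sin\theta\,\cos\omega\!\left(t - \frac{r}{c}\right), \qquad |\mathbf{H}_f| = \frac{\omega^{2}}{c^{2}r}\,p_0\sin\theta\,\cos\omega\!\left(t - \frac{r}{c}\right). \tag{58}$$

Hence we have 

$$|\mathbf{S}| = \frac{\omega^{4}}{4\pi c^{3}r^{2}}\,p_0^{2}\sin^{2}\theta\,\cos^{2}\omega\!\left(t - \frac{r}{c}\right). \tag{59}$$

Since the time average of cos²φ is 1/2, the time average is 

$$|\bar{\mathbf{S}}| = \frac{\omega^{4}}{8\pi c^{3}r^{2}}\,p_0^{2}\sin^{2}\theta. \tag{60}$$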


In wireless telegraphy the energy radiated in various directions is 
conveniently represented by a polar diagram in which the length 
of the radius vector ρ is taken equal to the radiated energy, on some 
suitable scale. The equation of such a curve for the linear oscillator 
is, from the above, 

$$\rho = \rho_0\sin^{2}\theta. \tag{61}$$

The average energy which passes through a sphere of radius r each 
second is obtained by taking the surface integral 

$$\oint\bar{S}\,dA = \frac{\omega^{4}}{8\pi c^{3}r^{2}}\,p_0^{2}\cdot 2\pi r^{2}\int_0^{\pi}\sin^{2}\theta\,\sin\theta\,d\theta = \frac{\omega^{4}}{8\pi c^{3}}\cdot 2\pi\cdot\frac{4}{3}\,p_0^{2}. \tag{62}$$

This gives the energy lost by the source of radiation each second: 

$$\bar{S} = \frac{\omega^{4}}{3c^{3}}\,p_0^{2}. \tag{63}$$

This expression can be given another form by inserting the current 
amplitude $I_0$ in place of the amplitude of the dipole moment. 
According to equations (46) and (48) (p. 338), the magnitude of $I_0$ is 
$\omega p_0/ds$. If we put the length ds of the element (the antenna) equal 
to l, 

$$\bar{S} = \frac{\omega^{2}l^{2}}{3c^{3}}\,I_0^{2} = \frac{4\pi^{2}}{3c}\left(\frac{l}{\lambda}\right)^{2}I_0^{2}.$$

This loss of energy by radiation is proportional to the square of the 
current, like the ordinary Joule loss. With an ohmic resistance R, 
the same alternating current would have an average energy loss in 
the form of heat amounting to 

$$\tfrac{1}{2}\,R\,I_0^{2}.$$

The ohmic resistance which would cause the same loss of energy as 
the radiation is called the “radiation resistance” $R_s$ of the system. 
According to the above, 

$$R_s = \frac{8\pi^{2}}{3c}\left(\frac{l}{\lambda}\right)^{2}\ \text{electrostatic units,} \quad\text{or}\quad R_s = 80\pi^{2}\left(\frac{l}{\lambda}\right)^{2}\ \text{ohms.}$$


It is seen that even for antenna lengths of but 1/10 the wave-length, 
the radiation resistance is of the order of 10 ohms, i.e. large compared 
with the ohmic resistance. 
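The figure quoted for an antenna of one-tenth the wave-length follows from the formula for the radiation resistance, as the Python sketch below shows. It is an illustration only: the function names are assumptions, and the conversion of one electrostatic unit of resistance to about 8.988 × 10¹¹ ohms is a rounded value supplied for the example.

```python
import math

C = 2.998e10                # cm/sec (rounded, assumed value)
ESU_TO_OHM = 8.988e11       # ohms per electrostatic unit of resistance (rounded)


def radiation_resistance_ohms(l_over_lambda):
    """Radiation resistance 80 * pi^2 * (l/lambda)^2 in ohms."""
    return 80.0 * math.pi**2 * l_over_lambda**2


def radiation_resistance_esu(l_over_lambda, c=C):
    """Radiation resistance (8 pi^2 / 3c) * (l/lambda)^2 in electrostatic units."""
    return 8.0 * math.pi**2 / (3.0 * c) * l_over_lambda**2


if __name__ == "__main__":
    print(radiation_resistance_ohms(0.1))                 # about 7.9 ohms
    print(radiation_resistance_esu(0.1) * ESU_TO_OHM)     # the same, converted
```

The two forms agree to within the rounding of c, and for l = λ/10 both give a little under 8 ohms, i.e. of the order of 10 ohms as stated.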


Ex. 94. Show that the phase velocity of H is greater than c in the 
transitional region between the near and distant zones. 


CHAPTER XVIII 


Electromagnetic Waves: II. Phenomena in Two 

Adjoining Media 


1. Unified Rigorous Derivation of the Field Equations and of the Boun- 
dary Conditions. 

In setting up the electromagnetic field equations we started with 
common facts of experience. In proceeding thus, we were unable to 
give a rigorous proof of the electrical analogue of the law of magnetic 
induction, equation (1) (p. 327); moreover, the boundary conditions 
were derived only for static or stationary fields. Hence before pro- 
ceeding further, we shall explain a more satisfactory way of arriving 
at these results. The proposed method is based upon five hypotheses 
which, it must be admitted, are not as immediately obtainable from 
experiment as are those which formed the basis of our previous method. 
However, the following procedure has the advantage of generality and 
compactness. The five assumptions are the following: 


1. The expression for the density of electrical energy is 

$$u_{el} = \frac{K}{8\pi}\,\mathbf{E}^{2}. \tag{1}$$

2. The expression for the magnetic energy density is 

$$u_{mag} = \frac{\mu}{8\pi}\,\mathbf{H}^{2}. \tag{2}$$

3. The energy flux is given by 

$$\mathbf{S} = \frac{c}{4\pi}[\mathbf{E}\mathbf{H}]. \tag{3}$$

4. The expression for the electrical energy converted into Joule 
heat per second per cubic centimetre is 

$$\sigma\mathbf{E}^{2}. \tag{4}$$

5. The experimental fact of the linear superposition of 
electromagnetic fields, which requires that the field equations be linear 
and homogeneous in E and H.* 

* We exclude from consideration here the special case of ferro-magnetic media, 
where the non-linearity of the field equations results from the fact that μ 
itself depends upon the value of the field. 
These five postulates emerge in a plausible way from general 
considerations if we seek the simplest and most direct dimensionally 
correct relations between the quantities involved. This is essentially what 
was done on p. 288 in obtaining an expression for the energy density. 

The field equations for a uniform medium follow at once from the 
following energy considerations : the decrease per second of the elec- 
tromagnetic energy within any volume is equal to the sum of the 
energy converted into heat and the energy which flows out through the 
bounding surface, i.e. 




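$$-\frac{\partial}{\partial t}\int\left(\frac{K}{8\pi}E^2 + \frac{\mu}{8\pi}H^2\right)d\tau = -\int\left(\frac{K}{4\pi}\,\mathbf{E}\frac{\partial\mathbf{E}}{\partial t} + \frac{\mu}{4\pi}\,\mathbf{H}\frac{\partial\mathbf{H}}{\partial t}\right)d\tau = \int\sigma E^2\,d\tau + \frac{c}{4\pi}\oint[\mathbf{E}\mathbf{H}]_n\,dS. \qquad (5)$$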


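If the surface integral is converted into a volume integral by means
of Gauss's theorem, we have, since div [EH] = H curl E − E curl H,

$$\int\left\{K\,\mathbf{E}\frac{\partial\mathbf{E}}{\partial t} + \mu\,\mathbf{H}\frac{\partial\mathbf{H}}{\partial t} + 4\pi\sigma E^2 + c\,\mathbf{H}\operatorname{curl}\mathbf{E} - c\,\mathbf{E}\operatorname{curl}\mathbf{H}\right\}d\tau = 0. \qquad (6)$$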


Since the integral is zero for any arbitrary volume, the integrand must 
vanish. This is assured, and at the same time Assumption 5 is satis- 
fied, by setting equal to zero the factors by which E and H are scalarly 
multiplied. From this it follows that 


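$$\operatorname{curl}\mathbf{H} = \frac{K}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi\sigma}{c}\,\mathbf{E}, \qquad (I)$$

$$\operatorname{curl}\mathbf{E} = -\frac{\mu}{c}\frac{\partial\mathbf{H}}{\partial t} = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}. \qquad (II)$$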



The boundary conditions are obtained from the obvious condition 
that at the surface of separation of two media the energy leaving the 
first medium must enter the adjoining one, since for finite conduc- 
tivity no electrical energy can be dissipated as heat in the mathe- 
matical surface of separation. If we take the normal to the interface 
as z-axis, 

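$$S_z^{(1)} = \frac{c}{4\pi}\,[\mathbf{E}^{(1)}\mathbf{H}^{(1)}]_z = S_z^{(2)} = \frac{c}{4\pi}\,[\mathbf{E}^{(2)}\mathbf{H}^{(2)}]_z, \qquad (7)$$

or

$$E_x^{(1)}H_y^{(1)} - H_x^{(1)}E_y^{(1)} = E_x^{(2)}H_y^{(2)} - H_x^{(2)}E_y^{(2)}. \qquad (8)$$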

This condition is secured by taking for the equations to be satisfied 
at the bounding surface * 

$$E_x^{(1)} = E_x^{(2)}, \qquad E_y^{(1)} = E_y^{(2)}, \qquad H_x^{(1)} = H_x^{(2)}, \qquad H_y^{(1)} = H_y^{(2)}, \qquad (9)$$

or

$$\mathbf{E}_{tan}^{(1)} = \mathbf{E}_{tan}^{(2)} \quad\text{and}\quad \mathbf{H}_{tan}^{(1)} = \mathbf{H}_{tan}^{(2)}, \qquad (IV)$$

or using the concept of surface curl,

$$\operatorname{Curl}\mathbf{E} = [\mathbf{n}\,(\mathbf{E}_2 - \mathbf{E}_1)] = 0, \qquad \operatorname{Curl}\mathbf{H} = 0. \qquad (IV')$$

* The surface conditions (IV) are usually deduced directly from the fundamental
equations (I) and (II) above. We take a small rectangular area having two


The relationships governing the normal components are already fixed 
by this condition of continuity of the tangential components, since 
E and H must, in addition, satisfy the field equations in each of the 
two media. The general validity of the continuity of the normal com-
ponents of D = KE and of B = μH is readily demonstrated by a
simple computation using (IV) and the field equations. This also 
implies, of course, that the normal components of E and H them- 
selves suffer discontinuities at the interface. 

2.* Waves in the Boundary Layer. The Ground Wave. 

The Hertzian solution, which includes the antenna as well as the 
radiation field, fails to represent actual conditions in this respect, that 
the transmitter is not suspended in space, but is located at the boun- 
dary of two media having quite different electrical properties. We 
must therefore find two solutions which satisfy the boundary con- 
ditions as well as the field equations — one for the upper medium (the 
air), the other for the earth. The problem, replete with difficulties, 
was solved in a series of papers by Sommerfeld. We restrict our atten- 
tion to the simplest case — the demonstration of the possibility of a 
kind of plane wave having its greatest amplitude at the bounding 
surface, and hence referred to as a surface wave or boundary layer 
wave. The consideration of this wave apart from the total phenomenon, 
it is true, is somewhat arbitrary. 

We take the surface of separation to be the xy-plane of a rect- 
angular system of co-ordinates, the direction of propagation being 
along the positive x-axis. For a plane wave in a uniform medium, E 
as well as H would be normal to the direction of propagation. But 
the boundary conditions cannot be fulfilled if we make such an assump-
tion for two media. We therefore generalize the hypothesis in two
ways: first we assume the waves to be non-uniform, i.e. the amplitude 
is to be a function of z, the distance from the interface. Secondly, we 

of its sides normal to the surface, and the other two (which are made much longer) on
opposite sides of the surface and parallel to it. On the assumption that there are no
surface currents, electric or magnetic, it follows at once from I and II respectively
that the tangential component of the magnetic vector and that of the electric vector
are continuous across the surface. There is one important case in which surface
currents are present (viz. the case of a perfect conductor, that is, speaking optically,
a perfect reflector); in this case the sole condition is that the tangential
component of the electric vector vanishes.
assume that H, which we take in the y-direction, is representable by
a purely transverse wave. It will be seen later that E, on the contrary,
has an x-component in addition. Thus for H our hypothesis is

$$\mathbf{H} = H_y\,\mathbf{j}, \qquad H_y = f(z)\,e^{i\omega(t - x/v)}. \qquad (10)$$


We leave the phase velocity of the wave still undetermined; it is to 
be expected that this will be determined by the electrical constants 
of the two media. With the above assumptions (10) concerning H, 
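the field equations give for the components of E:

$$-\frac{\partial H_y}{\partial z} = -\frac{df}{dz}\,e^{i\omega(t - x/v)} = \frac{K}{c}\frac{\partial E_x}{\partial t} + \frac{4\pi\sigma}{c}\,E_x, \qquad (11)$$

$$0 = \frac{K}{c}\frac{\partial E_y}{\partial t} + \frac{4\pi\sigma}{c}\,E_y, \qquad (12)$$

$$\frac{\partial H_y}{\partial x} = -\frac{i\omega}{v}\,f\,e^{i\omega(t - x/v)} = \frac{K}{c}\frac{\partial E_z}{\partial t} + \frac{4\pi\sigma}{c}\,E_z. \qquad (13)$$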


We see from (12) that $E_y$ may be set equal to zero; on the other hand,
the z-component of E does not vanish.

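The Maxwell equations (II, p. 344) also indicate that E is indepen-
dent of the y-co-ordinate. The second of them yields the equation

$$\frac{\partial E_x}{\partial z} - \frac{\partial E_z}{\partial x} = -\frac{\mu}{c}\frac{\partial H_y}{\partial t} = -\frac{i\omega\mu}{c}\,f\,e^{i\omega(t - x/v)}. \qquad (14)$$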


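We now assume non-uniform waves, propagated in the x-direction,
for $E_x$ and $E_z$ also:

$$E_x = g(z)\,e^{i\omega(t - x/v)}, \qquad (15)$$

$$E_z = h(z)\,e^{i\omega(t - x/v)}. \qquad (16)$$

Substituting this value of $E_x$ in equation (11), we have

$$g = -\frac{c}{4\pi\sigma + K\omega i}\,\frac{df}{dz}. \qquad (17)$$

From (13) and (16),

$$h = -\frac{i\omega c}{v}\,\frac{f}{4\pi\sigma + K\omega i}. \qquad (18)$$

And finally from (11), (13) and (14),

$$\frac{d^2 f}{dz^2} = \left\{\omega^2\left(\frac{1}{v^2} - \frac{K\mu}{c^2}\right) + \frac{4\pi\sigma\mu\omega i}{c^2}\right\} f. \qquad (19)$$

The solution is

$$f = A\,e^{pz}, \qquad (20)$$

where

$$p = \pm\sqrt{\frac{\omega^2}{v^2} - \frac{K\mu\omega^2}{c^2} + \frac{4\pi\sigma\mu\omega i}{c^2}}. \qquad (20')$$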

This solution has the same form for both media. But since solutions
increasing exponentially with z have no meaning, we must choose the
sign before the radical in such a way that for z > 0 (i.e. in air) the
real part is negative, while for z < 0 (i.e. in the earth) the sign must
be so chosen that the real part is positive. Denoting all quantities
pertaining to the upper medium (air) by the subscript 0, the solution
for this space becomes

$$H_y^{(0)} = A_0\,e^{-p_0 z}\,e^{i\omega(t - x/v_0)}, \qquad (21)$$

and in the ground:

$$H_y = A\,e^{+pz}\,e^{i\omega(t - x/v)}. \qquad (22)$$

Also, by (20'),

$$p_0^2 = \frac{\omega^2}{v_0^2} - \frac{K_0\mu_0\omega^2}{c^2} + \frac{4\pi\sigma_0\mu_0\omega i}{c^2}, \qquad p^2 = \frac{\omega^2}{v^2} - \frac{K\mu\omega^2}{c^2} + \frac{4\pi\sigma\mu\omega i}{c^2}. \qquad (23)$$


The boundary conditions expressing continuity of the tangential 
components of E and of H, which must hold at every instant, can be 
fulfilled only if the phase velocity is the same on both sides of the 
interface, i.e. 

$$v = v_0. \qquad (24)$$

Further, for z = 0, $H_y^{(0)} = H_y$ yields the relationship

$$A_0 = A. \qquad (25)$$


The tangential component of E is obtained from equation (15), using
(17) and (20) and attending to the remark made above as to the sign
of p:

$$E_x^{(0)} = \frac{c\,p_0\,A\,e^{-p_0 z}\,e^{i\omega(t - x/v)}}{4\pi\sigma_0 + K_0\omega i}, \qquad E_x = -\frac{c\,p\,A\,e^{+pz}\,e^{i\omega(t - x/v)}}{4\pi\sigma + K\omega i}. \qquad (26)$$

The boundary condition thus yields the equation

$$\frac{p_0}{4\pi\sigma_0 + K_0\omega i} = -\frac{p}{4\pi\sigma + K\omega i}. \qquad (27)$$

This equation, taken along with (23) and (24), suffices to determine
the remaining three unknowns $p_0$, p and v. If we simplify the right side
of equation (23) by introducing the notation

$$\frac{K_0\mu_0\omega^2}{c^2} - \frac{4\pi\sigma_0\mu_0\omega i}{c^2} = b_0, \qquad \frac{K\mu\omega^2}{c^2} - \frac{4\pi\sigma\mu\omega i}{c^2} = b, \qquad \frac{4\pi\sigma + K\omega i}{4\pi\sigma_0 + K_0\omega i} = g, \qquad (28)$$

we have

$$\frac{\omega^2}{v^2} - p_0^2 = b_0, \qquad (29)$$

$$\frac{\omega^2}{v^2} - p_0^2 g^2 = b. \qquad (30)$$


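From this,

$$p_0 = \sqrt{\frac{b - b_0}{1 - g^2}}, \qquad (31)$$

and then from equation (27)

$$p = -p_0\,g. \qquad (32)$$

Finally, from (30) and (31),

$$\frac{1}{v} = \pm\frac{1}{\omega}\sqrt{\frac{b - g^2 b_0}{1 - g^2}}. \qquad (33)$$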


It is to be remembered that all three quantities $p_0$, p and v are com-
plex. By separating the real and imaginary parts of the exponential
function it may be shown that the solution has the form

$$H_y^{(0)} = A\,e^{-r_0 z}\,e^{-\gamma x}\,e^{i\omega(t - x/u - s_0 z/\omega)}. \qquad (34)$$


From this result we see that 

1. The amplitude of the waves decreases with the distance z from 
the surface of separation. 

2. The waves suffer a spatial damping in the x-direction also. 

3. The planes of equal phase are not those for which x = const.,
but are the oblique planes $x/u + s_0 z/\omega$ = const.

By substituting the solution in equations (15) and (16) we find
further that the x- and z-components of E show a difference in phase,
so that the end point of the vector E moves on an ellipse, not along 
a straight line. The plane of this ellipse, however, is not normal to the 
direction of propagation, as in the case of elliptically polarized free 
waves, but contains that direction. The above analysis does not 
correspond entirely to the general physical problem, inasmuch as we 
have dealt with plane waves only, without taking the source into 
consideration. The natural extension of the theory consists in seeking 
a solution corresponding to the Hertz solution for both media, taking 
the boundary conditions into account. Sommerfeld’s papers deal with 
this problem, but the details cannot be given here.* 

* An abridged treatment is given in Frank-Mises, Die Differential- und Integral-
gleichungen der Mechanik und Physik, II. Teil, p. 641, Braunschweig, Vieweg & Sohn
(1927). Consult also W. H. Eccles, Wireless Telegraphy and Telephony, p. 169, Van
Nostrand Co., New York (1916).
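
The determination of the three complex unknowns in § 2 can be followed numerically. The sketch below is only an illustration under stated assumptions: it evaluates the notation (28) and equations (31) to (33) in Gaussian units, takes air as the upper medium (K_0 = 1, σ_0 = 0, μ_0 = 1), and uses hypothetical ground constants (K = 10, σ = 9·10^7 sec^-1, frequency 1 megacycle) that are not taken from the text. The function name `surface_wave` is likewise an assumption. The signs of the square roots are chosen so that the field decays away from the interface.

```python
import cmath
import math

C = 2.998e10  # speed of light in cm/s (Gaussian units, as in the text)


def surface_wave(omega, K, sigma, mu=1.0, K0=1.0, sigma0=0.0, mu0=1.0):
    """Return (p0, p, 1/v, b0, b, g) following (28) and (31)-(33)."""
    b0 = (K0 * mu0 * omega**2 - 4j * math.pi * sigma0 * mu0 * omega) / C**2
    b = (K * mu * omega**2 - 4j * math.pi * sigma * mu * omega) / C**2
    g = (4 * math.pi * sigma + 1j * K * omega) / (4 * math.pi * sigma0 + 1j * K0 * omega)
    p0 = cmath.sqrt((b - b0) / (1 - g**2))   # equation (31)
    if p0.real < 0:                          # amplitude must fall off upwards in air
        p0 = -p0
    p = -p0 * g                              # equation (32)
    inv_v = cmath.sqrt(b0 + p0**2) / omega   # from (29); equivalent to (33)
    if inv_v.real < 0:                       # propagation along +x
        inv_v = -inv_v
    return p0, p, inv_v, b0, b, g


# Hypothetical ground constants, chosen only for illustration.
omega = 2 * math.pi * 1.0e6
p0, p, inv_v, b0, b, g = surface_wave(omega, K=10.0, sigma=9.0e7)
print("p0 =", p0)
print("p  =", p)
print("c/v =", C * inv_v)
```

With these assumed constants the real parts of p_0 and p come out positive, and the imaginary part of 1/v is negative, which corresponds to the spatial damping in the x-direction noted in the text.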
3. Consequences of the Boundary Conditions for Insulating Media. The
Optical Laws of Reflection and Refraction. 

The considerations of § 2 with regard to boundary-layer waves 
are not applicable to the short electromagnetic waves of the optical 
range, for the reason that each radiating atom is so far from boundary 
surfaces (compared with the wave-length) that it may be considered a 
free radiator in a uniform medium. However, if a wave sent out from a 
radiating atom encounters the surface of separation of two different 
media, part will be reflected and part will be transmitted. We assume 
the source to be so distant that the wave may be considered plane. 
We also deal, at first, with wave surfaces of unlimited extent. 

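It is assumed that the two media show no absorption, so that they
are insulators, and the interface is taken to be the xy-plane of a rect-
angular system of co-ordinates. Let n, the direction of the incident
ray, be in the xz-plane. For the present we do not wish to make any
special assumptions regarding the directions n' of the reflected ray
or n'' of the refracted ray. Assume that the planes determined by
each of these directions and the z-axis make angles $\phi'$ and $\phi''$ respec-
tively with the xz-plane. Denoting the angles of incidence, reflection
and refraction by $\theta$, $\theta'$ and $\theta''$ respectively (see also fig. 1, p. 351),
we have

$$\begin{aligned}
\mathbf{n} &= \mathbf{i}\sin\theta + \mathbf{k}\cos\theta, \\
\mathbf{n}' &= \mathbf{i}\sin\theta'\cos\phi' + \mathbf{j}\sin\theta'\sin\phi' - \mathbf{k}\cos\theta', \\
\mathbf{n}'' &= \mathbf{i}\sin\theta''\cos\phi'' + \mathbf{j}\sin\theta''\sin\phi'' + \mathbf{k}\cos\theta''.
\end{aligned} \qquad (35)$$

The electric vectors of the three waves are then

$$\begin{aligned}
\mathbf{E} &= \mathbf{A}\,e^{i\omega(t - \mathbf{r}\mathbf{n}\,n_1/c)} = \mathbf{A}\,e^{i\omega\{t - (x\sin\theta + z\cos\theta)\,n_1/c\}}, \\
\mathbf{E}' &= \mathbf{A}'\,e^{i\omega'(t - \mathbf{r}\mathbf{n}'\,n_1/c) + i\delta'} = \mathbf{A}'\,e^{i\omega'\{t - (x\sin\theta'\cos\phi' + y\sin\theta'\sin\phi' - z\cos\theta')\,n_1/c\} + i\delta'}, \\
\mathbf{E}'' &= \mathbf{A}''\,e^{i\omega''(t - \mathbf{r}\mathbf{n}''\,n_2/c) + i\delta''},
\end{aligned} \qquad (36)$$

while the magnetic vectors are given by

$$\mathbf{H} = \sqrt{\frac{K_1}{\mu_1}}\,[\mathbf{n}\mathbf{E}], \qquad \mathbf{H}' = \sqrt{\frac{K_1}{\mu_1}}\,[\mathbf{n}'\mathbf{E}'], \qquad \mathbf{H}'' = \sqrt{\frac{K_2}{\mu_2}}\,[\mathbf{n}''\mathbf{E}'']. \qquad (37)$$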
At the interface we must have, for any values of x and y, at any
instant t,

$$E_x + E_x' = E_x'', \qquad E_y + E_y' = E_y'', \qquad (38)$$

and

$$H_x + H_x' = H_x'', \qquad H_y + H_y' = H_y''. \qquad (39)$$
This is possible only if the exponents of e agree in all three terms.
This implies

$$\omega' = \omega'' = \omega, \qquad (40)$$

i.e. there is no change of frequency for reflection and refraction at a
stationary surface of separation of two media. A second consequence is

$$\delta' = \delta'' = 0 \ \text{or} \ \pi, \qquad (41)$$

which means that if any phase change occurs at the interface, it must
be π. Further, comparison of the exponents yields

$$\phi' = \phi'' = 0,$$

i.e. both the reflected and the refracted rays lie in the plane of inci-
dence, the plane determined by the incident ray and the normal to
the surface; this is the xz-plane in our case. Finally, we obtain

$$\sin\theta = \sin\theta', \qquad (42)$$

the law of reflection: The angle of incidence is equal to the angle of re-
flection; also

$$n_1\sin\theta = n_2\sin\theta'', \qquad (43)$$

or

$$\frac{\sin\theta}{\sin\theta''} = \frac{n_2}{n_1} = n,$$

the law of refraction (Snell's Law): The sine of the angle of incidence
is equal to the product of the index of refraction and the sine of the angle
of refraction. (See also p. 371.)


4. Polarization and Intensity Relationships for Reflection and Refraction. 
Fresnel’s Formulae. 


The laws derived in the last section are also obtainable from almost 
any wave theory of light — in fact, from very elementary considerations. 
A more severe test of the electromagnetic theory is furnished by the 
question of the intensities of the various waves. While elementary 
wave considerations are of no help here, the electromagnetic theory 
gives the required answer directly from the boundary conditions. The 
equality of the exponents, which follows from the continuity of the 
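tangential components and which is expressed by the laws of re-
flection and refraction, yields the relationships (when we assume that
$\mu_1 = \mu_2$)

$$A_x + A_x' = A_x'', \qquad (44)$$

$$A_y + A_y' = A_y'', \qquad (45)$$

$$[\mathbf{n}\mathbf{A}]_x + [\mathbf{n}'\mathbf{A}']_x = n\,[\mathbf{n}''\mathbf{A}'']_x, \qquad (46)$$

$$[\mathbf{n}\mathbf{A}]_y + [\mathbf{n}'\mathbf{A}']_y = n\,[\mathbf{n}''\mathbf{A}'']_y. \qquad (47)$$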
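In component form,

$$[\mathbf{n}\mathbf{A}]_x = -A_y\cos\theta; \qquad [\mathbf{n}'\mathbf{A}']_x = A_y'\cos\theta; \qquad [\mathbf{n}''\mathbf{A}'']_x = -A_y''\cos\theta''. \qquad (48)$$

$$[\mathbf{n}\mathbf{A}]_y = -A_z\sin\theta + A_x\cos\theta; \qquad [\mathbf{n}'\mathbf{A}']_y = -A_z'\sin\theta - A_x'\cos\theta; \qquad [\mathbf{n}''\mathbf{A}'']_y = -A_z''\sin\theta'' + A_x''\cos\theta''. \qquad (49)$$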


This resolution, however, is not the one appropriate to the physical 

problem. The natural resolution is 
that into one component in the 
plane of incidence and another nor- 
mal to this plane. In what follows 
we assume the light to be linearly 
polarized, i.e. the electric vector 
remains in a given plane, called 
the plane of vibration of the light. 
On the other hand, we shall avoid 
the use of the phrase “ plane of 
polarization ”, for this term is 
usually employed for the plane of 
the magnetic vector, and this is less 
important for the effects produced 
(e.g. the photo-electric effect) than 
is the plane of E. The component 
of the vibration normal to the 
plane of incidence is denoted by the 
subscript s and is given the direc- 
tion of the positive y-axis for all three rays: 

$$A_s = A_y, \qquad A_s' = A_y', \qquad A_s'' = A_y''. \qquad (50)$$

The positive direction of the components in the plane of incidence, 
which are characterized by the subscript p, is so chosen that it is ob- 
tained by rotating the direction of the ray through +90° in the plane 
of incidence. We then have (cf. fig. 1) 



Fig. 1

$$A_x = A_p\cos\theta, \qquad A_x' = -A_p'\cos\theta, \qquad A_x'' = A_p''\cos\theta'', \qquad (51)$$

$$A_z = -A_p\sin\theta, \qquad A_z' = -A_p'\sin\theta, \qquad A_z'' = -A_p''\sin\theta''. \qquad (52)$$


Substitution in equations (44) to (49) gives

$$(A_p - A_p')\cos\theta = A_p''\cos\theta'', \qquad (53)$$

$$A_s + A_s' = A_s'', \qquad (54)$$

$$(A_s - A_s')\cos\theta = n\,A_s''\cos\theta'', \qquad (55)$$

$$A_p + A_p' = n\,A_p''. \qquad (56)$$

These are four equations for the four unknowns $A_p'$, $A_s'$, $A_p''$ and $A_s''$.
Their solution is very simple. We have at once from (53) and (56)

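$$A_p' = A_p\,\frac{n\cos\theta - \cos\theta''}{n\cos\theta + \cos\theta''} = A_p\,\frac{\sin 2\theta - \sin 2\theta''}{\sin 2\theta + \sin 2\theta''}, \qquad (57)$$

or

$$A_p' = A_p\,\frac{\tan(\theta - \theta'')}{\tan(\theta + \theta'')}. \qquad (57')$$

Likewise, from equations (54) and (55),

$$A_s' = -A_s\,\frac{\sin(\theta - \theta'')}{\sin(\theta + \theta'')}. \qquad (58)$$

Further, by substituting these values in (53) and (55) we have

$$A_p'' = A_p\,\frac{2\sin\theta''\cos\theta}{\sin(\theta + \theta'')\cos(\theta - \theta'')} \qquad (59)$$

and

$$A_s'' = A_s\,\frac{2\sin\theta''\cos\theta}{\sin(\theta + \theta'')}. \qquad (60)$$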


The equations (57) to (60) giving the relationships between the ampli- 
tudes of the rays are known as Fresnel’s formulae, after their discoverer. 
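
Fresnel's formulae are easily checked numerically against the boundary equations from which they were derived. The following is a minimal sketch, not part of the original treatment: the function name `fresnel` and the sample values (relative index n = 1.5, angle of incidence 40°) are assumptions made only for illustration. It evaluates (57'), (58), (59) and (60) for unit incident amplitudes and forms the residuals of equations (53) to (56), which should vanish to rounding error.

```python
import math


def fresnel(theta, n):
    """Amplitude ratios A_p'/A_p, A_s'/A_s, A_p''/A_p, A_s''/A_s and theta''."""
    theta2 = math.asin(math.sin(theta) / n)                      # Snell's law (43)
    rp = math.tan(theta - theta2) / math.tan(theta + theta2)     # (57')
    rs = -math.sin(theta - theta2) / math.sin(theta + theta2)    # (58)
    tp = 2 * math.sin(theta2) * math.cos(theta) / (
        math.sin(theta + theta2) * math.cos(theta - theta2))     # (59)
    ts = 2 * math.sin(theta2) * math.cos(theta) / math.sin(theta + theta2)  # (60)
    return rp, rs, tp, ts, theta2


# Assumed sample values: glass-like index, oblique incidence.
theta, n = math.radians(40.0), 1.5
rp, rs, tp, ts, theta2 = fresnel(theta, n)

# Residuals of the boundary equations (53)-(56) for A_p = A_s = 1.
residuals = [
    (1 - rp) * math.cos(theta) - tp * math.cos(theta2),      # (53)
    1 + rs - ts,                                             # (54)
    (1 - rs) * math.cos(theta) - n * ts * math.cos(theta2),  # (55)
    1 + rp - n * tp,                                         # (56)
]
print(rp, rs, tp, ts)
print(residuals)
```

The sketch is valid for oblique incidence without total reflection; for normal incidence the trigonometric forms become indeterminate, as noted later in the text.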

The negative sign in equation (58) has the following meaning:
If $\theta > \theta''$, i.e. if the ray encounters an "optically denser" medium
(n > 1), $A_s$ changes its direction upon being reflected. On the other
hand, for $(\theta + \theta'') < \pi/2$, $A_p'$ has the same sign as $A_p$, but the positive
direction is now reversed (p. 351), so that both components change sign on
reflection from a denser medium, which means that the electric vector has
a phase change of π. This jump in phase is of importance in several in-
terference effects that depend on difference of optical path, and must be
taken into account by the addition (or subtraction) of half a wave-length.

The ratio of the amplitude of the normal component to that of the
component parallel to the incident plane gives the tangent of the
azimuth u of the plane of vibration. Division of equation (58) by
equation (57') gives

$$|\tan u'| = \tan u\,\left|\frac{\cos(\theta - \theta'')}{\cos(\theta + \theta'')}\right|, \qquad (61)$$

and similarly, from (60) and (59),

$$|\tan u''| = \tan u\,\cos(\theta - \theta''). \qquad (62)$$

From these results certain others follow.
In the first place, (61) shows that for $\theta + \theta'' = \pi/2$ we always
have

$$|\tan u'| = \infty, \qquad u' = \pi/2. \qquad (61')$$


Thus, if the reflected and refracted rays are perpendicular to each 
other, the plane of vibration of the former will be normal to the inci- 
dent plane; as we may say (see 57'), the component in the plane of 
incidence will not be reflected at all. If, then, we have ordinary light 
— i.e. light whose planes of vibration have any random orientations — 
reflected under these conditions, we obtain plane polarized light whose 
plane of vibration is normal to the plane of incidence. In the older 
literature the incident plane in these circumstances is called the plane 
of polarization, and is the plane of the magnetic vector. 

If we combine Snell's Law (p. 350) with the condition $\theta_P + \theta_P'' = \pi/2$,
we obtain the following relationship for the polarizing angle $\theta_P$, i.e. the
angle of incidence for which natural light is converted into plane
polarized light:

$$\tan\theta_P = n. \qquad (63)$$

This is Brewster's Law. For the reflection of ordinary light at any
other angle we obtain partially polarized light, for then the com-
ponent in the incident plane is also reflected, although to a different
degree from the normal component. Since $\theta$ and $\theta''$ cannot be greater
than π/2, we have for the reflected light

$$|\tan u'| > \tan u, \qquad (64)$$

i.e. the plane of vibration of plane polarized light is turned farther from 
the plane of incidence upon reflection. 
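
Brewster's Law lends itself to a short numerical check. The sketch below assumes a relative index n = 1.5 (a value typical of glass, chosen here only for illustration) and hypothetical function names; it computes the polarizing angle from tan θ_P = n and confirms that at this angle the reflected and refracted rays are perpendicular and that the component in the plane of incidence is not reflected. The first form of (57) is used, since the tangent form becomes singular at θ + θ'' = π/2.

```python
import math


def brewster_angle(n):
    """Polarizing angle from equation (63): tan(theta_P) = n."""
    return math.atan(n)


def reflected_p_ratio(theta, n):
    """A_p'/A_p from the first form of equation (57)."""
    theta2 = math.asin(math.sin(theta) / n)   # Snell's law
    return (n * math.cos(theta) - math.cos(theta2)) / (
        n * math.cos(theta) + math.cos(theta2))


n = 1.5  # assumed index of refraction
theta_p = brewster_angle(n)
theta2_p = math.asin(math.sin(theta_p) / n)

print("polarizing angle in degrees:", math.degrees(theta_p))
print("theta_P + theta_P'' in degrees:", math.degrees(theta_p + theta2_p))
print("A_p'/A_p at the polarizing angle:", reflected_p_ratio(theta_p, n))
```

For n = 1.5 the polarizing angle comes out at about 56.3°.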

According to (62), we see that the plane of vibration of the re- 
fracted ray is turned toward the incident plane, but never comes quite 
into coincidence with it. Nevertheless, since the direction of this 
turning does not change when the ray emerges from a denser medium
($\cos(\theta - \theta'')$ having the same sign, whichever angle is the greater),
we can polarize ordinary light to a high degree by sending it through 
a number of plane parallel plates separated by air films. This is the 
so-called “ pile of plates ” for polarizing light by refraction. 

For normal incidence Fresnel’s formulae give indeterminate values, 
but the solution of the system of equations (53) to (56) (p. 351) is 
particularly simple in this case. The results are 


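$$A_p' = \frac{n - 1}{n + 1}\,A_p, \qquad A_s' = -\frac{n - 1}{n + 1}\,A_s, \qquad A_p'' = \frac{2}{n + 1}\,A_p, \qquad A_s'' = \frac{2}{n + 1}\,A_s. \qquad (65)$$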
For perpendicular incidence there can be no difference between $|A_p'|$
and $|A_s'|$, since the plane of incidence is entirely undetermined in this
instance. The reflecting power R is defined as the ratio of reflected to
incident intensity for normal incidence:

$$R = \frac{I'}{I} = \frac{A'^2}{A^2} = \frac{(n - 1)^2}{(n + 1)^2}. \qquad (66)$$


Ex. 95. The degree of polarization is defined as the ratio

$$P = \frac{I_s - I_p}{I_s + I_p}.$$

Determine the value of P for reflection from glass (n = 1.6) at an angle of inci-
dence of ...

Ex. 96. Show that a balance of energy flow is maintained for reflection and 
refraction by an insulator. 

5. Total Reflection. 

The law of refraction (Snell's Law),

$$\sin\theta'' = \frac{1}{n}\sin\theta,$$

leads, when $\theta$ exceeds a certain value, to values of $\sin\theta''$ which are
greater than unity, i.e. to imaginary values of $\theta''$, for light passing
from an optically denser medium to one which is less dense (n < 1).
The elementary theory contents itself with this observation and con-
cludes from it that no light whatsoever passes over into the less dense
medium; in other words, the reflection is total. But since we have
already used complex functions in our analysis, we need not be dis-
turbed by complex values of $\theta''$; as a matter of fact, we obtain very
important results by extending the Fresnel formulae to include this
case. In these formulae (p. 352) we thus put, if $\sin\theta > n$,

$$\sin\theta'' = \frac{1}{n}\sin\theta, \qquad \cos\theta'' = \frac{i}{n}\sqrt{\sin^2\theta - n^2}, \qquad (67)$$

and obtain

$$A_p' = A_p\,\frac{\sin\theta\cos\theta - \sin\theta''\cos\theta''}{\sin\theta\cos\theta + \sin\theta''\cos\theta''} = A_p\,\frac{\cos\theta - \dfrac{i}{n^2}\sqrt{\sin^2\theta - n^2}}{\cos\theta + \dfrac{i}{n^2}\sqrt{\sin^2\theta - n^2}}. \qquad (68)$$

The complex quantity by which $A_p$ is multiplied is of the form

$$\frac{\rho\,e^{-i\phi}}{\rho\,e^{+i\phi}} = e^{-2i\phi}, \qquad (69)$$

where

$$\tan\phi = \frac{\sqrt{\sin^2\theta - n^2}}{n^2\cos\theta}. \qquad (70)$$
As appears from (36) (p. 349),

$$E_p' = A_p\,e^{i\omega(t - \mathbf{r}\mathbf{n}'\,n_1/c) - 2i\phi}, \qquad (71)$$

the component of E in the incident plane undergoes a change of phase
of $-2\phi$ for total reflection. Similarly we find

$$A_s' = A_s\,\frac{\cos\theta - i\sqrt{\sin^2\theta - n^2}}{\cos\theta + i\sqrt{\sin^2\theta - n^2}} = A_s\,e^{-2i\psi}, \qquad (72)$$

where

$$\tan\psi = \frac{\sqrt{\sin^2\theta - n^2}}{\cos\theta}. \qquad (73)$$


Thus the phase changes are not the same for both components; the 
components are displaced relative to one another. But this means that 
the originally linearly polarized light, in which the electric vector 
remains in a given line in a plane normal to the ray, becomes ellipti- 
cally polarized, i.e. the end point of E describes an ellipse. The phase 
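difference $\delta$ of the components is given, according to the above, by

$$\delta = 2(\phi - \psi). \qquad (74)$$

In order to determine $\delta$ we consider the equation

$$\tan\frac{\delta}{2} = \tan(\phi - \psi) = \frac{\tan\phi - \tan\psi}{1 + \tan\phi\tan\psi} = \frac{\dfrac{\sqrt{\sin^2\theta - n^2}}{\cos\theta}\left(\dfrac{1}{n^2} - 1\right)}{1 + \dfrac{\sin^2\theta - n^2}{n^2\cos^2\theta}}. \qquad (75)$$

After a simple reduction this becomes

$$\tan\frac{\delta}{2} = \frac{\cos\theta\,\sqrt{\sin^2\theta - n^2}}{\sin^2\theta}. \qquad (76)$$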
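
The reduction leading to the closed expression for tan(δ/2) can be verified numerically. The sketch below is an illustration under assumed values (n = 2/3 for glass against air and θ = 60°, the same pair used in the numerical example of this section); the function name is hypothetical. It computes δ once from the separate angles φ and ψ, with tan φ = √(sin²θ − n²)/(n² cos θ) and tan ψ = √(sin²θ − n²)/cos θ, and once from the reduced formula tan(δ/2) = cos θ √(sin²θ − n²)/sin²θ.

```python
import math


def phase_difference(theta, n):
    """Return delta from 2*(phi - psi) and from the reduced formula (76)."""
    root = math.sqrt(math.sin(theta) ** 2 - n ** 2)   # real only for sin(theta) > n
    phi = math.atan(root / (n ** 2 * math.cos(theta)))   # (70)
    psi = math.atan(root / math.cos(theta))              # (73)
    delta_direct = 2 * (phi - psi)                       # (74)
    delta_reduced = 2 * math.atan(
        math.cos(theta) * root / math.sin(theta) ** 2)   # (76)
    return delta_direct, delta_reduced


# Assumed values: glass to air, n = 2/3, angle of incidence 60 degrees.
delta_direct, delta_reduced = phase_difference(math.radians(60.0), 2.0 / 3.0)
print(math.degrees(delta_direct), math.degrees(delta_reduced))
```

Both routes give the same phase difference, roughly 40° for the assumed values.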

Since the complex numbers by which $A_p$ and $A_s$ are multiplied have
the absolute value 1, the reflected amplitudes are equal in magnitude to the
incident ones, i.e. the reflection is total.

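The results obtained by applying the Fresnel formulae to the second
medium, where according to the elementary theory nothing should
occur, seem quite remarkable at first sight. Since the angle $\theta''$ is
complex, we put $\theta'' = \pi/2 + i\beta$ and make use of the relations

$$\cos i\beta = \cosh\beta, \qquad \sin i\beta = i\sinh\beta, \qquad \cosh^2\beta - \sinh^2\beta = 1, \qquad (77)$$

i.e.

$$\sin\theta'' = \cosh\beta = \frac{\sin\theta}{n}, \qquad \cos\theta'' = -i\sinh\beta = -\frac{i}{n}\sqrt{\sin^2\theta - n^2}. \qquad (78)$$

The refracted wave of (36) then takes the form

$$\mathbf{E}'' = \mathbf{A}''\,e^{-(\omega n_2/c)\,z\sinh\beta}\,e^{i\omega(t - n_2 x\cosh\beta/c)}. \qquad (79)$$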
This represents a non-homogeneous wave whose amplitude decreases
with extreme rapidity as it penetrates into the second medium. A
numerical example for glass and air (n = 2/3) shows that for $\theta$ = 60°
the amplitude decreases to 1/180 in travelling a distance equal to the
wave-length. These waves are propagated along the x-axis with the
phase velocity $c/(n_2\cosh\beta)$. It becomes easier to understand that there
is a wave running along the interface in the second medium if we re-
member that a similar condition arises in the first medium when we
have an unlimited beam. The following transformation (cf. p. 67) shows
that the superposition of incident and reflected waves results in a con-
dition which may be looked upon as a harmonically modulated wave
front travelling along the boundary:


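$$E_p + E_p' = A_p\,e^{i\omega\left(t - \frac{x\sin\theta + z\cos\theta}{c/n_1}\right)} + A_p\,e^{i\omega\left(t - \frac{x\sin\theta - z\cos\theta}{c/n_1}\right) - 2i\phi} = 2A_p\cos\left(\frac{\omega n_1}{c}\,z\cos\theta - \phi\right)e^{i\omega\left(t - \frac{x\sin\theta}{c/n_1}\right) - i\phi}. \qquad (80)$$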
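
The figure of 1/180 quoted in the numerical example for glass and air can be reproduced with a few lines. The sketch below rests on an assumption that should be stated plainly: the amplitude in the second medium is taken to fall off as exp(−2π z sinh β / λ''), where λ'' is the wave-length in the second medium and sinh β = √(sin²θ − n²)/n, which is what follows from cos θ'' = −i sinh β. The function name is hypothetical.

```python
import math


def decay_per_wavelength(theta, n):
    """Amplitude ratio after one wave-length of penetration (assumed law above)."""
    sinh_beta = math.sqrt(math.sin(theta) ** 2 - n ** 2) / n
    return math.exp(-2 * math.pi * sinh_beta)


# Values of the example in the text: n = 2/3, angle of incidence 60 degrees.
ratio = decay_per_wavelength(math.radians(60.0), 2.0 / 3.0)
print("amplitude falls to 1/%.0f" % (1.0 / ratio))
```

Under this assumption the amplitude falls to roughly 1/180 of its value at the interface, in agreement with the statement in the text.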



The state of affairs in both media may be represented by a diagram of 


the type shown in fig. 2, in which the 
phenomenon in the second medium is 
represented as a continuation of the wave 
in the first medium. With regard to 
experimental attempts to verify the exis- 
tence of the “ layer of light ” in the 
second medium, the objection might be 
raised that the wave would be disturbed 
by any such experiment, and hence the 
reflection in the first medium would no 



longer be total. Nevertheless, it would
be justifiable to extrapolate from a very
small disturbance in the second medium to no disturbance whatsoever.
As long as no disturbance is present there will, on the average, be no
transfer of energy into the second medium; for if the Poynting vector
S is calculated it is found that at any instant there are places where
energy enters alternating with places where energy leaves, and that
these locations themselves change periodically.

Fig. 2

Ex. 97. What index of refraction with respect to air must a substance have in
order that circularly polarized light be obtained by one total reflection? At what
angle must light be incident on the surface of separation of glass and air (n = ...)
in order to obtain circularly polarized light by two total reflections?
6. Absorbing Media. The Optics of Metals.

In considering the propagation of plane waves in conducting
media (Chap. XVII, § 4, p. 333) it proved necessary to introduce a
complex index of refraction, the quantities n and $\kappa$ being functions
of K and $\sigma$. For optical frequencies, however, we are not permitted
to use the statical values, since these quantities vary considerably
with frequency for short waves. For this reason we characterize a
conductor by n and $\kappa$ instead of by K and $\sigma$ in the optical region.
If desired, these quantities might be calculated from n and $\kappa$, using
formula (23) (p. 334), but this would yield nothing essentially new
for the optical region. The connexion with the electrical values does
become effective, however, at the long infra-red frequencies.

We must thus substitute

$$\sin\theta'' = \frac{\sin\theta}{n(1 - i\kappa)}, \qquad (81)$$

$$\cos\theta'' = \frac{1}{n(1 - i\kappa)}\sqrt{n^2(1 - i\kappa)^2 - \sin^2\theta} \qquad (82)$$

in the Fresnel formulae. The ratio $A_s'/A_p'$ is again complex, i.e. there
is again a phase difference between the components in and perpen-
dicular to the plane of incidence, as in the case of total reflection. The
reflected light is elliptically polarized.

We thus write

$$\frac{A_s'}{A_p'} = -\frac{A_s}{A_p}\,\frac{\cos(\theta - \theta'')}{\cos(\theta + \theta'')} = \frac{A_s}{A_p}\,\rho\,e^{i\delta}. \qquad (83)$$

If we remove the difference in phase by inserting a compensator (a
suitably cut crystal plate) which produces the opposite phase dif-
ference, the azimuth of the restored plane of vibration becomes,
according to (83),

$$|\tan u'| = \left|\frac{A_s'}{A_p'}\right| = \rho\,\frac{A_s}{A_p}. \qquad (84)$$

By writing the complex number

$$-\frac{\cos(\theta - \theta'')}{\cos(\theta + \theta'')}$$

in the form $\rho e^{i\delta}$ we can find the phase difference $\delta$ as well as the
shape of the ellipse of vibration. In general, this ellipse will have an
arbitrary position in the plane normal to n'. There is, however, a
particular angle, called the principal angle of incidence $\theta^*$, for which
the ellipse is so placed that one axis falls in the plane of incidence. In
this case the phase difference between $A_s'$ and $A_p'$ is exactly π/2. The
azimuth of the plane of vibration restored by inserting a compensator
which nullifies the phase difference π/2 (a so-called quarter-wave
plate, usually made of mica) is called the principal azimuth $u'^*$, the
azimuth of the incident light being assumed to be that of the plane
at 45°. Hence


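$$\frac{A_s'}{A_p'} = -i\rho = -\frac{1 + \tan\theta^*\tan\theta^{*\prime\prime}}{1 - \tan\theta^*\tan\theta^{*\prime\prime}}. \qquad (85)$$

It follows from this that

$$\tan\theta^*\tan\theta^{*\prime\prime} = \frac{\rho + i}{\rho - i} = e^{2i\chi}, \qquad (86)$$

where

$$\tan\chi = \frac{1}{\rho} = \cot u'^*$$

(cf. equation (84), p. 357), and hence

$$\chi = \frac{\pi}{2} - u'^*. \qquad (87)$$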

The auxiliary angle $\chi$ introduced here as an aid to calculation is
thus identical with the complement of the principal azimuth $u'^*$.
If we substitute the values of $\sin\theta^{*\prime\prime}$ and $\cos\theta^{*\prime\prime}$ from equations (81)
and (82) in the expression $\tan\theta^*\tan\theta^{*\prime\prime}$, we have

$$\frac{\tan\theta^*\sin\theta^*}{\sqrt{n^2(1 - i\kappa)^2 - \sin^2\theta^*}} = e^{2i\chi}. \qquad (88)$$

Separation of real and imaginary parts gives two equations for the
unknowns n and $\kappa$ in terms of the observable quantities $\theta^*$ and $u'^*$.
The actual values of n and $\kappa$ are such that $|n^2(1 - i\kappa)^2| \gg \sin^2\theta^*$.
Hence, if we neglect $\sin^2\theta^*$ under the radical, we obtain very simple
approximate formulae which are sufficiently accurate for most pur-
poses. Thus

$$\frac{\tan\theta^*\sin\theta^*}{n(1 - i\kappa)} = e^{2i\chi}. \qquad (89)$$

Comparing moduli and arguments of the complex numbers, we obtain

$$\tan\theta^*\sin\theta^* = n\sqrt{1 + \kappa^2}, \qquad \kappa = \tan 2\chi. \qquad (90)$$

It follows that

$$n = \tan\theta^*\sin\theta^*\cos 2\chi, \qquad \kappa = \tan 2\chi. \qquad (91)$$
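
The approximate formulae for the optical constants of a metal are readily applied. The sketch below is an illustration only: the principal angle of incidence (75°) and principal azimuth (50°) are hypothetical readings, not measured values, and the function name is an assumption. It forms χ as the complement of the principal azimuth, evaluates κ = tan 2χ and n = tan θ* sin θ* cos 2χ, and checks that these values satisfy the approximate relation tan θ* sin θ* / (n(1 − iκ)) = e^{2iχ}.

```python
import cmath
import math


def optical_constants(theta_star, u_star):
    """n and kappa from principal angle of incidence and principal azimuth."""
    chi = math.pi / 2 - u_star                   # complement of the azimuth, (87)
    kappa = math.tan(2 * chi)                    # (91)
    n = math.tan(theta_star) * math.sin(theta_star) * math.cos(2 * chi)  # (91)
    return n, kappa, chi


# Hypothetical observations, chosen only for illustration.
theta_star = math.radians(75.0)
u_star = math.radians(50.0)
n, kappa, chi = optical_constants(theta_star, u_star)

# Left and right sides of the approximate relation (89).
lhs = math.tan(theta_star) * math.sin(theta_star) / (n * (1 - 1j * kappa))
rhs = cmath.exp(2j * chi)
print("n =", n, " kappa =", kappa)
print(lhs, rhs)
```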
From equation (65) (p. 353) the reflecting power of metals is seen
to be determined by

$$\frac{A'}{A} = \frac{n - 1 - in\kappa}{n + 1 - in\kappa} = \frac{a\,e^{i\alpha}}{b\,e^{i\beta}} = \frac{a}{b}\,e^{i(\alpha - \beta)}. \qquad (92)$$

The intensity depends only upon the ratio of the squares of the ampli-
tudes. The reflecting power is thus given by

$$R = \left(\frac{a}{b}\right)^2 = \frac{(n - 1)^2 + n^2\kappa^2}{(n + 1)^2 + n^2\kappa^2} = \frac{n^2 + n^2\kappa^2 + 1 - 2n}{n^2 + n^2\kappa^2 + 1 + 2n}. \qquad (93)$$

The greater the value of $\kappa$, the nearer the value of this fraction ap-
proaches to unity. Hence all wave-lengths which are strongly absorbed
are strongly reflected. Hence the colours of such substances by trans-
mitted and by reflected light are complementary, and a thin film of
gold, for example, appears blue by transmitted light.
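
The dependence of the reflecting power on the absorption can be made concrete with a short calculation. The sketch below evaluates R = ((n − 1)² + n²κ²)/((n + 1)² + n²κ²) and, as a cross-check, the squared modulus of (m − 1)/(m + 1) with the complex index m = n(1 − iκ). The function names and the sample values of n and κ are assumptions chosen for illustration, not data for any particular metal.

```python
def reflecting_power(n, kappa):
    """Reflecting power at normal incidence for complex index n(1 - i*kappa)."""
    return ((n - 1) ** 2 + (n * kappa) ** 2) / ((n + 1) ** 2 + (n * kappa) ** 2)


def reflecting_power_complex(n, kappa):
    """Same quantity computed as |(m - 1)/(m + 1)|^2 with m = n(1 - i*kappa)."""
    m = complex(n, -n * kappa)
    return abs((m - 1) / (m + 1)) ** 2


# Transparent case (kappa = 0) reduces to (n - 1)^2 / (n + 1)^2.
print(reflecting_power(1.5, 0.0))

# Hypothetical absorbing media: R approaches unity as kappa increases.
for kappa in (1.0, 5.0, 20.0):
    print(kappa, reflecting_power(0.6, kappa))
```

For κ = 0 and n = 1.5 the expression reduces to the value for an insulator, R = 0.04, while for increasing κ the reflecting power rises towards unity, as stated in the text.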

If a plane wave is incident obliquely on the surface separating a metal from 
a vacuum, the refracted wave is inhomogeneous, i.e. the planes of equal amplitude 
are no longer identical with the planes of equal phase. This is the case because 
the various parts of the wave front (which is a plane of constant phase) have 
traversed different thicknesses of the absorbing material. Reviewing the derivation 
of the law of refraction (p. 349), one sees that this law must hold even when the 
index is complex; indeed, the Fresnel formulae, which involve the refractive index, 
were applied to metals. On the other hand, if we examine the sine relation with- 
out troubling about absorption, a lengthy calculation shows that the ratio of 
sines depends on the angle between the planes of equal phase and the planes of
equal amplitude, and so must depend on the angle of incidence. Hence for the
refraction itself the familiar law no longer holds. Because of the strong absorp-
tion, however, the refracted waves are only slightly inhomogeneous, so that this
remarkable phenomenon is very inconsequential in practice.



CHAPTER XIX 


Electromagnetic Waves: III. Propagation in Anisotropic

Media. The Optics of Crystals


1. The Field Equations for Anisotropic Bodies.

For anisotropic media — we have in view chiefly crystals which do 
not belong to the regular system— the relationships are more complex 
in that the polarizability of the atoms cannot be expressed by a simple 
scalar, as in the case of a free atom, for which this quantity averaged 
over all orientations is the same in all directions. Since it is part of 
a crystal lattice, the atom is no longer perfectly free to turn and hence 
it is no longer possible to average over all orientations. Also, in addi- 
tion to that due to the character of the atom itself, a further element 
of anisotropy may be introduced by the presence of neighbouring atoms. 
If the polarization is different for the various components of E, the 
vector P no longer has the direction of E. One thing, however, 
must be preserved: the linear connexion between P and E. P thus 
becomes a linear vector function of E, and hence the susceptibility 
is a tensor: 

P = TE (1) 


But the vector of the dielectric displacement D also becomes a linear 
function of E, and the dielectric constant is now a tensor: 

$$\mathbf{D} = \mathbf{E} + 4\pi\mathbf{P} = \mathbf{E} + 4\pi T\mathbf{E} = \Phi\mathbf{E} \tag{2}$$


In component form, 

$$\begin{aligned}
D_x &= K_{11}E_x + K_{12}E_y + K_{13}E_z,\\
D_y &= K_{21}E_x + K_{22}E_y + K_{23}E_z,\\
D_z &= K_{31}E_x + K_{32}E_y + K_{33}E_z.
\end{aligned} \tag{3}$$

Since we deal principally with transparent crystals, ferromagnetic 
bodies, which at the same time possess metallic conductivity, do not 
come into consideration, and we can set the permeability equal to 
unity. With this stipulation, the second field equation remains unchanged. 
How must the first field equation be written for the present 
case? Since every change of polarization means a motion of electric 
charges, or a current, of density $\partial\mathbf{P}/\partial t$, we have in general for all media 

$$\operatorname{curl}\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi}{c}\frac{\partial\mathbf{P}}{\partial t} = \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t}. \tag{4}$$


If we consider the derivation of the field equations in Chap. XVIII, 
§ 1 (p. 343), which was based upon perfectly general principles, we see 
that in order to obtain the form (4) we must make the following as- 
sumption for the energy density in the general case, i.e. for anisotropic 
media: 


$$u_{el} = \frac{1}{8\pi}\mathbf{E}\mathbf{D}. \tag{5}$$


The energy flux vector, which contains only E and H, continues to 
be given by 

$$\mathbf{S} = \frac{c}{4\pi}[\mathbf{E}\mathbf{H}].$$


The conservation of energy is thus expressed by 

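$$\oint \frac{c}{4\pi}[\mathbf{E}\mathbf{H}]_n\,dS = -\frac{1}{8\pi}\frac{\partial}{\partial t}\int(\mathbf{E}\mathbf{D} + \mathbf{H}^2)\,d\tau, \tag{6}$$

or 

$$\frac{c}{4\pi}\int(\mathbf{H}\operatorname{curl}\mathbf{E} - \mathbf{E}\operatorname{curl}\mathbf{H})\,d\tau = -\frac{1}{8\pi}\frac{\partial}{\partial t}\int(\mathbf{E}\mathbf{D} + \mathbf{H}^2)\,d\tau = -\frac{1}{8\pi}\int\left(\mathbf{E}\frac{\partial\mathbf{D}}{\partial t} + \mathbf{D}\frac{\partial\mathbf{E}}{\partial t} + 2\mathbf{H}\frac{\partial\mathbf{H}}{\partial t}\right)d\tau. \tag{7}$$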


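We again obtain the field equations as on p. 344 by equating separately 
the factors by which E and H are multiplied. The second equation follows 
as before, but the first equation in the form (4) results only if 

$$\mathbf{E}\frac{\partial\mathbf{D}}{\partial t} = \mathbf{D}\frac{\partial\mathbf{E}}{\partial t}. \tag{8}$$

It follows that $\Phi$ is a symmetric tensor, for if the condition, viz. 

$$\begin{aligned}
&E_x\left(K_{11}\frac{\partial E_x}{\partial t} + K_{12}\frac{\partial E_y}{\partial t} + K_{13}\frac{\partial E_z}{\partial t}\right)
+ E_y\left(K_{21}\frac{\partial E_x}{\partial t} + K_{22}\frac{\partial E_y}{\partial t} + K_{23}\frac{\partial E_z}{\partial t}\right)
+ E_z\left(K_{31}\frac{\partial E_x}{\partial t} + K_{32}\frac{\partial E_y}{\partial t} + K_{33}\frac{\partial E_z}{\partial t}\right)\\
&\quad = (K_{11}E_x + K_{12}E_y + K_{13}E_z)\frac{\partial E_x}{\partial t}
+ (K_{21}E_x + K_{22}E_y + K_{23}E_z)\frac{\partial E_y}{\partial t}
+ (K_{31}E_x + K_{32}E_y + K_{33}E_z)\frac{\partial E_z}{\partial t}
\end{aligned} \tag{8'}$$

is to be valid for all values of $E_x$, $E_y$ and $E_z$, we must have 

$$K_{ik} = K_{ki}. \tag{9}$$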

In this case the connexion between D and E may be visualized by 
means of the tensor ellipsoid. If we put E = r* and draw the surface, 

$$\mathbf{D}\mathbf{r} = K_{11}x^2 + 2K_{12}xy + K_{22}y^2 + 2K_{23}yz + K_{33}z^2 + 2K_{31}zx = 1, \tag{10}$$


then, according to p. 37, the direction of D is given by the normals to 
the surface, while its magnitude is the reciprocal of the projection of 
r on the normal. In what follows we shall employ only the co-ordinate 
system of the principal axes, in which system the equation of the 
ellipsoid is 

$$K_I x'^2 + K_{II} y'^2 + K_{III} z'^2 = 1. \tag{11}$$

$K_I$, $K_{II}$, $K_{III}$ are called the principal dielectric constants of the crystal. 
The axes of the ellipsoid are proportional to the quantities 

$$\frac{1}{\sqrt{K_I}},\quad \frac{1}{\sqrt{K_{II}}},\quad \frac{1}{\sqrt{K_{III}}}.$$


Now, in an isotropic body, the velocity of light is $c/\sqrt{K}$. Similarly, 
we call the quantities 

$$\frac{c}{\sqrt{K_I}},\quad \frac{c}{\sqrt{K_{II}}},\quad \frac{c}{\sqrt{K_{III}}}$$

the principal velocities of light in the crystal. The axes of this so- 
called Fresnel ellipsoid are thus proportional to the principal veloci- 
ties of light. In the system of the principal axes we have the simple 
relationships : 

$$D_{x'} = K_I E_{x'},\quad D_{y'} = K_{II} E_{y'},\quad D_{z'} = K_{III} E_{z'}. \tag{12}$$

However, since $K_I$, $K_{II}$ and $K_{III}$ are unequal, the resulting vector D will 
no longer be parallel to E. 

* Of course, this relationship must contain some dimensional factor, since we 
cannot set an electrical field strength equal to a length. Such a factor, of magnitude 
unity, is assumed to be contained in this equation. 

In this same co-ordinate system we also have, by equation (12): 

$$E_{x'} = \frac{1}{K_I} D_{x'},\quad E_{y'} = \frac{1}{K_{II}} D_{y'},\quad E_{z'} = \frac{1}{K_{III}} D_{z'}, \tag{13}$$


i.e. the principal axes of the Fresnel ellipsoid are also the principal 
axes of that tensor ellipsoid which, reciprocally, gives E as a function 
of D. Its equation is 


$$\frac{x'^2}{K_I} + \frac{y'^2}{K_{II}} + \frac{z'^2}{K_{III}} = 1. \tag{14}$$


Thus the axes are proportional to the square roots of the principal 
dielectric constants, that is, the principal indices of refraction. The 
figure is thus called the index ellipsoid. The corresponding tensor is 
denoted by $\Phi^{-1}$, and so 

$$\mathbf{E} = \Phi^{-1}\mathbf{D}. \tag{15}$$
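
The relations (12) and (13) can be tried out numerically. The following is a minimal Python sketch, assuming hypothetical principal dielectric constants K = (2, 3, 4) chosen only for illustration; the function names are not from the text. It suggests that D is parallel to E only when E lies along a principal axis.

```python
import math

def displacement(K, E):
    """Eq. (12): D_i = K_i E_i in the principal-axis system."""
    return tuple(k * e for k, e in zip(K, E))

def field_from_displacement(K, D):
    """Eq. (13): E_i = D_i / K_i, i.e. the action of the inverse tensor."""
    return tuple(d / k for k, d in zip(K, D))

def angle_deg(a, b):
    """Angle between two vectors, in degrees."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

K = (2.0, 3.0, 4.0)  # hypothetical principal dielectric constants

E_axis = (1.0, 0.0, 0.0)      # E along a principal axis
E_oblique = (1.0, 1.0, 1.0)   # E in a general direction
angle_axis = angle_deg(E_axis, displacement(K, E_axis))
angle_oblique = angle_deg(E_oblique, displacement(K, E_oblique))
print(angle_axis, angle_oblique)
```

For the oblique field the angle between E and D comes out at roughly 15 degrees with these illustrative constants, while along the axis it vanishes.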


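2. Plane Electromagnetic Waves in Anisotropic Media. 

We assume the electric vector to be representable by a plane wave 
whose planes of equal phase are taken normal to the unit vector n. 
The wave is thus propagated in the direction of n with the velocity v. 
We set 

$$\mathbf{E} = \mathbf{E}_0\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}. \tag{16}$$

We then have, from the second field equation, 

$$\frac{\partial\mathbf{H}}{\partial t} = -c\,\operatorname{curl}\mathbf{E}_0\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)} = c\,[\mathbf{E}_0,\ \operatorname{grad} e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}]. \tag{17}$$

Since grad nr = n, we obtain after integration with respect to t* 

$$\mathbf{H} = \frac{c}{v}\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}[\mathbf{n}\mathbf{E}_0] = \mathbf{H}_0\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}, \tag{18}$$

and 

$$\mathbf{H}_0 = \frac{c}{v}[\mathbf{n}\mathbf{E}_0]. \tag{19}$$

Thus the magnetic vector is normal to the plane of E and n. 

* For if we put nr = u, grad u becomes equal to n and 

$$\operatorname{grad} e^{i\omega(t - u/v)} = -\frac{i\omega}{v}\, e^{i\omega(t - u/v)} \operatorname{grad} u,$$

i.e. 

$$[\mathbf{E}_0,\ \operatorname{grad} e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}] = \frac{i\omega}{v}\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}[\mathbf{n}\mathbf{E}_0].$$

However, we do not yet have any information regarding the angle 
between E and n. From the first field equation, 

$$\frac{\partial\mathbf{D}}{\partial t} = c\,\operatorname{curl}\mathbf{H} = c\,\operatorname{curl}\mathbf{H}_0\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)} = -\frac{i\omega c}{v}\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}[\mathbf{n}\mathbf{H}_0]. \tag{20}$$

Integrating, and inserting (19), we obtain 

$$\mathbf{D} = -\frac{c^2}{v^2}[\mathbf{n}[\mathbf{n}\mathbf{E}_0]]\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)} = \mathbf{D}_0\, e^{i\omega(t - \mathbf{n}\mathbf{r}/v)}. \tag{21}$$

This signifies that D is normal to n. With respect to D and H we thus 
have pure transverse waves. From (21) we obtain an equation which 
is of fundamental importance in what follows: * 

$$\mathbf{D} = \frac{c^2}{v^2}(\mathbf{E} - \mathbf{n}\,.\,\mathbf{n}\mathbf{E}),$$

or 

$$\frac{v^2}{c^2}\mathbf{D} - \mathbf{E} + \mathbf{n}\,.\,\mathbf{n}\mathbf{E} = 0. \tag{22}$$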


This equation states that D, E and n lie in the same plane, H being normal 
to this plane, as we saw above. But the vector 

$$\mathbf{S} = \frac{c}{4\pi}[\mathbf{E}\mathbf{H}]$$

also lies in this plane; moreover, S forms the same angle θ with n as does 
E with D (cf. fig. 1). S gives the direction of energy flow; this means that a 
bounded portion of the wave front travels in this direction while its normal 
continues to have the direction of n. 

The phenomenon is to be compared with a company of soldiers executing a 
"right oblique". As may be seen directly from the figure, the velocity 
of the energy flow is 

$$V = \frac{v}{\cos\theta}. \tag{23}$$

V is also called the ray velocity. 

If we decompose (22) into components referred to the principal 
system of axes, with which we suppose n to make angles 
α, β, γ, and if we express E in terms of D in the second term, 
then 

$$\left(\frac{v^2}{c^2} - \frac{1}{K_I}\right) D_x = -\mathbf{n}\mathbf{E}\cos\alpha \quad\text{or}\quad D_x = \frac{\mathbf{n}\mathbf{E}\cos\alpha}{\dfrac{1}{K_I} - \dfrac{v^2}{c^2}}.$$

Similarly, 

$$D_y = \frac{\mathbf{n}\mathbf{E}\cos\beta}{\dfrac{1}{K_{II}} - \dfrac{v^2}{c^2}},\qquad D_z = \frac{\mathbf{n}\mathbf{E}\cos\gamma}{\dfrac{1}{K_{III}} - \dfrac{v^2}{c^2}}. \tag{24}$$

* The subscript 0 is now omitted, since the equation is true for the instantaneous 
values E, H and D as well as for the amplitudes $\mathbf{E}_0$, $\mathbf{H}_0$ and $\mathbf{D}_0$. 


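The scalar product Dn vanishes, since D is perpendicular to n. If we 
express this product in terms of the components given by (24), and 
if we divide by the non-vanishing product nE, we obtain the relationship 

$$\frac{\cos^2\alpha}{\dfrac{1}{K_I} - \dfrac{v^2}{c^2}} + \frac{\cos^2\beta}{\dfrac{1}{K_{II}} - \dfrac{v^2}{c^2}} + \frac{\cos^2\gamma}{\dfrac{1}{K_{III}} - \dfrac{v^2}{c^2}} = 0. \tag{25}$$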

This is a quadratic equation for $v^2$ in terms of the components of n; 
hence to every wave normal there correspond, in general, two phase 
velocities. For the direction of the x-axis, for example, we have 
cos α = 1, cos β = cos γ = 0. Hence 

$$v' = \frac{c}{\sqrt{K_{II}}},\qquad v'' = \frac{c}{\sqrt{K_{III}}}.$$

We may therefore replace $c/\sqrt{K_I}$, ... by the principal velocities 
$v_I$, $v_{II}$ and $v_{III}$, and thus obtain the more elegant form 

$$\frac{\cos^2\alpha}{v_I^2 - v^2} + \frac{\cos^2\beta}{v_{II}^2 - v^2} + \frac{\cos^2\gamma}{v_{III}^2 - v^2} = 0. \tag{25'}$$
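
Equation (25') becomes an ordinary quadratic in $v^2$ once the denominators are cleared. The following is a small Python sketch of that step, assuming illustrative principal velocities 1, 2, 3 in arbitrary units; the function name `phase_velocities` is hypothetical and not from the text.

```python
import math

def phase_velocities(n, v1, v2, v3):
    """Return the two phase velocities (smaller, larger) for the wave normal n.

    Multiplying (25') by the three denominators gives, with x = v^2,
        x^2 - B x + C = 0,
    because the squared direction cosines sum to one.
    """
    a2, b2, g2 = n[0] ** 2, n[1] ** 2, n[2] ** 2
    norm2 = a2 + b2 + g2                      # normalise n defensively
    a2, b2, g2 = a2 / norm2, b2 / norm2, g2 / norm2
    p, q, r = v1 ** 2, v2 ** 2, v3 ** 2
    B = a2 * (q + r) + b2 * (r + p) + g2 * (p + q)
    C = a2 * q * r + b2 * r * p + g2 * p * q
    root = math.sqrt(max(B * B - 4.0 * C, 0.0))
    return math.sqrt((B - root) / 2.0), math.sqrt((B + root) / 2.0)

# Normal along the x-axis: the two velocities are v_II and v_III.
print(phase_velocities((1.0, 0.0, 0.0), 1.0, 2.0, 3.0))  # prints (2.0, 3.0)
```

For a normal along a principal axis the result reproduces the two principal velocities belonging to the other two axes, in agreement with the special case discussed above.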


If we know the values of v corresponding to a given direction (call 
them v' and v''), we obtain the directions of the corresponding vectors 
D' or D'' by substituting v' or v'' in the following equations, which 
come from (24): 

$$D_x : D_y : D_z = \frac{\cos\alpha}{v_I^2 - v^2} : \frac{\cos\beta}{v_{II}^2 - v^2} : \frac{\cos\gamma}{v_{III}^2 - v^2}. \tag{26}$$


These two directions are mutually perpendicular, for we have 

$$\mathbf{D}'\mathbf{D}'' = (\mathbf{n}\mathbf{E}')(\mathbf{n}\mathbf{E}'')\,c^4\left[\frac{\cos^2\alpha}{(v_I^2 - v'^2)(v_I^2 - v''^2)} + \frac{\cos^2\beta}{(v_{II}^2 - v'^2)(v_{II}^2 - v''^2)} + \frac{\cos^2\gamma}{(v_{III}^2 - v'^2)(v_{III}^2 - v''^2)}\right]. \tag{27}$$

Now we also have the identity 

$$\frac{\cos^2\alpha}{v_I^2 - v'^2} - \frac{\cos^2\alpha}{v_I^2 - v''^2} = \frac{(v'^2 - v''^2)\cos^2\alpha}{(v_I^2 - v'^2)(v_I^2 - v''^2)}, \tag{28}$$

whence 

$$\mathbf{D}'\mathbf{D}'' = \frac{(\mathbf{n}\mathbf{E}')(\mathbf{n}\mathbf{E}'')\,c^4}{v'^2 - v''^2}\left[\frac{\cos^2\alpha}{v_I^2 - v'^2} + \frac{\cos^2\beta}{v_{II}^2 - v'^2} + \frac{\cos^2\gamma}{v_{III}^2 - v'^2} - \frac{\cos^2\alpha}{v_I^2 - v''^2} - \frac{\cos^2\beta}{v_{II}^2 - v''^2} - \frac{\cos^2\gamma}{v_{III}^2 - v''^2}\right]. \tag{27'}$$

But since v' and v'' are the roots of equation (25'), the sums of the 
positive and negative terms vanish separately, which proves our statement. 
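
The perpendicularity of D' and D'' can also be checked numerically. The sketch below, in Python, assumes illustrative principal velocities 1, 2, 3 and a wave normal along (1, 2, 3); the helper names are hypothetical. It solves (25') for the two values of $v^2$ and builds the two directions from (26).

```python
import math

def roots_v_squared(n, vsq):
    """Both roots x = v^2 of (25') for a unit normal n; vsq holds v_I^2, v_II^2, v_III^2."""
    a2, b2, g2 = (c * c for c in n)
    p, q, r = vsq
    B = a2 * (q + r) + b2 * (r + p) + g2 * (p + q)
    C = a2 * q * r + b2 * r * p + g2 * p * q
    root = math.sqrt(max(B * B - 4.0 * C, 0.0))
    return (B - root) / 2.0, (B + root) / 2.0

def d_direction(n, vsq, x):
    """Unit vector along D from (26): D_i proportional to n_i / (v_i^2 - v^2)."""
    d = [ni / (vi - x) for ni, vi in zip(n, vsq)]
    norm = math.sqrt(sum(c * c for c in d))
    return [c / norm for c in d]

vsq = (1.0, 4.0, 9.0)                       # illustrative principal velocities squared
length = math.sqrt(1.0 + 4.0 + 9.0)
n = (1.0 / length, 2.0 / length, 3.0 / length)

x1, x2 = roots_v_squared(n, vsq)
d1 = d_direction(n, vsq, x1)
d2 = d_direction(n, vsq, x2)
dot_d1_d2 = sum(a * b for a, b in zip(d1, d2))
dot_d1_n = sum(a * b for a, b in zip(d1, n))
print(dot_d1_d2, dot_d1_n)                  # both vanish up to rounding error
```

The construction fails when a denominator in (26) vanishes, for instance for a normal lying along a principal axis, so a general direction is used here.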


The direction of D may be determined geometrically, using the index ellipsoid: 
Cut the ellipsoid by a plane normal to n. The principal axes of the resulting elliptical 
section give the directions of D' and D''. Proof: The principal axes are 
characterized by the fact that $r^2$ is an extremum for these directions, with the 
auxiliary condition that the terminus of r is in the plane rn = 0 as well as on 
the ellipsoid $\mathbf{r}\Phi^{-1}\mathbf{r} = 1$. The extremal condition gives 

$$\mathbf{r}\,\delta\mathbf{r} = 0, \tag{29}$$

and the variations of the auxiliary conditions are 

$$\mathbf{n}\,\delta\mathbf{r} = 0 \quad\text{and}\quad \delta\mathbf{r}\,\Phi^{-1}\mathbf{r} = 0.* \tag{30}$$

Introducing the Lagrangian multipliers λ and μ, we obtain the equation 

$$\mathbf{r} + \lambda\mathbf{n} + \mu\Phi^{-1}\mathbf{r} = 0. \tag{31}$$

Scalar multiplication by r and also by n yields the following equations, since 
rn = 0 and $\mathbf{r}\Phi^{-1}\mathbf{r} = 1$: 

$$\mu = -r^2,\qquad \lambda = r^2\,\mathbf{n}\Phi^{-1}\mathbf{r},$$

whence 

$$\frac{\mathbf{r}}{r^2} + \mathbf{n}\,.\,\mathbf{n}\Phi^{-1}\mathbf{r} - \Phi^{-1}\mathbf{r} = 0. \tag{32}$$

Since r = D/p in the index ellipsoid, p being a factor of proportionality, we 
obtain † 

$$\frac{p^2}{D^2}\mathbf{D} - \Phi^{-1}\mathbf{D} + \mathbf{n}\,.\,\mathbf{n}\Phi^{-1}\mathbf{D} = \frac{p^2}{D^2}\mathbf{D} - \mathbf{E} + \mathbf{n}\,.\,\mathbf{n}\mathbf{E} = 0. \tag{33}$$

Identifying $p^2/D^2$ with $v^2/c^2$, we see that this is precisely equation (22) (p. 364). 
Hence the construction is justified. 

* For according to the way the index ellipsoid was introduced, $\mathbf{E} = \Phi^{-1}(\mathbf{r})$ has the 
direction of the normal, and the condition that the variation δr remains on the ellipsoid 
means that δr must be perpendicular to E. 

† The introduction of the factor p is necessary when comparing equation (33) with 
equation (22). 

Having thus found the two vectors D' and D'' corresponding to 
a normal n, the directions of the associated vectors E' and E'' are 
given as those of the normals to the ellipsoid through the termini of 
D' and D''. These are, in general, not mutually perpendicular. The 
vectors H' and H'' are normal to the planes of D' and n and D'' and n 
respectively. There are two rays corresponding to each wave-normal, 
viz.: 

$$\mathbf{S}' = \frac{c}{4\pi}[\mathbf{E}'\mathbf{H}'] \quad\text{and}\quad \mathbf{S}'' = \frac{c}{4\pi}[\mathbf{E}''\mathbf{H}''].$$

We designate the corresponding unit vectors by s' and s''. 

While the direction of n leaves an ambiguity, that of D determines 
n as well as E and S, and the magnitude of v uniquely; for n, on 
one hand, must be in the plane normal to D, and, in addition, the 
vector E is uniquely determined for a given D by the index ellipsoid. 
Since n must lie also in the plane of D and E, the intersection of the 
two planes gives the direction of n. The magnitude of v is obtained 
most simply from equation (22) (p. 364): If we multiply this equation 
scalarly by D, we have 

$$v = \frac{c\sqrt{\mathbf{E}\mathbf{D}}}{\sqrt{D^2}} = \frac{c\sqrt{\mathbf{D}\Phi^{-1}\mathbf{D}}}{\sqrt{D^2}}. \tag{34}$$
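
The chain of reasoning just given (D fixes E, then v, then n, and finally the ray velocity) can be sketched in a few lines of Python. This is an illustration under stated assumptions: the principal dielectric constants K = (2, 3, 4), the choice c = 1 and the function name `wave_from_D` are all hypothetical, and D is expressed in the principal-axis system.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def wave_from_D(D, K, c=1.0):
    """Given D (principal-axis components), return E, n, v and V.

    E follows from (13), v from (34), the direction of n from (22) in the
    form n(nE) = E - (v^2/c^2) D, and V from (23) with theta the angle
    between E and D. The sign of n is left undetermined by this construction.
    """
    E = tuple(d / k for d, k in zip(D, K))
    v = c * math.sqrt(dot(E, D) / dot(D, D))
    m = tuple(e - (v / c) ** 2 * d for e, d in zip(E, D))
    length = math.sqrt(dot(m, m))
    if length < 1e-14:
        # E parallel to D: D lies along a principal axis and (22) does not fix n.
        raise ValueError("n is not determined when D is parallel to E")
    n = tuple(x / length for x in m)
    cos_theta = dot(E, D) / math.sqrt(dot(E, E) * dot(D, D))
    V = v / cos_theta
    return E, n, v, V

K = (2.0, 3.0, 4.0)                 # hypothetical principal dielectric constants
D = (1.0, 1.0, 0.0)
E, n, v, V = wave_from_D(D, K)
print(n, v, V)
```

With these numbers n comes out perpendicular to D, as it must, and the ray velocity V slightly exceeds the phase velocity v.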



On the other hand, if we start with the direction s of the ray, we 
obtain completely analogous relationships in which D and E have 
interchanged their rôles. The fact that D, E and s lie in one plane 
may be expressed by the equation (cf. Ex. 2, p. 10): 

$$\mathbf{D} + \alpha\mathbf{E} + \beta\mathbf{s} = 0. \tag{35}$$

The coefficients α and β are obtained by scalar multiplication of this 
equation by s and by n in turn: Since s is normal to E, multiplication 
by s gives 

$$\beta = -\mathbf{D}\mathbf{s},$$

and since n is normal to D, multiplication by n yields, after inserting 
the value of β, 

$$\alpha = \frac{(\mathbf{D}\mathbf{s})(\mathbf{s}\mathbf{n})}{\mathbf{E}\mathbf{n}}.$$

But from equation (22) (p. 364) we have 

$$\mathbf{D}\mathbf{s} = -\frac{c^2}{v^2}(\mathbf{n}\mathbf{s})(\mathbf{E}\mathbf{n}),$$

so that 

$$\alpha = -\frac{c^2}{v^2}(\mathbf{n}\mathbf{s})^2 = -\frac{c^2}{v^2}\cos^2\theta = -\frac{c^2}{V^2}.$$

Then we have 

$$\frac{c^2}{V^2}\mathbf{E} - \mathbf{D} + \mathbf{s}\,.\,\mathbf{s}\mathbf{D} = 0. \tag{36}$$

Comparison of this equation with (22) shows that D and E are interchanged, 
and that $c^2/V^2$ appears in place of $v^2/c^2$. 

If we denote the components of s, i.e. the direction cosines of the 
ray, by cos A, cos B, cos Γ, and express the first D in (36) in terms of 
E, we obtain the component equations 

$$E_x = \frac{\mathbf{D}\mathbf{s}\cos A}{K_I - \dfrac{c^2}{V^2}},\qquad E_y = \frac{\mathbf{D}\mathbf{s}\cos B}{K_{II} - \dfrac{c^2}{V^2}},\qquad E_z = \frac{\mathbf{D}\mathbf{s}\cos\Gamma}{K_{III} - \dfrac{c^2}{V^2}}. \tag{37}$$

Using these components to express the vanishing of the scalar product 
Es and replacing $K_I$, ... by $c^2/v_I^2$, ..., we obtain the 
equation 

$$\frac{v_I^2\cos^2 A}{v_I^2 - V^2} + \frac{v_{II}^2\cos^2 B}{v_{II}^2 - V^2} + \frac{v_{III}^2\cos^2\Gamma}{v_{III}^2 - V^2} = 0. \tag{38}$$

This is a quadratic equation for $V^2$ as a function of the direction of 
the ray. The directions of the corresponding E' and E'' are obtained 
as for D' and D'' above, either analytically from equation (37) or geometrically 
as the axes of the ellipse cut from the Fresnel ellipsoid by 
a plane through the centre and normal to s. The corresponding vectors 
D' and D'', which are not mutually perpendicular, are then found 
from the ellipsoid in the usual way. As demonstrated for D above, it 
also may be shown that E uniquely determines the wave. 
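
The ray equation (38) can be solved in the same way as (25'). The following Python sketch assumes illustrative principal velocities 1, 2, 3; the function name `ray_velocities` is hypothetical. Clearing the denominators of (38) gives a quadratic in $X = V^2$.

```python
import math

def ray_velocities(s, v1, v2, v3):
    """Return the two ray velocities (smaller, larger) for the ray direction s.

    Multiplying (38) by the three denominators gives a X^2 - b X + c0 = 0
    with X = V^2, where the constant term uses cos^2 A + cos^2 B + cos^2 G = 1.
    """
    A2, B2, G2 = s[0] ** 2, s[1] ** 2, s[2] ** 2
    norm2 = A2 + B2 + G2                      # normalise s defensively
    A2, B2, G2 = A2 / norm2, B2 / norm2, G2 / norm2
    p, q, r = v1 ** 2, v2 ** 2, v3 ** 2
    a = p * A2 + q * B2 + r * G2
    b = p * A2 * (q + r) + q * B2 * (r + p) + r * G2 * (p + q)
    c0 = p * q * r
    root = math.sqrt(max(b * b - 4.0 * a * c0, 0.0))
    return math.sqrt((b - root) / (2.0 * a)), math.sqrt((b + root) / (2.0 * a))

# Ray along the x-axis: ray and normal coincide, so V equals v_II and v_III.
print(ray_velocities((1.0, 0.0, 0.0), 1.0, 2.0, 3.0))  # prints (2.0, 3.0)
```

Along a principal axis the ray velocities agree with the phase velocities, since there D and E are parallel and θ = 0 in (23).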


3. Normal Surface and Wave Surface. The Optical Axes. 


We now consider an optical disturbance spreading outwards from 
a point. This point need not be an actual source of light, but, as we 
shall see in the next chapter, may be any point of a wave front. We 
shall find that under certain conditions the so-called Principle of 
Huygens permits us to consider any such point as a new source of 
light. We may represent the light emanating from O in such a way that 
a surface element, a portion of a plane wave front, travels in each 
direction. After one second, these wave fronts have travelled a distance 
v in the direction of their normals. If we measure off the corresponding 
segments OP = r = v on these normals, the end points 
form a surface called the normal surface. However, each of the actual 
elementary wave fronts does not lie on this surface, but somewhere 
to the side of the corresponding point P (see fig. 2). Moreover, all 
elementary wave fronts emanating from O lie on a new surface called 
the ray surface or wave surface. First let us consider the normal surface. 
We obtain its equation by putting r in place of v and x/r, ... 
in place of cos α, ... in equation (25') (p. 363): 

$$(v_{II}^2 - r^2)(v_{III}^2 - r^2)\,x^2 + (v_{III}^2 - r^2)(v_I^2 - r^2)\,y^2 + (v_I^2 - r^2)(v_{II}^2 - r^2)\,z^2 = 0. \tag{39}$$


This is a surface of the sixth degree and consists of two sheets, corresponding 
to the fact that there are two phase velocities for every direction. 
We wish to consider the curves of intersection of this surface with 
the co-ordinate planes, which, as planes of symmetry of the tensor 
ellipsoid, are of particular importance. The intersection with the 
plane y = 0, for example, yields 

$$(v_{II}^2 - r^2)(v_{III}^2 - r^2)\,x^2 + (v_I^2 - r^2)(v_{II}^2 - r^2)\,z^2 = 0. \tag{39'}$$

This curve consists of the circle $r^2 = v_{II}^2$ and the oval of fourth degree 
$(v_{III}^2 - r^2)\,x^2 + (v_I^2 - r^2)\,z^2 = 0$. If 
we now assume that $v_I < v_{II} < v_{III}$, we see that the circle and the oval 
have real points of intersection in the plane y = 0, and only in this 
plane; for if we put $r = v_{II}$ in the equation of the oval, the coefficients 
of $x^2$ and $z^2$ have different signs, and so the equation can be satisfied 
by real values of x and z. The directions of the lines drawn from the 
centre to these points of intersection are given by 

$$\tan\alpha = \frac{z}{x} = \pm\sqrt{\frac{v_{III}^2 - v_{II}^2}{v_{II}^2 - v_I^2}}. \tag{40}$$

[Fig. 2. Normal surface and wave surface.] 

These directions are called the optical axes, since in these directions 
there is but one phase velocity v. The wave normals coincide with one 
of the axes, and we take the one that goes with the positive sign of the 
root. The direction cosines of the normals are 

$$\cos\alpha = \frac{1}{\sqrt{1 + \tan^2\alpha}} = \sqrt{\frac{v_{II}^2 - v_I^2}{v_{III}^2 - v_I^2}},\qquad \cos\beta = 0,\qquad \cos\gamma = \sqrt{\frac{v_{III}^2 - v_{II}^2}{v_{III}^2 - v_I^2}}. \tag{41}$$

Setting these values into (25') we get the single value of the phase 
velocity $v' = v'' = v_{II}$. But then the Y-component of D is indeterminate, 
since cos β as well as $(1/K_{II}) - (v_{II}^2/c^2)$ vanish in (24). For 
$D_z/D_x$ we obtain, as we must, $-\cot\alpha$, since D is always normal to n. 
Whatever value we assign to the Y-component, D remains perpendicular 
to n, and so D may assume any direction in the plane normal 
to n. 
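
The statement that the two phase velocities coincide along an optical axis can be checked with a short Python sketch. It assumes the ordering $v_I < v_{II} < v_{III}$ used above and illustrative values 1, 2, 3; the function names are hypothetical.

```python
import math

def optic_axis(v1, v2, v3):
    """Direction cosines (cos alpha, 0, cos gamma) of one optical axis, eq. (41)."""
    if not (v1 < v2 < v3):
        raise ValueError("this sketch assumes v1 < v2 < v3")
    p, q, r = v1 ** 2, v2 ** 2, v3 ** 2
    return (math.sqrt((q - p) / (r - p)), 0.0, math.sqrt((r - q) / (r - p)))

def phase_velocities(n, v1, v2, v3):
    """Both roots of (25') for a unit normal n, obtained as a quadratic in v^2."""
    a2, b2, g2 = n[0] ** 2, n[1] ** 2, n[2] ** 2
    p, q, r = v1 ** 2, v2 ** 2, v3 ** 2
    B = a2 * (q + r) + b2 * (r + p) + g2 * (p + q)
    C = a2 * q * r + b2 * r * p + g2 * p * q
    root = math.sqrt(max(B * B - 4.0 * C, 0.0))  # rounding may make this slightly negative
    return math.sqrt((B - root) / 2.0), math.sqrt((B + root) / 2.0)

axis = optic_axis(1.0, 2.0, 3.0)
v_small, v_large = phase_velocities(axis, 1.0, 2.0, 3.0)
print(axis, v_small, v_large)   # both velocities agree with v_II = 2 up to rounding
```

The discriminant of the quadratic vanishes along the axis, which is the algebraic counterpart of the circle and the oval meeting in the plane y = 0.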

On the other hand, the geometric construction must always furnish 
two axes of the ellipse of intersection. The contradiction is only an 
apparent one, the explanation being that the elliptical sections are 
circles in this case. Hence the optical axes are normal to those sections 
of the index ellipsoid which are circles. 

Since the axial direction lies in the xz-plane, the y-axis is one of 
the diameters of these circular sections. This is important in what 
follows. But since a circle has no uniquely determined axes, the direction 
of D is undetermined. Thus the associated ray is also undetermined. 
It is not correct to conclude, however, that the various possible 
directions for the rays are obtained by rotating the entire vector 
system of fig. 1 (p. 364) about n, for the angle between D and E, 
which is the same as that between n and s, depends entirely upon 
the position of D. Moreover, n is not the axis of the cone containing 
the several rays, but is itself one of the generators; since the y-axis 
is a diameter of a circular section, it is also a permissible direction for 
D, but D and E have the same direction in this instance, so that s 
and n coincide. 

If the magnitude of $v_{III}$ approaches that of $v_{II}$, the angle between 
the two axes becomes smaller and smaller; in the limit $v_{III} = v_{II}$, both 
axes coincide with the x-axis, and the circle touches the oval at the end 
points of the axes which coincide. This is the case for a so-called 
optically uniaxial crystal. Here the Fresnel ellipsoid and the index 
ellipsoid are figures of rotation and, as seen from equation (39) above, 
the normal surface degenerates into the sphere $r^2 = v_{II}^2$ and the fourth-order 
surface of rotation 

$$(v_{II}^2 - r^2)\,x^2 + (v_I^2 - r^2)(y^2 + z^2) = 0. \tag{39''}$$

Of greater importance than the normal surface is the ray surface or 
wave surface, obtained by laying off the ray velocity along the direction 
of the ray. The elements of the actual wave front are at these 
points, and are given by the tangent planes to this surface. The feet 
of perpendiculars drawn to these planes from the origin lie in the 
normal surface, which is therefore a so-called pedal-surface of the wave 
surface. The equation of the wave surface is obtained by putting 
r for V and x/r ... for cos A ... in equation (38) (p. 368): 

$$v_I^2 (v_{II}^2 - r^2)(v_{III}^2 - r^2)\,x^2 + v_{II}^2 (v_{III}^2 - r^2)(v_I^2 - r^2)\,y^2 + v_{III}^2 (v_I^2 - r^2)(v_{II}^2 - r^2)\,z^2 = 0. \tag{42}$$

The term of lowest degree is $v_I^2 v_{II}^2 v_{III}^2 (x^2 + y^2 + z^2)$. Since all other 
terms contain $r^2$, we may divide throughout by $r^2$, and the degree of 
the ray surface is not six but four. Correspondingly, the curves of 
intersection with the co-ordinate planes are circles and ellipses (cf. fig. 3, 
Plate I, facing p. 240). There is again a plane in which the circle and 
ellipse have real points of intersection. The lines joining the opposite 
pairs of these points are called the optical ray axes. If the ray has 
the direction of one of the optical ray axes, there are infinitely many 
normal directions corresponding to one ray, and these normals form 
the surface of a cone (see below). For uniaxial crystals the wave 
surface degenerates into the sphere $r^2 = v_{II}^2$ and the prolate ellipsoid 
of rotation 

$$\frac{x^2}{v_{II}^2} + \frac{y^2 + z^2}{v_I^2} - 1 = 0. \tag{43}$$


4. Refraction of Plane Waves at the Plane Surface of an Aniso- 
tropic Medium. 

Since the continuity of the tangential components of E and of H 
(p. 345) must hold for anisotropic media also, we can make use of 
the considerations of § 3, Chap. XVIII (p. 349), from which we 
obtain the laws of reflection and refraction. It must be remembered, 
however, that n denotes the direction of the wave normal, as may be 
seen from equation (16) (p. 363). Hence, at the boundary of an 
anisotropic body the wave normal remains in the plane of incidence 
and obeys the Law of Refraction. But the recognition of this fact is 
not of much aid in constructing the refracted wave normal, for, in 
contrast with isotropic media, the value of the index of refraction now 
depends upon the direction of this normal. This means that the 
direction of the refracted normal must be known in advance, in order 
to find the correct value of the index. However, there is available a 
method which leads to the desired result. This method is often used 
to furnish a simple derivation of Snell's law for isotropic media, and 
is based upon Huygens' Principle. According to this principle, which 
will be discussed in detail in the next chapter, each point on a wave 
front may be taken to be the source of secondary wavelets: the actual 
wave front at any future time may be considered to be the envelope of the 
secondary waves. Let us apply this idea to a plane wave incident upon 
a plane surface of a crystal (fig. 3). We must then consider the secondary 
waves emanating from points in the surface, at the same time 
remembering that the arrival of the primary disturbance takes place 
at different times, since points in the same phase are on the line AB. 
By the time the incident wave has arrived at C, the disturbance 
coming from A has already spread out over a wave surface. Assuming 
the upper medium to be a vacuum, this corresponds to a time BC/c. 
The wave surfaces belonging to points lying between A and C should 
be drawn in corresponding ratio, but since the envelope of these wave 
surfaces is a plane passing through C, it is sufficient to take the plane 
through C tangent to the wave surface of A as the new wave front. 
Since the wave front is of indefinite extent in a direction normal to 
the plane of the figure, we really mean the plane containing the line 
through C normal to the diagram and tangent to the family of wave 
surfaces originating in points on a line through A normal to the figure. 



Imagine a ray determined by a small surface element in the neigh- 
bourhood of A. After refraction, this element of the wave front must 
contain the point of tangency of the wave front with the wave sur- 
face corresponding to the point A. What happens for one wave sur- 
face in a plane normal to that of fig. 3 is shown in fig. 4. The top line 
in the latter diagram projects as the point A in the former figure. 
The two points of contact — one corresponding to each sheet — do not, 
in general, lie in the plane of fig. 4. Hence a refracted ray does not, in 
general, follow the law of refraction. 

According to what we have just seen, it is important to differentiate 
between refraction of the normal and refraction of a ray, i.e. a limited 
portion of a wave front. This difference becomes evident if we consider 
the simplest instance of the passage of light through a plane 
parallel plate of anisotropic material. If a plane wave is incident 
normally upon such a plate, the normal will not be deviated; nevertheless, 
there are two wave planes within the crystal perpendicular 
to the normal direction. These correspond to the two velocities of 
propagation belonging to each direction. If a plane parallel crystal 
plate is inserted in the parallel bundle of rays between the collimator 
and telescope of a spectroscope, no double refraction is observed. 
The effect is quite different, however, if a narrow bundle of rays is 
used in place of a wide beam ; in this instance double refraction takes 
place, for to each normal direction there correspond, in general, two 
rays. The results are particularly remarkable when the wave normal 
is in the direction of one of the optic axes of a biaxial crystal. In this 
case there are infinitely many rays corresponding to the normal, for 
it is possible to draw a tangent plane to the two sheets of the ray 
surface such that the plane touches the surface on a circle (a portion 
of this circle is seen in fig. 3, Plate I). Hence any ray from the 
centre which passes through this circle corresponds to this plane 
wave front, and the transmitted rays form the surface of a cone, 
which in turn is refracted into the form of a hollow cylinder of light 
upon emerging. This phenomenon is known as internal conical refraction. 

The effect reciprocal to double refraction of a ray is obtained by 
allowing light to enter a biaxial crystal in all directions through a 
given point on the surface of the plate. This may be done by diffuse 
illumination of a pinhole in a piece of cardboard placed on the plate. 
A given ray may be isolated by placing a similar diaphragm on the 
opposite side of the plate. This ray, however, will experience double 
refraction on emerging, since there are two different corresponding 
normal directions, each having its own value of the refractive index. 
If the transmitted ray is allowed to have the direction of one of the 
optical ray axes, there will be an infinity of normals corresponding to 
this so-called single-ray direction. This ray is directed from the centre 
to the “ dimple ” in the wave surface (fig. 3, Plate I), at which point 
it is possible to draw an infinite number of planes tangent to the 
surface. The emerging rays fill a cone, and this phenomenon is known 
as external conical refraction. These conical refraction effects were 
predicted by Hamilton and were verified by Lloyd, using a crystal of 
aragonite. 

In the previous discussion we tacitly assumed the incident light 
to be unpolarized, i.e. there was no uniquely characterized plane of 
vibration of E. Let us return to a consideration of the normal 
incidence of a plane wave upon a plane parallel plate. There are 
two mutually perpendicular directions of D; since these are both 
normal to n they lie in the surface of the plate. As before, we 
denote these two vectors by D' and D''. D and n also determine 
the plane of vibration of H (cf. fig. 1, p. 364). If, now, the incident 
light is linearly polarized, with its magnetic vector forming an angle u 
with one of the vectors D, then we can resolve the magnetic vector 
into two components, one along D' and one along D''. But each 
of the components has its own phase velocity, corresponding to 
the two different velocities belonging to n. As a result, there is 
a difference in phase when the light emerges: the original linearly 
polarized light has become elliptically polarized. On the other 
hand, by introducing a phase difference in this manner, we can 
convert elliptically polarized light into plane polarized light. This 
is the principle of so-called compensators for polarized light. 

Finally, we wish to consider briefly the beautiful patterns shown 
by plates of anisotropic materials when placed between crossed Nicol 
prisms. If we allow polarized light to fall upon a crystal plate at angle 
of incidence φ, then within the crystal there are two distinct normal 
directions, each with one plane of vibration of D, and hence of 
E.* In general, the plane of vibration of the incident light will not 
coincide with either of these directions, and so there will be a component 
along each. 

[Fig. 5] 

Both wave normals again have the same direction upon 
emerging, since we can apply the same construction here as we did 
at the point of entry into the crystal (reversibility of the rays). But 
since the two waves have travelled over different paths within the 
crystal they will differ in phase; this means that the emergent light is, 
in general, elliptically polarized (cf. the special case of normal incidence 
treated above). From fig. 5 we see that the phase difference is 
given by 


$$\delta = \omega\left(\frac{AB}{v'} + \frac{BD}{c} - \frac{AC}{v''}\right).$$

Denoting the angles between the normal to the surface and the refracted 
wave normals by ψ' and ψ'' respectively, and calling the thickness 
of the plate d, we have 

$$AB = \frac{d}{\cos\psi'};\qquad AC = \frac{d}{\cos\psi''};\qquad BD = BC\sin\phi = d\,(\tan\psi'' - \tan\psi')\sin\phi,$$

i.e. 

$$\delta = \omega d\left\{\left(\frac{\sin\psi''\sin\phi}{c} - \frac{1}{v''}\right)\frac{1}{\cos\psi''} - \left(\frac{\sin\psi'\sin\phi}{c} - \frac{1}{v'}\right)\frac{1}{\cos\psi'}\right\}.$$

Since the law of refraction holds for the wave normal, 

$$\frac{\sin\phi}{c} = \frac{\sin\psi''}{v''} = \frac{\sin\psi'}{v'},$$

and therefore 

$$\delta = \omega d\left(\frac{\cos\psi'}{v'} - \frac{\cos\psi''}{v''}\right), \tag{44}$$

and for the common case of approximately normal incidence, 

$$\delta = \omega d\left(\frac{1}{v'} - \frac{1}{v''}\right).$$

* It would be wrong to assume that there are two planes of vibration corresponding 
to each of the refracted normals. As a matter of fact, the construction of the refracted 
wave front shows that corresponding to each of the two normals there is a 
normal velocity v, and hence by equation (26) (p. 365), a definite direction of D. 
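
The reduction of the geometric expression for the phase difference to the compact form (44) can be checked numerically. The Python sketch below makes a simplifying assumption that is not in the text: the two normal velocities v' and v'' are treated as given constants, whereas in a real crystal they depend on the direction of the refracted normal. The function names and the numerical values are hypothetical.

```python
import math

def refraction_angles(phi, v1, v2, c=1.0):
    """Angles psi', psi'' of the two refracted normals from sin(phi)/c = sin(psi)/v."""
    return math.asin(v1 * math.sin(phi) / c), math.asin(v2 * math.sin(phi) / c)

def phase_difference(phi, d, omega, v1, v2, c=1.0):
    """Eq. (44): delta = omega d (cos psi'/v' - cos psi''/v'')."""
    psi1, psi2 = refraction_angles(phi, v1, v2, c)
    return omega * d * (math.cos(psi1) / v1 - math.cos(psi2) / v2)

def phase_difference_geometric(phi, d, omega, v1, v2, c=1.0):
    """delta = omega (AB/v' + BD/c - AC/v''), with the lengths read from fig. 5."""
    psi1, psi2 = refraction_angles(phi, v1, v2, c)
    AB = d / math.cos(psi1)
    AC = d / math.cos(psi2)
    BD = d * (math.tan(psi2) - math.tan(psi1)) * math.sin(phi)
    return omega * (AB / v1 + BD / c - AC / v2)

# Illustrative numbers only: c = 1, v' = 0.60, v'' = 0.65, 30 degrees incidence.
phi = math.radians(30.0)
delta_44 = phase_difference(phi, 1.0, 10.0, 0.60, 0.65)
delta_geo = phase_difference_geometric(phi, 1.0, 10.0, 0.60, 0.65)
print(delta_44, delta_geo)   # the two expressions agree up to rounding
```

At normal incidence (φ = 0) both functions reduce to ωd(1/v' − 1/v''), the approximate form quoted above.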



If the phase difference is 2mπ the emergent wave is again linearly 
polarized in the original plane, and can be extinguished by a second 
Nicol prism turned at an angle of π/2 with respect to the polarizer. 
If we allow convergent polarized light to fall upon the crystal, there 
will be darkness for all directions φ for which the angles of refraction 
ψ' and ψ'' yield δ = 2mπ according to (44). These directions then give 
dark curves in the focal plane of a lens. But these are not the only 
dark curves. There is a second possibility: for a given angle of inci- 
dence the plane of vibration of the incident wave can coincide with 
one of the possible planes of vibration in the crystal. In this case 
there is no second wave to bring about elliptic polarization. The 
resulting dark curve is called an isogyre. 


Ex. A uniaxial crystal is cut perpendicular to the axis. Show 
that with crossed Nicol prisms a system of circular rings will be obtained, with a 
rectangular cross through their centre. 


CHAPTER XX 



Electromagnetic Waves. IV: The Theory of Diffraction 

1. The General Diffraction Problem and Attempts to solve it. Kirchhoff's 
Formula. 

In previous sections we have dealt almost always with wave fronts 
of unlimited extent. In those cases where it proved necessary to 
consider limited beams, in the study of crystal optics, for example, 
we assumed that the limited beam was propagated in the same way 
as an extended one, and that the cross-section of the beam remained 
as determined by the aperture through which the light passed. Actually, 
this is not strictly true. Simple experiments show that light enters 
that portion of space which, according to our previous assumptions, 
could not be reached by the light wave: the region of the so-called 
geometric shadow. This phenomenon is called diffraction. 

Mathematically, the problem of the diffraction of a wave by an 
obstacle is to be formulated in this way: we seek a solution of the 
wave equation for E and H which satisfies the differential equation 
both for the region within the diffracting body, or obstacle, and for 
the rest of space; and which, moreover, satisfies the condition of continuity 
of the tangential components of E and H at the bounding 
surface. At the same time it must be remembered that the electrical 
constants (K, σ) are different for the two regions. If it were possible 
to find such a solution, it would be expected to furnish information 
on all questions of intensity, polarization, &c., behind the obstacle. 
However, the mathematical difficulties connected with such a solution 
are so great that it has been possible, so far, to develop solutions only 
in especially simple cases, such as the diffraction of a plane wave 
meeting a sphere or a cylinder or a screen. 

For the greater number of practically important diffraction problems
we have to fall back upon an approximate method, which proves
adequate if the conditions are not too unfavourable. Essentially, 
this method is merely the mathematical formulation of the principle 
that every point of a wave may be considered to be the source of 
secondary spherical waves. This concept we have already utilized in 
the preceding chapter. According to the original Huygens formulation,
the wave front existing after a certain time is given by the
surface which envelops the elementary wavelets. We made use of
the principle in this simple form in our discussion of the refraction of 
light by crystals (p. 371). Fresnel, however, improved the method in 
this way: in order to find the resultant disturbance at a point of obser- 
vation P, he imagined the secondary waves arriving there at a certain 
instant to be superimposed, at the same time making allowance for 
their difference in phase. The mathematical statement of this idea is 
contained in a formula due to Kirchhoff, which we shall now prove. 

We represent the light disturbance by a scalar quantity $U$; this
may be interpreted as one of the two components of E which are
perpendicular to the wave normal. The only essential fact is that the
intensity of the light is proportional to $U^2$. $U$ must satisfy the wave
equation

$$\Delta U = \frac{1}{c^2}\,\frac{\partial^2 U}{\partial t^2}. \qquad (1)$$

For the dependence upon time we make the usual assumption for a
monochromatic vibration

$$U = u(x, y, z)\,e^{i\omega t}, \qquad (2)$$

and obtain the following differential equation for the spatial part,
which alone interests us:

$$\Delta u + k^2 u = 0. \qquad (3)$$

Here

$$k = \frac{\omega}{c} = \frac{2\pi}{\lambda}. \qquad (4)$$


Assume now that we have two functions, $u$ and $v$, which satisfy the
wave equation. We then apply Gauss's theorem to the vector quantity
$u\,\mathrm{grad}\,v - v\,\mathrm{grad}\,u$, integrating over an arbitrary volume in which the
integrand is continuous. The result is Green's theorem (p. 270):

$$\oint (u\,\mathrm{grad}\,v - v\,\mathrm{grad}\,u)\cdot d\mathbf{S} = \int (u\,\Delta v - v\,\Delta u)\,d\tau = \int k^2 (vu - uv)\,d\tau = 0. \qquad (5)$$

We take the function $v$ to be the spherical wave originating at a point
$P$ in the region of integration. Apart from the time factor, this is

$$v = \frac{e^{-ikr}}{r}. \qquad (6)$$
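That the spherical wave really satisfies the spatial wave equation away from $P$ may be checked numerically. The following is only a sketch: the wave-length of one unit, the sample radii and the step `h` are arbitrary assumptions, and the function names are illustrative rather than taken from the text. For a function of $r$ alone the Laplacian is $(1/r)\,d^2(rv)/dr^2$, which is approximated here by a central difference.

```python
import cmath
import math

def spherical_wave(r, k):
    """Spatial part of the spherical wave, v = exp(-i k r) / r."""
    return cmath.exp(-1j * k * r) / r

def helmholtz_residual(r, k, h=1e-3):
    """Finite-difference value of (Laplacian v + k^2 v) at radius r.

    For a function of r alone, Laplacian v = (1/r) d^2(r v)/dr^2.
    """
    f = lambda s: s * spherical_wave(s, k)              # the product r * v
    second = (f(r + h) - 2.0 * f(r) + f(r - h)) / h**2  # central difference
    return second / r + k**2 * spherical_wave(r, k)

wavelength = 1.0                    # arbitrary length unit (assumption)
k = 2.0 * math.pi / wavelength      # wave number, k = 2 pi / lambda
for r in (0.5, 3.0, 10.0):
    scale = k**2 * abs(spherical_wave(r, k))
    # Residual relative to the size of the individual terms; it should be tiny.
    print(r, abs(helmholtz_residual(r, k)) / scale)
```

The printed ratios are limited only by the truncation error of the difference quotient, which is of order $k^2h^2/12$.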


But this function becomes infinite at $P$; hence if we wish to apply
equation (5) we must surround this point by a small sphere. The surface
of the sphere is then part of the bounding surface of the region
of integration, and the point $P$ is thus excluded from this space. The
normal is always directed outward from the region of integration;
hence in this case the normal is directed toward the centre of the small
sphere (fig. 1). We now have the equation

$$\oint \left(u\,\mathrm{grad}\,\frac{e^{-ikr}}{r} - \frac{e^{-ikr}}{r}\,\mathrm{grad}\,u\right)\cdot d\mathbf{S} + \oint_{\text{Sphere}} \left(u\,\mathrm{grad}\,\frac{e^{-ikr}}{r} - \frac{e^{-ikr}}{r}\,\mathrm{grad}\,u\right)\cdot d\mathbf{S} = 0. \qquad (7)$$


Fig. 1

The integral over the surface of the small sphere may be calculated
readily. First we note that since $u$ and $\mathrm{grad}\,u$ are continuous, the
contribution of the term

$$\frac{e^{-ikr}}{r}\,\mathrm{grad}\,u$$

vanishes when we allow the sphere to become infinitely small, since $dS$
decreases as $r^2$, while we have only the first power of $r$ in the
denominator. Again, since $u$ is continuous, we may replace it by the
value $u_P$ at the centre when integrating over the small sphere. Finally,
we have

$$\mathrm{grad}\,\frac{e^{-ikr}}{r} = -\left(\frac{1}{r} + ik\right)\frac{e^{-ikr}}{r}\,\mathbf{r}_0, \qquad (8)$$

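where $\mathbf{r}_0$ is a unit vector directed outwards from the centre of the
sphere. Hence we have

$$d\mathbf{S} = -\mathbf{r}_0\,dS = -\mathbf{r}_0\,r^2 \sin\theta\,d\theta\,d\phi, \qquad (9)$$

and the value of the integral

$$u_P \oint_{\text{Sphere}} \mathrm{grad}\,\frac{e^{-ikr}}{r}\cdot d\mathbf{S}$$

in the limit ($r = 0$) becomes $4\pi u_P$. Hence

$$u_P = \frac{1}{4\pi}\oint \left(\frac{e^{-ikr}}{r}\,\mathrm{grad}\,u - u\,\mathrm{grad}\,\frac{e^{-ikr}}{r}\right)\cdot d\mathbf{S}. \qquad (10)$$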


This is the celebrated formula of Kirchhoff.* It expresses the disturbance
at a point $P$ as the integral of the light disturbance over the boundary
of a region containing $P$. If, for example, we are dealing with the
diffraction of light by a small aperture in a screen, the screen would
be taken as the boundary of the region. But in order to apply the
formula, the diffraction problem must, in a way, be solved already, for
one must know the disturbance in the opening and on the surface of
the screen in advance. Kirchhoff made the natural assumption,
which is amply justified by experience, that the disturbance in the
free aperture is that which would exist if the screen were not present,
and that $u$ and $\mathrm{grad}\,u$ are zero on the screen itself. It is evident that
with an assumption of this kind, which contains nothing concerning
the material properties of the screen, we cannot attack the more
delicate questions, such as that of polarization. Nevertheless, if we
do not require the distribution of light in the immediate neighbourhood
of the edge, the Kirchhoff assumptions prove adequate.

* The formula was first proved by Helmholtz, but is a special case of a formula
due to Kirchhoff.

2. Reciprocal Theorems of the Theory of Diffraction. Classification of 
Diffraction Phenomena. 

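We shall now apply the Kirchhoff formula to the special case where
the primary excitation is from a point source $Q$ and is represented by

$$u = \frac{e^{-ikr_Q}}{r_Q}. \qquad (11)$$

Then we have

$$\mathrm{grad}\,u = -\left(\frac{1}{r_Q} + ik\right)\frac{e^{-ikr_Q}}{r_Q}\,\mathbf{r}_{Q0}, \qquad \mathrm{grad}\,\frac{e^{-ikr}}{r} = -\left(\frac{1}{r} + ik\right)\frac{e^{-ikr}}{r}\,\mathbf{r}_0, \qquad (12)$$

where $\mathbf{r}_Q$ is the radius vector drawn from $Q$ to a point of the aperture
and $\mathbf{r}$ is the radius vector drawn from the point of observation $P$.
The corresponding unit vectors are designated by the subscript 0
(fig. 1). Since $\mathbf{r}_0\cdot d\mathbf{S} = \cos(nr)\,dS$, and correspondingly for $\mathbf{r}_{Q0}$, we have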

$$u_P = \frac{1}{4\pi}\int_{\text{Aperture}} \frac{e^{-ik(r+r_Q)}}{r\,r_Q}\left\{\left(\frac{1}{r} + ik\right)\cos(nr) - \left(\frac{1}{r_Q} + ik\right)\cos(nr_Q)\right\} dS. \qquad (13)$$

If the distances from the source to the aperture and from the aperture
to the point of observation are large compared with the wave-length,
a condition fulfilled in all cases occurring in practice, $1/r$ and $1/r_Q$
may be neglected in comparison with $k$, and we obtain

$$u_P = \frac{i}{2\lambda}\int \frac{e^{-ik(r+r_Q)}}{r\,r_Q}\,\{\cos(nr) - \cos(nr_Q)\}\,dS. \qquad (14)$$

Assume further that both $r$ and $r_Q$ are large compared with the dimensions
of the aperture, a condition which is also realized in all practical
cases. This permits us to look upon $r$ and $r_Q$ as constants where they
occur in the denominator and in the trigonometric functions of the
integrand; however, this is not allowable where these quantities occur
in the exponent, since they are multiplied by the large factor $k = 2\pi/\lambda$.
Bringing the constant values outside the sign of integration, we obtain
the simpler formula

$$u_P = \frac{i}{2\lambda}\,\frac{\cos(nr) - \cos(nr_Q)}{r\,r_Q}\int e^{-ik(r+r_Q)}\,dS. \qquad (15)$$


Formulas (13) to (15) are entirely symmetrical in the co-ordinates
of light source and point of observation; interchanging these quantities
merely alters the algebraic sign. This means that if a source $Q$
causes a certain intensity at a point $P$, the identical source placed
at $P$ would cause this same intensity at the point $Q$. However, this
must not be interpreted as a reversibility of the light path for diffraction
in the sense that the use, as a source, of a distribution of
light corresponding to the diffraction pattern of a point source would,
in turn, have a single point of light as a diffraction pattern, i.e. the
aperture would act as a lens. The theorem says nothing regarding the
intensity in the neighbourhood of $Q$.

A second very remarkable theorem was discovered by Babinet.
Imagine a number of apertures of arbitrary shape cut from an opaque
screen. Let these pieces which were cut out be placed in a large opening,
this system then being considered a second screen. Two such
screens, for which the openings in one are the same as the opaque
portions of the other, are said to be complementary. Babinet's theorem
states that the diffraction patterns produced by complementary
screens are identical, except for the central spot corresponding to the
direct light from the source. To demonstrate this we may imagine
an image of the source formed by a lens of large aperture. In the
limiting case of full aperture there would be no light outside the geometric
image. Hence at a point $P$ in the neighbourhood of this image
the disturbance $u$ would be zero. We now insert one of the screens
between source and image. Let this cause the disturbance $u_1$ at $P$.
Let the disturbance produced by the complementary screen alone be
$u_2$. For no screen at all, the disturbance at $P$, which is zero, would
be given by the sum of the integrals over both complementary screens.
Hence by Kirchhoff's formula,

$$0 = u_1 + u_2$$

and

$$u_1^2 = u_2^2, \qquad (16)$$

which proves the theorem.
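Babinet's theorem can be illustrated with a one-dimensional numerical sketch. The assumptions here are ours and not part of the text: the "large aperture" is a strip of width `W`, the first screen is a single slit of width `a` at its centre, the complementary screen is the same strip with the slit region blocked, and all numerical values and function names are illustrative. The amplitudes are the closed-form Fraunhofer integrals of $e^{-iks\xi}$, and they are compared in directions where the unobstructed aperture itself gives darkness.

```python
import cmath
import math

def strip_amplitude(lo, hi, s, k):
    """Fraunhofer amplitude of an open strip lo..hi in the xi-direction.

    Closed form of the integral of exp(-i k s xi) d(xi); s stands for the
    difference (sin i - sin theta).
    """
    if s == 0.0:
        return complex(hi - lo)
    return (cmath.exp(-1j * k * s * hi) - cmath.exp(-1j * k * s * lo)) / (-1j * k * s)

wavelength = 0.5e-6      # illustrative value (metres)
k = 2.0 * math.pi / wavelength
W = 10e-3                # width of the full aperture (assumed)
a = 0.1e-3               # width of the slit (assumed)

def screen_1(s):
    """A single slit of width a centred in the aperture."""
    return strip_amplitude(-a / 2, a / 2, s, k)

def screen_2(s):
    """Complementary screen: the full aperture with the slit region blocked."""
    return (strip_amplitude(-W / 2, -a / 2, s, k)
            + strip_amplitude(a / 2, W / 2, s, k))

# Directions in which the full aperture gives zero amplitude: s = n * wavelength / W.
for n in (7, 40, 123):
    s = n * wavelength / W
    u1, u2 = screen_1(s), screen_2(s)
    print(n, abs(u1 + u2), abs(u1) ** 2, abs(u2) ** 2)
```

In these directions the two amplitudes cancel to rounding error while the two intensities agree, which is the content of equation (16).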




Consider now a plane screen containing one aperture (fig. 2). Draw
a rectangular system of co-ordinates in the plane of the screen and
place the origin within the aperture. Let the positive $z$-axis be normal
to the screen and directed toward the observer. The co-ordinates of
the source of light $Q$ are $x_Q$, $y_Q$, $z_Q$ and those of the point of observation
$P$ are $x$, $y$, $z$. Denote the co-ordinates of an element of area of the
aperture by $\xi$ and $\eta$. The distances $r$ and $r_Q$ are then given by

$$r^2 = (x-\xi)^2 + (y-\eta)^2 + z^2, \qquad r_Q^2 = (x_Q-\xi)^2 + (y_Q-\eta)^2 + z_Q^2. \qquad (17)$$

We further denote the distance from the source to the origin of co-ordinates
by $R_Q$, that of the observer from the origin by $R$, and the
direction cosines of the lines $QO$ and $OP$ by $\alpha_0 = -x_Q/R_Q$, $\beta_0 = -y_Q/R_Q$
and $\alpha = x/R$, $\beta = y/R$ respectively. Expansion in powers of $\xi$ and
$\eta$ now gives

$$r + r_Q = R + R_Q + \xi(\alpha_0-\alpha) + \eta(\beta_0-\beta) + \frac{\xi^2+\eta^2}{2R} + \frac{\xi^2+\eta^2}{2R_Q} - \frac{(\alpha\xi+\beta\eta)^2}{2R} - \frac{(\alpha_0\xi+\beta_0\eta)^2}{2R_Q} + \dots = R + R_Q + \phi(\xi,\eta). \qquad (18)$$

This representation by a power series furnishes us with a natural
classification of diffraction phenomena: If source and observer are
so far from the screen that we may neglect the terms of $\phi(\xi, \eta)$ which
are of the second degree in $\xi$ and $\eta$, we speak of Fraunhofer diffraction
phenomena; in other cases we refer to Fresnel diffraction phenomena.

If we insert the development (18) in the Kirchhoff integral (15)
we obtain

$$u_P = \frac{i}{2\lambda}\,\frac{\cos(nr) - \cos(nr_Q)}{R\,R_Q}\,e^{-ik(R+R_Q)}\int e^{-ik\phi}\,dS. \qquad (19)$$
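The quality of the power-series development of $r + r_Q$ can be judged by comparing it with the exact distances. The sketch below assumes the second-degree terms that follow from expanding the square roots of (17); the positions of source, observer and aperture point are arbitrary illustrative values in metres, and the function names are ours.

```python
import math

def exact_path(x, y, z, xq, yq, zq, xi, eta):
    """r + r_Q from equation (17) for the aperture point (xi, eta, 0)."""
    r = math.sqrt((x - xi) ** 2 + (y - eta) ** 2 + z ** 2)
    rq = math.sqrt((xq - xi) ** 2 + (yq - eta) ** 2 + zq ** 2)
    return r + rq

def expanded_path(x, y, z, xq, yq, zq, xi, eta, order=2):
    """R + R_Q + phi(xi, eta), keeping terms up to the given degree (1 or 2)."""
    R = math.sqrt(x * x + y * y + z * z)
    RQ = math.sqrt(xq * xq + yq * yq + zq * zq)
    al, be = x / R, y / R               # direction cosines of OP
    al0, be0 = -xq / RQ, -yq / RQ       # direction cosines of QO
    phi = xi * (al0 - al) + eta * (be0 - be)
    if order >= 2:
        rho2 = xi * xi + eta * eta
        phi += rho2 / (2 * R) + rho2 / (2 * RQ)
        phi -= (al * xi + be * eta) ** 2 / (2 * R)
        phi -= (al0 * xi + be0 * eta) ** 2 / (2 * RQ)
    return R + RQ + phi

wavelength = 0.5e-6                       # illustrative value
observer = (0.05, 0.01, 2.0)              # point P (assumed)
source = (0.02, 0.0, -1.0)                # point Q, on the far side of the screen
point = (1.0e-3, -0.5e-3)                 # aperture element (xi, eta)

exact = exact_path(*observer, *source, *point)
err1 = exact - expanded_path(*observer, *source, *point, order=1)
err2 = exact - expanded_path(*observer, *source, *point, order=2)
# Errors expressed in wave-lengths: only these matter in the exponent.
print(err1 / wavelength, err2 / wavelength)
```

With these sample values the first-degree development is wrong by more than a wave-length, so the case would have to be treated as Fresnel diffraction; moving `source` and `observer` much farther away makes the second-degree terms negligible and leads to the Fraunhofer case.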


3. Fraunhofer Diffraction by a Slit and by One-, Two- and Three-dimensional
Gratings.

Following our classification of diffraction phenomena we imagine 
the source as well as the observer to be very far from the diffracting 
aperture, so that the waves which strike this aperture are essentially 
plane. This condition may be realized in practice by placing the source 
of light at the focus of a lens and by studying the diffraction pattern 
in the focal plane of another lens (see Chap. XXI).

As a first example, we study diffraction by a slit of width $a$ which
is of unlimited length in the $\eta$-direction. This means that the quantities
occurring are independent of the $y$-co-ordinate, so that the
problem is a two-dimensional one in the $z\xi$-plane. We also write $i$ and $\theta$
for the angles which an incident and a diffracted ray make with the
normal $Oz$ to the slit; hence $\alpha_0 = \sin i$, and $\alpha = \sin\theta$.

The diffraction integral reduces to

$$u_P \propto \int_{-a/2}^{a/2} e^{-ik(\alpha_0-\alpha)\xi}\,d\xi. \qquad (20)$$


The value of the definite integral may be written down at once, but
an instructive way to evaluate it is to use the vector diagram, and
proceed graphically. We divide the slit into small strips of width $d\xi$
and add the contributions of the separate strips. While $\xi$ increases
from $-a/2$ to $a/2$ in steps of amount $d\xi$, the phase difference increases
in steps of $k(\sin i - \sin\theta)\,d\xi$. This means that the vectors representing
two successive strips differ in direction by this amount. Since the
contributions of all strips of equal width $d\xi$ are of the same absolute
magnitude, the series of index points form the vertices of a portion of
a regular polygon. In the limit, where $d\xi$ is infinitely small, the figure
becomes a circular arc (cf. p. 52). The figure forms a closed loop when
the phase difference between the first and the last strip is $2\pi$. A closed
polygon means that the resultant disturbance is zero, i.e. there is
complete darkness. In general, there is a minimum whenever

$$k(\sin i - \sin\theta)\,a = 2\pi n \qquad (n = 1, 2, 3, \dots)$$

or

$$(\sin i - \sin\theta)\,a = n\lambda. \qquad (21)$$

This is the condition for a minimum for diffraction by a single slit.
Maxima occur between these minima at points where the resultant in
the vector diagram is a diameter of the circle, i.e. where

$$k(\sin i - \sin\theta)\,a = (2n+1)\pi \qquad (n = 1, 2, 3, \dots)$$

or where

$$(\sin i - \sin\theta)\,a = (2n+1)\frac{\lambda}{2}. \qquad (22)$$

This, then, is the condition for directions of maximum illumination
in the diffraction pattern of a single slit.
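The two conditions can be tried out on the closed form of the slit integral. As a sketch, with the wave-length and slit width chosen arbitrarily and the function names our own: the integral (20) divided by $a$ equals $\sin x / x$ with $x = ka(\sin i - \sin\theta)/2$, so the intensity relative to the central maximum is $(\sin x/x)^2$.

```python
import math

def slit_amplitude(s, a, wavelength):
    """Relative amplitude of the slit integral (20), with s = sin i - sin theta.

    The integral of exp(-i k s xi) over -a/2..a/2, divided by a, equals
    sin(x)/x with x = k s a / 2 = pi s a / wavelength.
    """
    x = math.pi * s * a / wavelength
    return 1.0 if x == 0.0 else math.sin(x) / x

def slit_intensity(s, a, wavelength):
    """Intensity relative to the central maximum."""
    return slit_amplitude(s, a, wavelength) ** 2

wavelength = 0.5e-6   # illustrative value (metres)
a = 20e-6             # illustrative slit width
for n in (1, 2, 3):
    s_min = n * wavelength / a                    # condition (21)
    s_max = (2 * n + 1) * wavelength / (2 * a)    # condition (22)
    print(n, slit_intensity(s_min, a, wavelength),
          slit_intensity(s_max, a, wavelength))
```

The minima of (21) are exact zeros. The directions given by (22) are those of the vector-diagram argument; the exact stationary points of $(\sin x/x)^2$ satisfy $\tan x = x$ and lie slightly nearer the centre.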

For a given direction of incidence (which we may choose, for example,
along $Oz$ normal to the plane of the slit) the amplitude of the
diffracted light will be a function of the difference in path for the two
edges of the slit. The distribution of intensity may be found from (20),
or from the vector diagram by the method of p. 53. The result is shown
in fig. 3, where the dotted curve gives the amplitude of the resultant
disturbance, while the intensity is represented by the curve in full line.

If half the slit were covered there would be light in the directions
of the previous minima when $n$ is odd, since the second half of the
index circle would now be missing. If we place a large number of such
half-slits side by side we obtain a one-dimensional plane grating. Let
the distance between corresponding points in two neighbouring openings
be called $a$. The total effect is obtained by addition of the disturbances
from the individual slits. In those directions for which

$$(\sin i - \sin\theta)\,a = m\lambda, \qquad (23)$$

the individual disturbances plainly have the same phase, and hence
there is a maximum of illumination (cf. fig. 4). This is then the condition
for a maximum for the diffraction grating.

Fig. 4

Since, for a given direction of the incident light, the angles giving
maximum illumination depend upon the wave-length, a grating of
this kind is eminently suitable for the spectral analysis of light. The
number $m$ is called the order of the spectrum. In what directions are
the dark places to be found? From the vector diagram it is seen that
the first dark spot occurs when the index points of the individual
rulings (or slits) form a closed polygon. This is the case when the
phase difference between the first and the last ruling is $2\pi$. Thus the
rays coming from two adjacent lines of the grating must differ in phase
by $(2\pi m + 2\pi/N)$ to give the dark spot in the neighbourhood of the
maximum of order $m$, where $N$ is the total number of lines in the
grating. Hence the direction of the minimum adjacent to the $m$th-order
maximum is given by

$$(\sin i - \sin\theta)\,a = m\lambda + \frac{\lambda}{N}. \qquad (24)$$

This is, then, the condition for points of minimum illumination in the
diffraction pattern of a grating. We have here a means of computing
the resolving power of a grating, i.e. the smallest relative difference in
wave-length which can be separated. Following Rayleigh, two wave-lengths
are considered separate if the maximum of the diffraction
pattern of one coincides with the first minimum of the other. Let the
wave-length $\lambda + d\lambda$ have a maximum in the direction $\theta$, i.e.

$$(\sin i - \sin\theta)\,a = m(\lambda + d\lambda).$$

At the same time, suppose the first minimum of the wave-length $\lambda$
occurs at the same point:

$$(\sin i - \sin\theta)\,a = m\lambda + \frac{\lambda}{N}.$$

From these two relationships we obtain

$$\frac{d\lambda}{\lambda} = \frac{1}{mN}.$$

The resolving power $R$ is usually taken to be the reciprocal of this
value, i.e.

$$R = \frac{\lambda}{d\lambda} = mN. \qquad (25)$$
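The vector diagram for the grating is easily imitated numerically by adding $N$ unit vectors whose successive phases differ by a fixed amount. In the sketch below the number of rulings, the order and the pair of wave-lengths are illustrative assumptions, and the function names are ours.

```python
import cmath
import math

def grating_sum(delta, N):
    """Resultant of N unit vectors, successive ones differing in phase by delta."""
    return sum(cmath.exp(-1j * j * delta) for j in range(N))

def resolving_power(m, N):
    """Equation (25): R = lambda / d(lambda) = m N."""
    return m * N

N = 50   # number of rulings, kept small for the illustration
m = 2    # order of the spectrum

# Phase difference 2 pi m between adjacent rulings: all vectors are parallel.
at_maximum = abs(grating_sum(2 * math.pi * m, N))
# Phase difference 2 pi m + 2 pi / N: the index points form a closed polygon.
at_minimum = abs(grating_sum(2 * math.pi * m + 2 * math.pi / N, N))
print(at_maximum, at_minimum)

# Number of rulings needed to separate two neighbouring wave-lengths in order m.
lam, dlam = 589.3e-9, 0.6e-9     # illustrative values (metres)
print(math.ceil(lam / (m * dlam)), resolving_power(m, N))
```

The resultant at the maximum has length $N$, and at the adjacent minimum it vanishes to rounding error, in agreement with (23) and (24).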

The directions of the maxima and minima obviously do not depend
upon how the resultant of the disturbances from the individual
elements of the grating is formed. The only essential thing is that
we have transparent or reflecting or scattering elements arranged at
equal distances from one another. It is only the distribution of light
among the various orders which is affected by the nature of the grating
elements.

We pass now to the consideration of what we may call a two-dimensional
grating. This consists of a large number of apertures
whose distance apart is uniformly $a$ in the $\xi$-direction and $b$ in the
$\eta$-direction. The conditions for maxima are obtained by the same
construction as above, and are

$$(\alpha_0 - \alpha)\,a = m_1\lambda, \qquad (\beta_0 - \beta)\,b = m_2\lambda. \qquad (26)$$

For a given direction of the incident light, the direction of the maximum
corresponding to the order numbers $m_1$ and $m_2$ is uniquely determined,
for the direction cosines $\alpha$, $\beta$ each determine a cone about
the corresponding axis. The generators common to these two cones
give the directions of the maxima corresponding to the orders $m_1$
and $m_2$. Addition of a third condition, as in the case of the space
lattice (see below), would define a third cone about the $z$-axis. This
cone, however, would not necessarily contain the common generators
of the two other cones.

While the two-dimensional grating has no practical application,
the three-dimensional grating, or space lattice, is of the greatest importance,
viz. in the analysis of electromagnetic waves of very short
wave-length (X-rays). For such short waves it is not possible to produce
appreciable diffraction angles with ordinary ruled gratings except
by the use of special artifices. But nature has provided us with other
gratings whose divisions are very much smaller, viz. crystals. The
atoms in crystals are arranged at regular intervals, but form a three-dimensional
lattice. In 1912 M. von Laue suggested the use of crystals
as diffraction gratings and the experiment was performed successfully
by Friedrich and Knipping.*

* M. von Laue, Phys. Zeits. 14, p. 421 (1913). W. Friedrich, P. Knipping and M.
von Laue, Le Radium, 10, p. 47 (1913).

We limit our considerations to rhombic lattices, in which the lattice
points lie in three mutually perpendicular directions at distances $a$,
$b$ and $c$. This, of course, includes the simpler gratings in which two or
all three of the separations might be equal. The atoms act as sources
of secondary waves, since they are set into vibration by the incoming
light. Thus the secondary waves in this case have a certain reality,
as contrasted with the Huygens wavelets in free space. Since the
atoms are set into vibration by the incident light, they vibrate in
phase, and the secondary waves which they produce (or scatter) are
in phase, just as are ordinary Huygens wavelets. Hence the same
treatment is applicable here. We now have three conditions for the
directions of maximum illumination:

$$(\alpha_0 - \alpha)\,a = m_1\lambda, \qquad (\beta_0 - \beta)\,b = m_2\lambda, \qquad (\gamma_0 - \gamma)\,c = m_3\lambda, \qquad (27')$$

or

$$\alpha = \alpha_0 - \frac{m_1\lambda}{a}, \qquad \beta = \beta_0 - \frac{m_2\lambda}{b}, \qquad \gamma = \gamma_0 - \frac{m_3\lambda}{c}. \qquad (27'')$$



As we saw from the cone construction for the two-dimensional grating
(p. 385), the addition of the third condition overdetermines the problem.
This becomes evident mathematically by noticing that if we
select $\alpha_0$, $\beta_0$ and $\gamma_0$ arbitrarily, the values of $\alpha$, $\beta$, $\gamma$ obtained from
(27'') will not, in general, satisfy the condition $\alpha^2 + \beta^2 + \gamma^2 = 1$.
Hence, if we are to have a diffraction maximum at all, there must be
a relationship between the direction and wave-length of the incident
wave. If we square equations (27'') and add them, we obtain the
required relationship:

$$\lambda = 2\,\frac{\dfrac{m_1}{a}\alpha_0 + \dfrac{m_2}{b}\beta_0 + \dfrac{m_3}{c}\gamma_0}{\dfrac{m_1^2}{a^2} + \dfrac{m_2^2}{b^2} + \dfrac{m_3^2}{c^2}}. \qquad (28)$$



A maximum is obtained only when this condition is fulfilled. We
obtain a formula which lends itself more readily to direct interpretation
if we square and add equations (27'). Denoting the angle
between the directions of incidence and diffraction by $\theta$ and
remembering that

$$\cos\theta = \alpha\alpha_0 + \beta\beta_0 + \gamma\gamma_0,$$

we obtain

$$2(1 - \cos\theta) = \lambda^2\left(\frac{m_1^2}{a^2} + \frac{m_2^2}{b^2} + \frac{m_3^2}{c^2}\right), \quad\text{i.e.}\quad 2\sin\frac{\theta}{2} = \lambda\sqrt{\frac{m_1^2}{a^2} + \frac{m_2^2}{b^2} + \frac{m_3^2}{c^2}}. \qquad (29)$$

If the three order integers contain a common factor $M$, we place it
outside the radical and have

$$2\sin\frac{\theta}{2} = \lambda M\sqrt{\frac{H_1^2}{a^2} + \frac{H_2^2}{b^2} + \frac{H_3^2}{c^2}}, \qquad (30)$$

where $H_1$, $H_2$, $H_3$ are relatively prime.
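The overdetermination expressed by (28) can be made concrete with a few lines of arithmetic. The following sketch takes an arbitrarily chosen rhombic lattice, incident direction and set of orders (all of them illustrative assumptions, as are the function names), computes the one wave-length for which a maximum exists, and then checks that the diffracted direction is a unit vector and that relation (29) holds.

```python
import math

def laue_wavelength(direction0, orders, spacings):
    """Wave-length required by equation (28) for a given incident direction."""
    num = sum(m * c0 / d for m, c0, d in zip(orders, direction0, spacings))
    den = sum((m / d) ** 2 for m, d in zip(orders, spacings))
    return 2.0 * num / den

def diffracted_direction(direction0, orders, spacings, wavelength):
    """Direction cosines of the diffracted wave from equations (27'')."""
    return tuple(c0 - m * wavelength / d
                 for m, c0, d in zip(orders, direction0, spacings))

spacings = (3.0, 4.0, 5.0)        # a, b, c in angstrom units (assumed)
direction0 = (0.6, 0.0, 0.8)      # alpha_0, beta_0, gamma_0: a unit vector
orders = (1, 1, 2)                # m_1, m_2, m_3

lam = laue_wavelength(direction0, orders, spacings)
direction = diffracted_direction(direction0, orders, spacings, lam)
norm2 = sum(c * c for c in direction)                         # should equal 1
cos_theta = sum(c * c0 for c, c0 in zip(direction, direction0))
lhs = math.sqrt(2.0 * (1.0 - cos_theta))                      # 2 sin(theta/2)
rhs = lam * math.sqrt(sum((m / d) ** 2 for m, d in zip(orders, spacings)))
print(lam, norm2, lhs, rhs)
```

For any other wave-length the same incident direction and orders give direction cosines whose squares do not add up to unity, so that no maximum of these orders exists.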

Planes may be passed through the crystal in various directions
such that they contain a large number of lattice points (cf. fig. 5).
A plane of this kind is known as a reflecting plane. A family of parallel
reflecting planes is characterized by the set of relatively prime
integers proportional to the reciprocals of the intercepts on the axes,
each being divided by the corresponding grating constant. For example,
(010) denotes the set of planes $y = b$, $y = 2b$, $y = 3b$, &c.
These indices, which are the $H_1$, $H_2$, $H_3$, represent the reciprocals of
the actual intercepts of that plane which is nearest the origin. This may
be seen as follows: If the intercepts of one of the planes are $l_1$, $l_2$, $l_3$,
measured in centimetres, then

$$H_1 : H_2 : H_3 = \frac{a}{l_1} : \frac{b}{l_2} : \frac{c}{l_3},$$

or after introducing a factor of proportionality $t$,

$$l_1 = \frac{ta}{H_1}, \qquad l_2 = \frac{tb}{H_2}, \qquad l_3 = \frac{tc}{H_3}.$$

The usual form of the equation of a plane whose axial intercepts are
$l_1$, $l_2$, $l_3$ is

$$\frac{x}{l_1} + \frac{y}{l_2} + \frac{z}{l_3} = \frac{xH_1}{ta} + \frac{yH_2}{tb} + \frac{zH_3}{tc} = 1.$$

Fig. 5

If this is to be a reflecting plane, the equation must be satisfied by
integral values of the co-ordinates, i.e. by $x = pa$, $y = qb$, $z = rc$.
Inserting these values in the preceding equation, we have

$$pH_1 + qH_2 + rH_3 = t.$$

Since we have the sum of integers on the left, $t$ must also be a whole
number. Thus the smallest value of $t$ is 1. According to the results of
analytic geometry, the distance from the origin of a plane whose axial
intercepts are $a/H_1$, $b/H_2$, $c/H_3$ is

$$d = \frac{1}{\sqrt{\dfrac{H_1^2}{a^2} + \dfrac{H_2^2}{b^2} + \dfrac{H_3^2}{c^2}}}. \qquad (31)$$

This is also the distance between two adjacent planes of the set. Thus
we arrive at an entirely new interpretation of equation (30). There,
after removing a common factor, the $H_i$ represent the order numbers
of the direction of diffraction. However, if we interpret the $H_i$ as the
indices of a family of planes, we can express the radicand in terms of
the distance between the planes and thus obtain

$$2d\sin\frac{\theta}{2} = M\lambda. \qquad (32)$$
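Equations (31) and (32) lend themselves to direct computation. As a sketch, with a cubic lattice constant and a wave-length that are purely illustrative (both in the same, arbitrary, unit) and with function names of our own choosing, the following lists the plane spacing and all glancing angles $\theta/2$ for a few families of planes.

```python
import math

def plane_spacing(indices, spacings):
    """Equation (31): distance d between adjacent planes (H1 H2 H3)."""
    return 1.0 / math.sqrt(sum((h / s) ** 2 for h, s in zip(indices, spacings)))

def glancing_angles(indices, spacings, wavelength):
    """All angles theta/2 in degrees that satisfy 2 d sin(theta/2) = M lambda."""
    d = plane_spacing(indices, spacings)
    angles = []
    M = 1
    while 2.0 * d >= M * wavelength:        # sin(theta/2) must not exceed 1
        angles.append(math.degrees(math.asin(M * wavelength / (2.0 * d))))
        M += 1
    return angles

a = 4.0      # cubic lattice constant (illustrative)
lam = 1.5    # wave-length in the same unit (illustrative)
for hkl in [(1, 0, 0), (1, 1, 0), (1, 1, 1)]:
    print(hkl, plane_spacing(hkl, (a, a, a)), glancing_angles(hkl, (a, a, a), lam))
```

Planes with larger indices lie closer together and therefore admit fewer orders $M$ for a given wave-length.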


This is the condition for a maximum in the diffraction pattern of a
space lattice, and is known as Bragg's Law. The nature of these planes
may be seen at once. It is easy to see that the plane half-way between
the incident and diffracted rays (fig. 6) belongs to the set characterized
by $H_1$, $H_2$, $H_3$.* Hence, on account of the equality of the two angles,
we may look upon the diffraction process as reflection of the rays by
the reflecting planes. The difference in path of the parts of a wave
reflected from the first plane and the succeeding plane is given by
(see fig. 6)

$$AB + BD - AC = \frac{2d}{\sin\frac{\theta}{2}} - 2d\cot\frac{\theta}{2}\cos\frac{\theta}{2} = 2d\sin\frac{\theta}{2}. \qquad (33)$$




* The equation of this plane is $(\mathbf{s}_0\cdot\mathbf{r}) - (\mathbf{s}\cdot\mathbf{r}) = 0$, or $((\mathbf{s}_0 - \mathbf{s})\cdot\mathbf{r}) = 0$. If we insert
for the components of $(\mathbf{s}_0 - \mathbf{s})$ the values given by equation (27') (p. 386) and remove
the common factors, we obtain $\dfrac{H_1}{a}x + \dfrac{H_2}{b}y + \dfrac{H_3}{c}z = 0$.

Thus (32) signifies that for reflection from the individual planes the
angle of incidence $\pi/2 - \theta/2$, the distance $d$ between planes and the
wave-length $\lambda$ must fulfil the condition that the difference in phase
between the partial rays must be $2M\pi$. Angles which satisfy equation
(32) are called glancing angles. In order to obtain such a “reflection
at a depth” with monochromatic radiation, it is necessary to rotate
the crystal in such a way that the entire range of angle between the
reflecting planes and the incident wave normal is covered. This is the
principle of the so-called Bragg Method.*

Because the preceding derivation made use of a definite angle of incidence,
it might be thought necessary to work with a parallel beam. For X-rays, such a
beam could not be obtained by means of lenses, but solely by using two narrow
slits. Meanwhile, Bragg and M. de Broglie discovered a focusing condition whose
fulfilment guarantees that even a diverging bundle of rays from a single slit shall
give a sharp line for each wave-length. The condition is (fig. 7) that the point of
observation $P$ and the slit $S$ shall be equidistant from the axis of rotation $O$,
i.e. the spectrum falls on a circle of radius $SO$ drawn about $O$. In the symmetrical
position $KK$ of the crystal, a given wave-length reflected at $O$ will arrive at $P$,
and for any other position $K'K'$, this same wave-length, and only this one, will
arrive at $P$. If we draw a circle passing through $S$, $O$ and $P$, its intersection
with $K'K'$ determines the point of reflection $R$. It is easy to see that $SR$ and
$RP$ again make an angle of $\theta/2$ with the surface of the crystal: angles $SOP$ and
$SRP$ are equal, since they cut off the same arc. Also angles $SOK'$ and $SRK'$ ...
$\theta/2$. Then, also, angle $POK$ is equal to angle $PRK'$ ... and so chance local
defects in structure become unimportant.

Fig. 7

* See W. H. and W. L. Bragg, X-Rays and Crystal Structure, 5th Edition.
London, G. Bell & Sons (1928).

Instead of turning a single crystal so that it assumes, successively, various
positions with respect to the rays, we can assure incidence at all glancing angles
by using a mass of small, randomly oriented crystals. This is the powder method
devised by Debye and Scherrer and independently by A. W. Hull.*

Ex. 99. Give the relative intensities of the diffraction patterns of various orders
for a grating consisting of alternate transparent and opaque strips of equal width.

Ex. 100. The Michelson Stellar Interferometer. Two parallel slits whose distance
apart is $b$ are cut from a screen. What is the appearance of the diffraction
pattern if parallel rays are incident upon the screen (a) normally, and (b) at a
small angle $\epsilon$ with the normal? If both sets of rays are incident at the same time,
how does the visibility of the pattern depend upon $\epsilon$ and $b$?

Ex. 101. A cubical crystal of KCl bounded by the planes (100), (010) and (001)
is turned about an axis parallel to the line of intersection of the planes (100) and
(010). The Kα-radiation of copper (λ = 1541 X.U. = 1541 × 10⁻¹¹ cm.) is incident
normal to this axis. Which reflecting planes with indices ≤ 2 cause
reflection, and what are the corresponding glancing angles θ/2? The constant
of the cubic lattice is 3.13 Å.U. = 3.13 × 10⁻⁸ cm.



4. Fresnel Diffraction Phenomena at a Slit and at a Circular Aperture.
Zone Plates.

Consider first a two-dimensional case. Imagine the source to be
a line of light of indefinite extent in the direction normal to the plane
of fig. 8, so that the $y$-co-ordinate does not enter. Further, let the
perpendicular from the point of observation to the line source be normal
to the plane of the screen, and let it pass through one edge of the slit.
Since $\sin i = \sin\theta = 0$, i.e. $\alpha_0 = \alpha = 0$, in this case, the development of
$\phi(\xi, \eta)$ begins with

$$\frac{\xi^2}{2}\left(\frac{1}{R} + \frac{1}{R_Q}\right)$$

(cf. p. 381). Replacing $k$ by its value $2\pi/\lambda$ (equation (4), p. 377), and
supposing the slit to extend from $\xi = 0$ to $\xi = a$, we obtain for the
disturbance at $P$ the expression

$$u_P = \frac{i}{\lambda R R_Q}\,e^{-i(2\pi/\lambda)(R+R_Q)}\int_0^a e^{-i(\pi/\lambda)\,\xi^2 (1/R + 1/R_Q)}\,d\xi. \qquad (34)$$

* See G. L. Clark, Applied X-Rays, 3rd Edition, p. 267 et seq., New York, McGraw-Hill
(1940).

If we evaluate the integral as before, by summing the “vectors” of
individual elements of the slit, we see that in contrast with the case
treated in § 3 (p. 382), the vectors of the several elements $d\xi$ no longer
make equal angles with one another; on account of the quadratic
term the angle between two consecutive vectors increases as $\xi\,d\xi$,
i.e. the curvature of the vector curve increases with the length of arc.
A curve of this nature has a spiral form. In order to obtain an integral
independent of the special values of $\lambda$, $R$, $R_Q$ we make the substitution

$$\Lambda = \xi\sqrt{\frac{2}{\lambda}\left(\frac{1}{R} + \frac{1}{R_Q}\right)} \qquad (35)$$

and obtain

$$\int_0^a e^{-i(\pi/\lambda)\,\xi^2(1/R + 1/R_Q)}\,d\xi = \sqrt{\frac{\lambda R R_Q}{2(R + R_Q)}}\int_0^{\Lambda_a} e^{-i\pi\Lambda^2/2}\,d\Lambda,$$

where $\Lambda_a$ is the value of $\Lambda$ belonging to $\xi = a$. The curve

$$\int_0^{\Lambda} e^{i\pi\Lambda'^2/2}\,d\Lambda',$$

whose parametric equations in rectangular co-ordinates are

$$x = \int_0^{\Lambda}\cos\frac{\pi\Lambda'^2}{2}\,d\Lambda', \qquad y = \int_0^{\Lambda}\sin\frac{\pi\Lambda'^2}{2}\,d\Lambda',$$

is called the Cornu Spiral (fig. 9). The numbers marked on the curve
are the values of $\Lambda$. Each number also gives the length of the corresponding
arc of the curve measured from $O$; this is seen at once,
because we have $dx/d\Lambda = \cos(\pi\Lambda^2/2)$, and $dy/d\Lambda = \sin(\pi\Lambda^2/2)$, so
that $(dx/d\Lambda)^2 + (dy/d\Lambda)^2 = 1$ or $ds/d\Lambda = 1$, where $s$ is the length of
the arc.
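Points of the Cornu spiral are readily obtained by numerical quadrature. The sketch below uses Simpson's rule, which is our choice and not a method given in the text; the function name and the number of steps are likewise assumptions. It prints a few points, and then confirms the arc-length property by adding up short chords between neighbouring points.

```python
import math

def cornu_point(L, steps=2000):
    """Point (x, y) of the Cornu spiral at parameter L, by Simpson's rule.

    x = integral of cos(pi t^2 / 2) dt and y = integral of sin(pi t^2 / 2) dt,
    both taken from 0 to L.
    """
    if L == 0.0:
        return 0.0, 0.0
    n = steps + (steps % 2)              # Simpson's rule needs an even count
    h = L / n
    x = y = 0.0
    for j in range(n + 1):
        w = 1 if j in (0, n) else (4 if j % 2 else 2)   # Simpson weights
        phase = 0.5 * math.pi * (j * h) ** 2
        x += w * math.cos(phase)
        y += w * math.sin(phase)
    return x * h / 3.0, y * h / 3.0

for L in (0.5, 1.0, 1.5, 2.0):
    print(L, cornu_point(L))

# Arc length from 0 to 1.5, approximated by 150 short chords.
n_seg = 150
pts = [cornu_point(1.5 * j / n_seg) for j in range(n_seg + 1)]
arc = sum(math.hypot(q[0] - p[0], q[1] - p[1]) for p, q in zip(pts, pts[1:]))
print(arc)
```

The chord sum comes out very nearly equal to the parameter value 1.5, as the relation $ds/d\Lambda = 1$ requires, and for large parameter values the points close in on the asymptotic point $(\tfrac12, \tfrac12)$.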


Once we have drawn this spiral we can employ it to answer a number 
of questions. 

First we notice that the radius vector from $O$ to the point $\Lambda$ of
the spiral, whose relation to the slit width $a$ is given by (35), gives
the disturbance at $P$ as a function of the slit width (with the sign of
the imaginary part changed). As the figure shows, this radius vector
passes through infinitely many turning values, whose difference becomes
smaller and smaller, the value approaching that of the asymptotic
point I when the slit width becomes infinite, i.e. the slit becomes
a single diffracting edge. If, now, the half of the slit symmetric to the
above is opened also, an equally great light disturbance must be added.
For convenience we draw the spiral corresponding to the other half
so that it is centrally symmetric about $O$ with respect to the original
spiral. It must be remembered, however, that for the second spiral
the radius vectors are to be drawn from points on the curve to the
origin. We are now in a position to give the value of the disturbance
at $P$ when the slit has any position whatsoever on the $\xi$-axis, for
example, when its edges are at $\xi = a$ and $\xi = b$. We need only subtract
the vector corresponding to $\xi = b$ from that of $\xi = a$, i.e. draw
the line connecting the points on the curve which correspond to $a$
and $b$.

Further, we can calculate with good approximation the distribution
of light on a horizontal line through $P$ parallel to the screen.
If we limit our inquiry to points not too far from $P$ we may neglect
the small inclination of the lines $QP'$ and we may expect the same
disturbance at $P'$ as would exist at $P$ if the screen ended at
$\xi = -xR_Q/(R + R_Q)$ instead of at $\xi = 0$ (cf. fig. 8). With $x = PP'$
increasing in negative values we traverse the points of the upper spiral,
and the distances from the asymptotic point to these points steadily
decrease. Hence the illumination in the region of shadow declines
uniformly. Displacing $P$ by a positive $x$ corresponds to withdrawing
the edge of the screen in the negative direction. In this case the lower
spiral also contributes radius vectors, giving a series of maxima and
minima in the region of illumination. The limit for large distance from
the geometric shadow corresponds to the line joining the two asymptotic
points, i.e. double the amplitude (or four times the intensity) at
the point $P$. This distribution of light corresponds well with observation.
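The distribution of light behind a straight edge described above may be sketched numerically. The assumptions are ours: the edge position is expressed directly by the spiral parameter (positive in the illuminated region, negative in the shadow), the intensity is normalized to the unobstructed value, and the Fresnel integrals are evaluated by a simple midpoint rule under illustrative function names.

```python
import math

def fresnel_cs(L, n=4000):
    """Fresnel integrals C(L), S(L) by the midpoint rule; both are odd in L."""
    h = L / n
    c = s = 0.0
    for j in range(n):
        phase = 0.5 * math.pi * ((j + 0.5) * h) ** 2
        c += math.cos(phase)
        s += math.sin(phase)
    return c * h, s * h

def edge_intensity(L):
    """Intensity behind a straight edge, relative to the unobstructed value.

    The resultant vector runs from the asymptotic point (-1/2, -1/2) of the
    lower spiral to the point L of the upper one; with no screen at all it
    joins the two asymptotic points and has squared length 2.
    """
    c, s = fresnel_cs(L)
    return 0.5 * ((c + 0.5) ** 2 + (s + 0.5) ** 2)

for L in (-2.0, -1.0, 0.0, 1.2, 1.9, 2.3):
    print(L, round(edge_intensity(L), 4))
```

At the edge of the geometric shadow the relative intensity is one quarter, it falls steadily inside the shadow, and in the illuminated region it oscillates about unity with maxima that exceed the unobstructed value.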


Finally, we wish to investigate the distribution of light along
a normal to the screen when the source of light is a point.
If we again have $\alpha_0 = \alpha = 0$, the disturbance at $P$ is given by

$$u_P = \frac{i}{\lambda R R_Q}\,e^{-ik(R+R_Q)}\iint e^{-(ik/2)(\xi^2+\eta^2)(1/R + 1/R_Q)}\,d\xi\,d\eta. \qquad (38)$$

For a circular opening (radius $a$) the integral can be evaluated, and we
obtain

$$u_P = \frac{1}{R + R_Q}\,e^{-ik(R+R_Q)}\left\{1 - e^{-i(\pi/\lambda)\,a^2(1/R + 1/R_Q)}\right\}. \qquad (39)$$

The intensity is given by the square of the modulus of the complex
number, and this is given by multiplication by the conjugate complex
number. Also

$$(1 - e^{-ix})(1 - e^{+ix}) = 2(1 - \cos x) = 4\sin^2\frac{x}{2}.$$

The result is

$$|u_P|^2 = \frac{4}{(R + R_Q)^2}\,\sin^2\left[\frac{\pi a^2}{2\lambda}\left(\frac{1}{R} + \frac{1}{R_Q}\right)\right]. \qquad (40)$$

Thus we see that for increasing size of aperture the illumination varies
periodically. It must be borne in mind, however, that the formula is
not to be applied to arbitrarily large apertures, since the inclination
of the directions of $r_0$ and $r$ is of importance in such cases. A
maximum of intensity occurs when

$$\frac{1}{R} + \frac{1}{R_0} = (2m+1)\frac{\lambda}{a^2}. \tag{41}$$
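The periodic dependence on the aperture radius can be checked numerically. The following Python sketch assumes that equation (40) reads $|u|^2 = \frac{4}{(R+R_0)^2}\sin^2[\frac{\pi a^2}{2\lambda}(\frac{1}{R}+\frac{1}{R_0})]$ and that the maximum condition (41) reads $\frac{1}{R}+\frac{1}{R_0} = (2m+1)\lambda/a^2$. The function names and the numerical values (wave-length 500 nm, $R = R_0 = 1$ m) are illustrative assumptions, not taken from the text.

```python
import math

def on_axis_intensity(a, R, R0, lam):
    """Squared modulus of the disturbance on the axis, in the form assumed for (40)."""
    phase = math.pi * a**2 / (2.0 * lam) * (1.0 / R + 1.0 / R0)
    return 4.0 / (R + R0) ** 2 * math.sin(phase) ** 2

def radius_of_maximum(m, R, R0, lam):
    """Aperture radius a that satisfies the maximum condition (41) for integer m."""
    return math.sqrt((2 * m + 1) * lam / (1.0 / R + 1.0 / R0))

# Assumed illustrative values: source and point of observation 1 m from the screen.
lam, R, R0 = 500e-9, 1.0, 1.0
a_bright = radius_of_maximum(0, R, R0, lam)
# Radius at which the argument of the sine equals pi, i.e. a minimum.
a_dark = math.sqrt(2 * lam / (1.0 / R + 1.0 / R0))
peak = on_axis_intensity(a_bright, R, R0, lam)
dip = on_axis_intensity(a_dark, R, R0, lam)
```

With these values the first maximum lies at a radius of half a millimetre, and the intensity there is four times that of the unobstructed wave, $1/(R+R_0)^2$.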


If we compare this formula with the elementary lens formula, we see
that the circular aperture corresponds to a convex lens of focal length
$f = a^2/[(2m+1)\lambda]$. However, we find no appreciable increase in
intensity at the maxima here; this is first realized by diaphragming
out those zones whose phase is such as to cause a decrease of the
resultant amplitude. This corresponds to rendering alternate strips
of a slit opaque, i.e. to the passage to a line grating. The division into
zones in the present case must be such that the edges of adjacent
zones, of which one half is covered, furnish a phase difference of $2m\pi$.
Thus we must have

$$\frac{\pi}{\lambda}\left(\frac{1}{R}+\frac{1}{R_0}\right)\rho_{n+1}^2 - \frac{\pi}{\lambda}\left(\frac{1}{R}+\frac{1}{R_0}\right)\rho_n^2 = 2m\pi, \qquad \rho_{n+1}^2 - \rho_n^2 = \frac{2m\lambda}{\dfrac{1}{R}+\dfrac{1}{R_0}}. \tag{42}$$

It is seen that this also satisfies the condition that each zone has the
same area, since this quantity is given by $\pi(\rho_{n+1}^2 - \rho_n^2)$. The
innermost circle, half of whose area is to be covered, has a radius

$$\rho_1 = \sqrt{\frac{2m\lambda}{\dfrac{1}{R}+\dfrac{1}{R_0}}}.$$

With a plate so divided into alternate transparent and opaque rings
we can actually obtain a considerable intensification of light at points
which satisfy the equation

$$\frac{1}{R} + \frac{1}{R_0} = \frac{2m\lambda}{\rho_1^2}.$$


For this reason the so-called zone plate is sometimes referred to as a 
zone lens. 
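As a minimal sketch of the zone construction, the Python lines below assume that the zone edges satisfy $\rho_{n+1}^2 - \rho_n^2 = \rho_1^2$ with $\rho_0 = 0$ (zones counted from the centre), and that the focusing condition is $\frac{1}{R}+\frac{1}{R_0} = 2m\lambda/\rho_1^2$. The function names and the numerical values are hypothetical.

```python
import math

def zone_edge_radii(rho1, count):
    """Edge radii with rho_n^2 = n * rho1^2, so that rho_{n+1}^2 - rho_n^2 is constant."""
    return [rho1 * math.sqrt(n) for n in range(count + 1)]

def conjugate_distance(R0, rho1, lam, m=1):
    """Solve 1/R + 1/R0 = 2 m lam / rho1^2 for the distance R."""
    return 1.0 / (2 * m * lam / rho1**2 - 1.0 / R0)

# Assumed example: innermost radius 1 mm, wave-length 500 nm, source 2 m away.
radii = zone_edge_radii(rho1=1.0e-3, count=4)
areas = [math.pi * (outer**2 - inner**2) for inner, outer in zip(radii, radii[1:])]
R = conjugate_distance(R0=2.0, rho1=1.0e-3, lam=500e-9)
```

All entries of `areas` agree, which is the equal-area property stated above, and the plate acts like a lens of focal length $\rho_1^2/(2m\lambda)$ for this wave-length.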



CHAPTER XXI 


The Elements of Geometrical Optics and of 

Interference Optics 


1. The Fundamentals of Geometrical Optics. Laws of Fermat and of
Malus.

Geometrical optics has developed as a special field serving the 
needs of practical optics. Its purpose is to compute the details of the 
passage of light through optical instruments and to determine the 
principles of the construction of such instruments. The fundamental 
hypotheses of this branch of optics represent a great simplification of 
the actual facts. They are essentially the following:

1. Rectilinear propagation of light rays in a homogeneous medium.

2. Independence of the several rays. 

3. Reversibility of the rays. 

4. The Law of Reflection. 

5. The Law of Refraction (Snell’s Law). 


Assumptions 1, 2 and 3 show that geometrical optics not only 
takes no account of diffraction, but that it does not even contain 
any reference to the wave nature of light. Indeed, in the realm of 
diffraction phenomena the concept of a light ray loses all meaning; 
while it is possible to assign a ray in the direction of the normal to a
plane wave incident on an aperture, the normals corresponding to the
parts of the diffracted wave have very diverse directions. In the
calculation of optical instruments with the aid of geometrical optics the
phenomena due to diffraction at the various apertures or diaphragms
are not taken into consideration. When needed, e.g. for the
determination of resolving power, these diffraction effects must be treated
separately by the methods of the Theory of Diffraction.

Two important general theorems follow from Hypotheses 1, 4 and 5.
The first was discovered by Fermat and is stated thus:
The propagation of light always takes place in such a way that the actual
optical path (length of geometric path multiplied by index of refraction
of the medium) is an extremum (i.e. has a stationary value) compared
with all other paths which do not follow the laws of optics, i.e.

$$\sum n\, l = \text{Extremum}. \tag{1}$$

For media in which the index varies continuously, this becomes

$$\int n\, ds = \text{Extremum}. \tag{1'}$$


To prove this we shall show that the first variation of the optical path
vanishes. For propagation in a homogeneous medium the law is
self-evident, since the straight line is the shortest distance between two
points. Again, for the case of reflection, the proof may be reduced
to this first case, since if the laws of geometrical optics are valid, the
light reaching $P_2$ may be considered to come from the image $P_1'$; the
path $P_1'P_2$ is equal to the actual path and, being a straight line, is
again an extremum. We give a proof also for the case of refraction.
Let the refracting surface be the xy-plane of a co-ordinate system and
let the plane of incidence be the yz-plane. Besides the actual path
we consider a varied path (fig. 1) whose intersection with the xy-plane
has the co-ordinates $\delta x$ and $\delta y$, while the “optical” ray passes
through O. Then, if L is the optical path, we have

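$$\begin{aligned}
\delta L &= n_1(r_1 + \delta r_1) + n_2(r_2 + \delta r_2) - (n_1 r_1 + n_2 r_2) \\
&= n_1\sqrt{z_1^2 + (y_1 - \delta y)^2 + \delta x^2} + n_2\sqrt{z_2^2 + (y_2 - \delta y)^2 + \delta x^2} - n_1\sqrt{z_1^2 + y_1^2} - n_2\sqrt{z_2^2 + y_2^2}.
\end{aligned} \tag{2}$$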



Neglecting terms of second order, this becomes simply 


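$$\delta L = -\left(\frac{n_1 y_1}{\sqrt{z_1^2 + y_1^2}} + \frac{n_2 y_2}{\sqrt{z_2^2 + y_2^2}}\right)\delta y.$$

But

$$\frac{n_1 y_1}{\sqrt{z_1^2 + y_1^2}} + \frac{n_2 y_2}{\sqrt{z_2^2 + y_2^2}} = -n_1\sin\alpha_1 + n_2\sin\alpha_2.$$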


However, by Snell’s Law, the right member is zero, which proves the 
theorem for refraction. Since any optical path in geometrical optics 
may be considered to be a combination of reflected and refracted rays, 
the theorem is thus demonstrated in the general case. 
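The stationary property can also be verified numerically. The following Python sketch is an illustration under stated assumptions, not the book's proof: the refracting plane is taken as z = 0, the crossing point is varied only within the plane of incidence, and the points, the indices and the function names are hypothetical. It searches for the crossing height of least optical path and then compares $n_1\sin\alpha_1$ with $n_2\sin\alpha_2$.

```python
import math

def optical_path(y, p1, p2, n1, n2):
    """Optical path n1*r1 + n2*r2 for a ray crossing the plane z = 0 at height y."""
    (y1, z1), (y2, z2) = p1, p2
    return n1 * math.hypot(y - y1, z1) + n2 * math.hypot(y2 - y, z2)

def stationary_crossing(p1, p2, n1, n2, tol=1e-12):
    """Golden-section search for the crossing height that minimises the optical path."""
    lo, hi = sorted((p1[0], p2[0]))
    g = (math.sqrt(5.0) - 1.0) / 2.0
    while hi - lo > tol:
        c = hi - g * (hi - lo)
        d = lo + g * (hi - lo)
        if optical_path(c, p1, p2, n1, n2) < optical_path(d, p1, p2, n1, n2):
            hi = d
        else:
            lo = c
    return 0.5 * (lo + hi)

# Assumed example: points given as (y, z), one on each side of the plane z = 0.
p1, p2 = (0.0, 1.0), (1.0, -1.0)
n1, n2 = 1.0, 1.5
y0 = stationary_crossing(p1, p2, n1, n2)
sin_a1 = (y0 - p1[0]) / math.hypot(y0 - p1[0], p1[1])
sin_a2 = (p2[0] - y0) / math.hypot(p2[0] - y0, p2[1])
```

At the crossing height found in this way the two products `n1 * sin_a1` and `n2 * sin_a2` agree to within the accuracy of the search, which is Snell's Law.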

If an image of a point P is formed at another point P', many rays
unite at the image, all of them having travelled according to the laws
of geometrical optics. Each ray must represent a stationary optical
path, as compared with a neighbouring path which does not obey
these laws. This is only possible if the optical path is constant for all
optical rays. From the standpoint of wave optics, the constancy of
optical path is a basic necessity, since the waves which converge in
the image must all arrive in the same phase.

In order to understand the second important general theorem, 
the Law of Malus, it must be recalled that we distinguish between two 
kinds of ray systems in geometry — those which represent the normals 
to a surface (orthotomic systems of rays) and those having no ortho- 
gonal surfaces (non-orthotomic systems). The rays emanating from a 
point are certainly orthotomic, for the spheres with centre at this 
point are the required orthogonal surfaces. The Law of Malus states 
that the property of orthotomy is not destroyed by reflection or re- 
fraction, although the originally spherical orthogonal surfaces may be 
altered in shape. This law is self-evident from the standpoint of wave 
optics, for there must always be wave fronts corresponding to the rays. 
We therefore refrain from giving a proof from the point of view of 
geometrical optics. One further consequence of the Law of Malus is 
important. If we consider the parts of two rays included between two 
surfaces, their optical paths must be equal in length. This must also 
be true for rays that intersect later on, in which case the second wave
surface passes through the point of intersection, where it has a singular
point.

2. The Properties of Collinear Projection. 

The ideal case of optical image formation is that in which all rays 
coming from a point unite again at another point, while the images 
of straight lines are straight lines, and the images of planes are again 
planes. A priori, one cannot say whether or not this type of image
formation is possible with optical means. We shall assume that it is 
possible, and without going deeper into the question we investigate 
the properties of this type of correspondence between the points of 
the object space and those of the image space. Such a correspondence 
between the co-ordinates $\xi, \eta, \zeta$ of P and $\xi', \eta', \zeta'$ of P' is represented
mathematically by linear fractional transformations

$$\xi' = \frac{a_1\xi + b_1\eta + c_1\zeta + d_1}{a_0\xi + b_0\eta + c_0\zeta + d_0}, \qquad
\eta' = \frac{a_2\xi + b_2\eta + c_2\zeta + d_2}{a_0\xi + b_0\eta + c_0\zeta + d_0}, \qquad
\zeta' = \frac{a_3\xi + b_3\eta + c_3\zeta + d_3}{a_0\xi + b_0\eta + c_0\zeta + d_0}. \tag{3}$$

As may be verified immediately, the equation of a plane

$$l\xi' + m\eta' + n\zeta' - p = 0$$


in the image space corresponds to a plane in the object space also. 
The intersection of two planes — i.e. a line — in the object space corre- 
sponds to a line in the image space, and the intersection of three planes 
— i.e. a point — again corresponds to a point. We shall restrict our dis- 
cussion to axially symmetric (centred) systems. 
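That a transformation of the form (3) carries straight lines into straight lines can be checked with exact rational arithmetic. The Python sketch below is only an illustration: the coefficient rows and the three sample points are arbitrary assumed values, and the function names are hypothetical.

```python
from fractions import Fraction

def collinear_map(rows, point):
    """Apply a map of the form (3); rows[0] is (a0, b0, c0, d0), rows[1..3] the numerators."""
    def linear(row):
        a, b, c, d = row
        return a * point[0] + b * point[1] + c * point[2] + d
    denominator = linear(rows[0])
    return tuple(linear(row) / denominator for row in rows[1:])

def collinear(p, q, r):
    """True if the three points lie on one straight line (vanishing cross product)."""
    u = [b - a for a, b in zip(p, q)]
    v = [b - a for a, b in zip(p, r)]
    cross = (u[1] * v[2] - u[2] * v[1],
             u[2] * v[0] - u[0] * v[2],
             u[0] * v[1] - u[1] * v[0])
    return all(component == 0 for component in cross)

# Assumed coefficients and three points on the line (1 + t, 2t, 2 - t).
rows = [(1, 2, 0, 5), (2, 0, 1, 1), (0, 3, 1, -2), (1, 1, 1, 0)]
points = [(Fraction(1 + t), Fraction(2 * t), Fraction(2 - t)) for t in (0, 1, 3)]
images = [collinear_map(rows, p) for p in points]
```

Because the arithmetic is exact, `collinear(*images)` holds without any rounding tolerance, provided the common denominator does not vanish at the chosen points.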

If the $\xi$-axis is the axis of symmetry, we need investigate the image
formation only in a meridional section, using the co-ordinates $\xi$ and
$\eta$. The symmetry of the image formation requires only that the
co-ordinate $\xi'$ be unchanged when $\eta$ is replaced by $-\eta$ but that the
sign of $\eta'$ change when that of $\eta$ is reversed. Hence for a centred
system $b_0$, $b_1$, $a_2$ and $d_2$ vanish, so that we now have the simpler
expressions

$$\xi' = \frac{a_1\xi + d_1}{a_0\xi + d_0}, \qquad \eta' = \frac{b_2\eta}{a_0\xi + d_0}.$$

Solved for $\xi$ and $\eta$, these equations are

$$\xi = \frac{d_0\xi' - d_1}{-a_0\xi' + a_1}, \qquad \eta = \frac{(a_1 d_0 - a_0 d_1)\,\eta'}{b_2\,(a_1 - a_0\xi')}.$$


The infinitely distant plane of the image space corresponds to the
plane $a_0\xi + d_0 = 0$; that of the object space corresponds to the plane
$a_0\xi' - a_1 = 0$. These are called the focal planes of the respective
regions. Their two points of intersection with the axis are called the
principal foci F and F' of the system. The transformation formulae
become still simpler if we move the zero points of the $\xi$- and $\xi'$-axes
to the respective foci. We shall also take the positive direction of the
axes in object and image space to be that in which the light is
propagated. Denote the co-ordinates of this system by x and y and x' and y'
respectively. We then have the transformation

$$x = \xi + \frac{d_0}{a_0}, \quad y = \eta; \qquad x' = \xi' - \frac{a_1}{a_0}, \quad y' = \eta'.$$

Introducing the notation

$$f = -\frac{b_2}{a_0}, \qquad f' = \frac{a_1 d_0 - a_0 d_1}{a_0 b_2},$$

we then obtain

$$x x' = f f', \qquad \frac{y'}{y} = -\frac{f}{x} = -\frac{x'}{f'}. \tag{6}$$
The lengths f and f' introduced here are called the focal lengths of object
space and image space respectively. We shall see at once that they 
are identical with the focal lengths introduced in the elementary lens 
theory. 

The ratio y'/y is termed the transverse magnification. For the points
of the plane $x = -f$, which corresponds to the plane $x' = -f'$, this ratio
is unity. These two planes are called the principal planes or unit planes.
Their points of intersection with the axis are the principal points
H and H'. Any optical system may be schematically represented by



an axis, the foci and the principal points. The distance e from a point 
of the object to the principal plane in the object space is called the 
object distance; similarly, e', the image distance, is that of the
corresponding image point from its principal plane. Then (see fig. 2),

$$-e = -f - x, \qquad e' = f' + x',$$

and the first equation of (6) gives

$$(e - f)(e' - f') = f f' \quad \text{or} \quad \frac{f}{e} + \frac{f'}{e'} = 1. \tag{7}$$

If $f = -f'$ we obtain the elementary lens formula*

$$-\frac{1}{e} + \frac{1}{e'} = \frac{1}{f'}.$$

* This formula is usually written for the absolute values, i.e. without the
minus sign attached to 1/e. The same is true for equation (8), following.

In elementary optics we measure the image and object distances
for a thin lens from the middle of the lens. For a more complicated
optical system, e.g. a photographic objective, this is no longer
allowable. In this instance the unit planes (from which these distances are
measured) may be far apart and even outside the lenses themselves.
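The relation between the co-ordinates measured from the foci and the distances measured from the unit planes may be illustrated with a short Python sketch. It assumes the collinear equations in the form $xx' = ff'$, $y'/y = -f/x$, together with $e = f + x$ and $e' = f' + x'$; the function name and the numerical values (a thin lens in air with $f' = 0.10$ m) are hypothetical.

```python
def image_from_newton(x, y, f, f_prime):
    """Image co-ordinates (x', y') measured from F', given (x, y) measured from F."""
    x_prime = f * f_prime / x      # from x x' = f f'
    y_prime = -f / x * y           # from y'/y = -f/x (assumed sign convention)
    return x_prime, y_prime

# Assumed example: thin lens in air, f' = 0.10 m, hence f = -0.10 m.
f, f_prime = -0.10, 0.10
e = -0.30                          # object 0.30 m to the left of the unit plane
x = e - f                          # from e = f + x
x_prime, y_prime = image_from_newton(x, 1.0, f, f_prime)
e_prime = f_prime + x_prime        # image distance from the second unit plane
```

For this example the image lies 0.15 m behind the lens and is inverted at half size, in agreement with the elementary formula $-1/e + 1/e' = 1/f'$.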

For the construction of the image we make use of a ray parallel 
to the axis and another passing through the focus (fig. 2). The former 
meets the unit plane of the object space at A ; this point corresponds 
to the point A' at the same height on the unit plane of the image 
space. The ray from P through A and A', parallel to the axis, must 
then pass through the principal focus F' of the image space. The ray
from P through F, on the other hand, meets the first unit plane at B.
This ray must then cut the other unit plane at the point B', which is the
same distance from the axis, and must then travel parallel to the
axis. The intersection of the two rays in the image space locates P', 
the image of P. A case of particular importance is that where the 
object point is at infinity and in a direction which makes an angle u 
with the axis. This point corresponds to a point in the focal plane of 
the image space; its co-ordinate is found as follows: the ray in the 
direction u which passes through F (fig. 3) meets the first unit plane 
at A. This corresponds to the point A' at the same distance from the 
axis on the other unit plane. The ray then runs parallel to the axis, 
cutting the focal plane at P'. Then we have the relationship 

$$y' = -f \tan u. \tag{8}$$

This equation makes it possible to convert angular distances (e.g. in
astronomical measurements) into linear distances in the focal plane of
the observing instrument.
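A brief Python sketch shows the conversion for a single angle. It assumes equation (8) in the form $y' = -f\tan u$ and an instrument in air with $f' = +10$ m, so that $f = -10$ m; these values and the function name are illustrative assumptions.

```python
import math

def focal_plane_offset(f, u):
    """Equation (8): y' = -f tan(u), with the angle u in radians."""
    return -f * math.tan(u)

ARCSEC = math.pi / (180.0 * 3600.0)   # one second of arc in radians
f_object = -10.0                      # assumed: f' = +10 m in air, so f = -10 m
offset = focal_plane_offset(f_object, 1.0 * ARCSEC)
```

One second of arc then corresponds to roughly 0.05 mm in the focal plane.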

In analogy with our definition of transverse magnification (p. 399) 
we may define the magnification in depth as the ratio of the difference 
of the abscissae of the two image points to the difference of the 
abscissae of the object points. This ratio, however, depends upon 
the abscissa itself, and thus is not definite except for infinitesimal 
differences. It is therefore preferable to define the depth magnification 
as equal to dx'/dx. Then, by (6) (p. 399),

$$\frac{dx'}{dx} = -\frac{f f'}{x^2} = -\frac{x'}{x}. \tag{9}$$

Since this quantity depends upon x, we see that collinear projection
does not preserve similarity.

If f and f' have different signs, as assumed, then a movement of the object
will cause the image to move in the same direction. This is the case for reflection. 

The angular magnification or convergence ratio is defined as the 
ratio of the tangents of the angles between the axis and a ray in image 
space and in object space, the ray being such as to cut the axis in each
region. This ratio is constant for a given point on the axis, for it
follows from fig. 4 that

$$\frac{\tan u'}{\tan u} = \frac{f + x}{f' + x'} = \frac{(f + x)\,x}{f'(x + f)} = \frac{x}{f'}. \tag{10}$$

The product of the transverse magnification and angular magnification
for a given plane is constant for the entire system, for we have from
equation (10) and equation (6) (p. 399):

$$\frac{y' \tan u'}{y \tan u} = -\frac{f}{f'}. \tag{11}$$


Combination of two imaging processes 

The result of two successive collinear transformations is again 
collinear, and so there must be a single projection equivalent to the
two. We set ourselves the task of determining the four cardinal points
for the result, i.e. the focal points and the principal points or their
corresponding planes. Fig. 5 shows these planes for each of the two
processes, denoted respectively by subscripts 1 and 2. The same letters
are used for the points of intersection of each plane with the axis.
A ray parallel to the axis and distant y from it is drawn in the object
space. After meeting $H_1$, $H_1'$ it goes through the focus $F_1'$. Its
inclination to the axis in the first image space is

$$\tan u_1' = -\frac{y}{f_1'}. \tag{12}$$

The final focal point will be the image of $F_1'$ as formed
by the second system. Denoting the separation $F_1'F_2$ by D, which is
reckoned positive when $F_2$ is to the right of $F_1'$, we have by equation
(6), p. 399:

$$x_2(F_1') = -D, \qquad x_2' = F_2'F' = -\frac{f_2 f_2'}{D}. \tag{13}$$


The principal plane H' may be located graphically as follows:
If H' is the principal plane of the combined system, a horizontal
incident ray distant y from the axis will finally pass through F'. Then H'
must contain the intersection of the horizontal line and the ray
passing through F' that corresponds to the ray $F_1'H_2$. Algebraically, the
focal length f' (the distance between F' and H') is obtained by
applying equation (10) to the angular magnification produced in the
second system, remembering that $u_2 = u_1'$. Then

$$\tan u_2' = -\tan u_2\,\frac{D}{f_2'} = \frac{y}{f_1'}\cdot\frac{D}{f_2'}.$$

Also, according to fig. 5, since F' is to the left of H',

$$\tan u' = \tan u_2' = -\frac{y}{f'},$$

whence

$$f' = -\frac{f_1' f_2'}{D}. \tag{14}$$

Similarly, one finds

$$f = \frac{f_1 f_2}{D}. \tag{14'}$$


If, as is usual, the same medium (air) is present on both sides of 
the lenses, then the two focal lengths will be equal in magnitude and
opposite in sign. This will be shown in the next section. If the system
consists of two thin lenses, for each of which the principal planes may
be assumed approximately coincident, the distance d between lenses
may be written $d = D + f_1' - f_2$, or $d = D + f_1' + f_2'$ (see figure).
Putting this value of D in (14) and in (14') we get, with the lenses in
air,

$$\frac{1}{f'} = \frac{1}{f_1'} + \frac{1}{f_2'} - \frac{d}{f_1' f_2'}, \qquad \frac{1}{f} = \frac{1}{f_1} + \frac{1}{f_2} + \frac{d}{f_1 f_2}. \tag{15}$$

If, in addition, the lenses are close together, the result is simply

$$\frac{1}{f} = \frac{1}{f_1} + \frac{1}{f_2}. \tag{16}$$


Hence for thin lenses in contact, the reciprocals of the separate focal 
lengths simply add up to give the reciprocal of the combined focal 
length. For this reason the reciprocal of the focal length of a lens is 
called the power $\varphi$ and is measured in diopters, one diopter being the
power of a lens whose focal length is one meter. It must be remembered 
that the additivity of the powers — which holds, incidentally, for any 
number of lenses — is valid only when the same medium is present on 
both sides of the system. 
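The addition of powers, and the correction for a finite separation, can be sketched in Python as follows. The sketch assumes two thin lenses in air, described by their image-side focal lengths $f_1'$ and $f_2'$, and the combination formula $1/f' = 1/f_1' + 1/f_2' - d/(f_1' f_2')$; the second function uses the equivalent form $f' = -f_1' f_2'/D$ with $D = d - f_1' - f_2'$. Function names and numbers are hypothetical.

```python
def combined_power(f1, f2, d=0.0):
    """Power (in diopters) of two thin lenses in air, separated by d metres."""
    return 1.0 / f1 + 1.0 / f2 - d / (f1 * f2)

def combined_focal_length_from_D(f1, f2, d):
    """Same quantity obtained from f' = -f1' f2' / D with D = d - f1' - f2'."""
    D = d - f1 - f2
    return -f1 * f2 / D

# Assumed example: lenses of 2 and 4 diopters.
in_contact = combined_power(0.50, 0.25)
separated = combined_power(0.50, 0.25, d=0.10)
check = 1.0 / combined_focal_length_from_D(0.50, 0.25, 0.10)
```

In contact the powers simply add to 6 diopters; a separation of 0.10 m lowers the combined power to 5.2 diopters, and both routes give the same value.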

Ex. 102. The solar disc has an angular diameter, as seen from the earth, of 
1/2°. How large is the image formed by a lens of focal length 2 m.? 

Ex. 103. The plane of an object is inclined to the axis of a lens system. Locate 
the image plane. 


3. The Practical Problem of Image Formation. Abbe’s Sine Law. 

General Path of an Elementary Pencil from a Point Source. 

(a) Imaging of points near the axis by means of narrow pencils 

Suppose that the angle between the rays diverging from a point 
on the axis is so small that the sine and tangent may be taken equal 
to the angle itself. Assume further that only points near the axis are 
considered, so that their distances from the axis may be taken equal 
to the lengths of circular arcs whose centres are on the axis. Under 
these conditions, calculation of the refraction at a centred spherical 
surface leads to the collinear formulae. Since the net effect of suc- 
cessive collinear transformations is again collinear and since, to the 
above approximation, any surface of rotation may be replaced by its 
sphere of curvature, it may be said in general that within the limits 
of accuracy described, any system consisting of coaxial refracting sur- 
faces of rotation is collinear. This statement is true even if some of the 
surfaces are reflectors, since this case is merely one where n = -1. 

The truth of the above statement will be shown for a refracting
spherical surface separating a medium of index n, on the left, from one
of index n' on the right. The convention regarding algebraic signs will
be that all axial segments to the left are negative, those to the right
are positive; the curvature of a surface is positive if the convex side
is facing the light. In fig. 6, let P be an axial point whose distance
from the vertex of surface R is $-s$; and let a ray leaving P at an
angle $\epsilon$ with the axis arrive, after refraction, at another axial point P'
whose distance from R is s'. We must show that s' is independent of
the angle $\epsilon$, i.e. that all rays coming from P again unite at P'. Draw



the radius to the point K and denote the angle RMK by $\gamma$. Then,
using the sine law of trigonometry and replacing the sines by the
angles themselves, we have (see figure)

$$\frac{s - r}{s} = \frac{\alpha}{\gamma} \quad \text{and} \quad \frac{s' - r}{s'} = \frac{\beta}{\gamma}.$$

Dividing one equation by the other and using the law of refraction in
the form $n\alpha = n'\beta$, we have

$$\frac{(s - r)\,s'}{(s' - r)\,s} = \frac{n'}{n}.$$

The angles $\alpha$ and $\beta$ are not present in the result. The value of the
expression

$$n\left(\frac{1}{r} - \frac{1}{s}\right) = n'\left(\frac{1}{r} - \frac{1}{s'}\right)$$

remains constant, and it is said to be a surface invariant. Equation (19)
may be written in the form

$$\frac{n'}{s'} - \frac{n}{s} = \frac{n' - n}{r}. \tag{20}$$

Comparing with equation (7), p. 399, and putting $e = s$ and $e' = s'$
yields*

$$f = -\frac{nr}{n' - n}, \qquad f' = \frac{n'r}{n' - n}, \tag{21}$$

and hence

$$\frac{f}{f'} = -\frac{n}{n'}. \tag{22}$$

If (22) is substituted into (11) and if u is substituted for tan u, the 
result is the important Helmholtz-Lagrange equation 

$$n y u = n' y' u'. \tag{23}$$

Thus the expression $nyu$ is also an invariant. Equation (23)
represents the relation between angles and lateral distances in the object
and image spaces even when there are many refracting surfaces, 
whereby it should be noticed that a large image corresponds to a 
narrow pencil and vice versa. 
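The paraxial relations for a single refracting surface can be tried out numerically. The Python sketch below assumes equation (20) in the form $n'/s' - n/s = (n'-n)/r$, the focal lengths $f = -nr/(n'-n)$ and $f' = n'r/(n'-n)$, the transverse magnification $y'/y = ns'/(n's)$, and, for a paraxial ray meeting the surface at a fixed height, the angle ratio $u'/u = s/s'$. The function names and the numbers (air to glass, $r = 0.10$ m, $s = -0.50$ m) are assumptions chosen for illustration.

```python
def refract_paraxial(s, n, n_prime, r):
    """Image distance s' from n'/s' - n/s = (n' - n)/r."""
    return n_prime / ((n_prime - n) / r + n / s)

def focal_lengths(n, n_prime, r):
    """Object-side and image-side focal lengths of a single spherical surface."""
    f = -n * r / (n_prime - n)
    f_prime = n_prime * r / (n_prime - n)
    return f, f_prime

# Assumed example: air to glass, convex side facing the light.
n, n_prime, r, s = 1.0, 1.5, 0.10, -0.50
s_prime = refract_paraxial(s, n, n_prime, r)
f, f_prime = focal_lengths(n, n_prime, r)
magnification = n * s_prime / (n_prime * s)   # y'/y
angle_ratio = s / s_prime                     # u'/u for a paraxial ray
```

Here `f / s + f_prime / s_prime` equals unity, and `n_prime * magnification * angle_ratio` equals `n`, which is the Helmholtz-Lagrange relation for unit object height and unit angle.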

With the help of (22) we can now determine the combined effect of 
two successive imaging processes in the general case where the indices 
in object and image spaces are not the same. Combining equation (14),
p. 402, with $d = D + f_1' - f_2$ and neglecting d, we have

$$\frac{1}{f'} = -\frac{D}{f_1' f_2'} = \frac{1}{f_2'} - \frac{f_2}{f_1' f_2'} = \frac{1}{f_2'} + \frac{n_1'}{n_2'}\,\frac{1}{f_1'}, \tag{24}$$

or, introducing the power $\varphi$,

$$n_2'\varphi' = n_2'\varphi_2' + n_1'\varphi_1'. \tag{25}$$

Thus for a lens of index $n_1$ having a medium of index $n_0$ to the left and
one of index $n_2$ to the right, the last equation becomes

$$n_2\varphi = n_1\varphi_1 + n_2\varphi_2,$$

where $\varphi_1$ and $\varphi_2$ are, respectively, the powers of the front and rear
surfaces.

* The factors 1/n and 1/n' imply only a difference of scale in the two spaces.

Up to now it has been shown that refraction by a spherical surface
can be described by a relation of the form of equation (7) by
identifying $-nr/(n' - n)$ with f and $n'r/(n' - n)$ with f', and by measuring
the distances s and s' from the unit planes, which coincide at the
vertex of the sphere. In this case the focal planes are really distant f
and f' from the vertex. If the distances are measured from the focal planes
($s = x + f$ and $s' = x' + f'$), then the first collinear equation of (6)
results. It remains to show that the second one is also satisfied. In
order to find the object-image relation for a point Q near the axis
(fig. 7), imagine the line PP' together with the other rays to be rotated
through a small angle about the centre M of the sphere. Then P goes
to Q and P' goes to Q'. The arcs PQ and P'Q' may, to the
approximation adopted, take the place of the axial distances y and y'. Then,
from the figure and from equation (19),

$$\frac{y'}{y} = \frac{s' - r}{s - r} = \frac{s' n}{s n'}.$$

If x and x' are measured from the focal planes, so that we take
$-s = -x - f$ and $s' = x' + f'$, then the use of (22) and of $xx' = ff'$ gives us

$$\frac{y'}{y} = -\frac{x'f + ff'}{xf' + ff'} = -\frac{x'(f + x)}{f'(f + x)} = -\frac{x'}{f'} = -\frac{f}{x},$$

so that also the second collinear equation is fulfilled.


(b) Imaging of points near the axis by means of wide pencils

As soon as wider pencils of rays are used, the sine and the angle
may no longer be taken equal and it is found that the image distance s'
depends on the angle $\epsilon$. Rays emanating from P no longer re-unite
in axial point P' but enclose a curve (or, in space, a surface of rotation)
called a caustic. It is possible, however, to find mathematically a
refracting surface such that all rays coming from P will converge again
at P'. To obtain the analytical equation it is only necessary to apply
the condition of constancy of optical path. For the case of reflection
this surface is obviously an ellipsoid of rotation having the given
points as foci, and it passes over into a paraboloid of rotation as P is
moved to infinity.
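The dependence of the image distance on the angle $\epsilon$ can be made visible by tracing single rays exactly. The Python sketch below is an illustration under assumptions: the vertex of the spherical surface is placed at z = 0 with the centre at z = r > 0, the law of refraction is applied in its vector form, and the function name and the numbers (air to glass, $r = 0.10$ m, $s = -0.50$ m) are hypothetical. For these data the paraxial image distance is 0.50 m.

```python
import math

def exact_image_distance(s, eps, n, n_prime, r):
    """Trace one meridional ray from the axial point z = s through a spherical surface.

    The vertex is at z = 0 and the centre of curvature at z = r (r > 0).
    Returns the z co-ordinate at which the refracted ray crosses the axis.
    """
    dz, dy = math.cos(eps), math.sin(eps)        # unit direction of the incident ray
    wz = s - r                                    # axial vector from the centre to P
    b = dz * wz
    c = wz * wz - r * r
    t = -b - math.sqrt(b * b - c)                 # nearer intersection with the sphere
    kz, ky = s + t * dz, t * dy                   # point of incidence K
    nz, ny = (kz - r) / r, ky / r                 # unit normal, pointing towards the light
    cos_i = -(dz * nz + dy * ny)
    mu = n / n_prime
    cos_t = math.sqrt(1.0 - mu * mu * (1.0 - cos_i * cos_i))
    factor = mu * cos_i - cos_t
    rz, ry = mu * dz + factor * nz, mu * dy + factor * ny   # refracted direction
    return kz - ky * rz / ry                      # intersection with the axis y = 0

near_axis = exact_image_distance(-0.50, 1e-4, 1.0, 1.5, 0.10)
moderate = exact_image_distance(-0.50, 0.01, 1.0, 1.5, 0.10)
wide = exact_image_distance(-0.50, 0.10, 1.0, 1.5, 0.10)
```

The nearly axial ray reproduces the paraxial value, while the wider rays cut the axis progressively nearer to the surface; the envelope of such rays is the caustic mentioned above.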

The imaging of an axis point is of little practical importance if
even the neighbouring parts of an object lying in a plane normal to
the axis are not sharply focused. If the problem is to form an image
not only of an axial point but of an entire element of area lying normal
to the axis (something that can be done by using several refracting
surfaces), then a condition first discovered by Abbe must be satisfied.
This condition will now be deduced, using Fermat’s principle (p. 395).
Let the optical system, say a thick lens, be represented by the curved
lines in fig. 8. The point P has its image at P'; and $P_1$, which is in a
plane through P normal to the axis, has its image at $P_1'$, which lies
in a plane through P' and normal to the axis. Rays drawn through P
and $P_1$ parallel to the axis cross at F', while parallel rays drawn through
this pair of points at an angle u to the axis will meet at some point $F_1'$.
It will not be concluded that F' and $F_1'$ lie in a plane normal to the
axis; for this amounts to assuming that an infinitely distant plane
is imaged, point for point, in a focal plane. This is more than we wish
to prove, since our deduction is limited to an element of area. Let the
parallels to the axis enter the optical system at A and $A_1$ and leave
it at A' and $A_1'$. Similarly, let the oblique parallels enter at S and $S_1$
and leave at S' and $S_1'$. According to the Fermat principle, the
following relations will hold for the optical paths, which are denoted by
parentheses:

$$(PAF'P') = (PSF_1'P') \quad \text{and} \quad (P_1A_1F'P_1') = (P_1S_1F_1'P_1'). \tag{a}$$

The paths $(P_1A_1A_1'F')$ and $(PAA'F')$ are equal, since F' is the
intersection of two rays that are normal to an orthogonal surface (cf. p. 402).
Further, $(F'P_1')$ and $(F'P')$ are equal as far as small quantities of
second order, because of the smallness of $(P'P_1')$. Neglecting these
terms, we have

$$(PAF'P') = (P_1A_1F'P_1'). \tag{b}$$

Combining with (a),

$$(PSF_1'P') = (P_1S_1F_1'P_1'). \tag{c}$$

Drop a perpendicular from P on to $P_1S_1$, and call the foot of it N.
Then, because both rays are normal to the same orthogonal surface,

$$(PSF_1') = (NS_1F_1'). \tag{d}$$

Subtraction of (d) from (c) yields

$$(F_1'P') = -(NP_1) + (F_1'P_1'). \tag{e}$$

Now erect a perpendicular P'N' to the line $F_1'P_1'$. As far as second
order quantities,

$$(F_1'P_1') - (F_1'P') = (N'P_1'), \tag{f}$$

that is,

$$(NP_1) = (N'P_1'). \tag{g}$$

If the index of refraction is n in the object space and n' in the image 
space (the two indices would be different, for instance, for an immer- 
sion microscope objective), then 

$$(NP_1) = ny\sin u, \qquad (N'P_1') = n'y'\sin u'; \tag{h}$$

whence, according to (g),

$$ny\sin u = n'y'\sin u'. \tag{28}$$

In place of the Helmholtz-Lagrange equation we have the relation (28), 
while according to (11) and (22) collinear projection amounts to the 
same relation with the tangents in place of the sines of the angles. 
This shows that collinear projection of the entire space, using arbi- 
trarily wide pencils of rays, is fundamentally impossible. Similarly, 
we could proceed to show, as in the deduction of the sine law, that if 
the sine condition holds for a given pair of axial points it is impossible 
for it to be valid for any other pair. The special case of plane reflec- 
tion is an exception. Here the whole of space may be imaged collinearly 
using wide pencils, for since $n = -n'$, $y = y'$ and $u = -u'$, the sine and
tangent conditions are compatible in this instance.
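The incompatibility of the two conditions can be made explicit in one step. Suppose, purely for the sake of argument, that the sine relation and the corresponding tangent relation held together for the same pair of conjugate planes and for every angle u:

```latex
\begin{aligned}
n y \sin u &= n' y' \sin u', \\
n y \tan u &= n' y' \tan u'.
\end{aligned}
\qquad\Longrightarrow\qquad
\cos u = \cos u' \quad \text{(dividing the first relation by the second)},
\qquad\text{hence}\qquad
u' = \pm u, \quad n y = \pm\, n' y'.
```

Both conditions can therefore be satisfied at once only for this one special magnification, of which plane reflection is an instance; for any other magnification a wide pencil cannot satisfy the tangent relation of collinear projection and the sine relation simultaneously.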

(c) Imaging of points distant from the axis by means of narrow pencils 

Even an elementary pencil from a point off the axis will fail to focus 
at a point after refraction at a spherical surface. Moreover, it is readily 
seen that rays in the meridian plane — the plane containing the axis 
and the point — have a different point of intersection from those in 
the plane normal to this one. Nevertheless, according to the theorem 
of Malus, every elementary pencil has an orthogonal surface. 

Imagine the pencil so limited that the orthogonal surface in the
image space is bounded by two pairs of neighbouring lines of
curvature (fig. 9). As we know, neighbouring normals to the surface
intersect along the lines of curvature. The normals at A and B intersect
at E, those at C and D have their point of intersection at F. The
straight line EF is normal to the principal ray drawn through the
centre of the element of surface. In the same way, the second pair of
lines of curvature furnishes a line of intersection GH which is in a
direction at right angles to EF. The distances AE and AG represent
principal radii of curvature of the surface element. The rays reunite
at a single point only when these two radii are equal, otherwise there
are two focal lines whose distance apart is called the astigmatic
difference $\Delta$. An astigmatism of this kind occurs when the principal rays
meet a spherical refracting surface obliquely.

Ex. 104. Show that the laws of reflection and refraction hold also for the angles 
which the incident and reflected (or refracted) rays make with any plane contain- 
ing the normal to the surface of separation. 

Ex. 105. In a prism spectrograph, why are the lines curved ? 

Ex. 106. Compute the focal lengths and locate the principal planes of a thick 
lens of index n. 

Ex. 107. Find the form taken by the sine law for an infinitely distant axial 
point object. (Introduce the height h at which a ray coming from an axial point 
cuts the first surface of the system.) 

4. The Resolving Power of Optical Systems. 

Even if the science of geometrical optics would enable us to con- 
struct an objective which, by compensation of the various imper- 
fections of single-point image formation, brought all rays from a point 
source (e.g. a distant star) to a point focus, observation would still 
show the image to be a small disc surrounded by a system of rings. 
This pattern is due to diffraction by the edges of the lenses and dia- 
phragms. Although the point of observation is in the finite region — 
in our example the focal plane of the telescope objective — the case is 
one of Fraunhofer diffraction, since diffraction at the aperture gives 
rays in various directions; this is equivalent to infinitely distant point 
sources of various intensities. The evaluation of the diffraction integral 
may be performed for a circular opening with the aid of Bessel
functions. Qualitatively, the intensity variation is not different from that
found for a slit. However, for normal incidence, the first minimum
is in a direction $\theta$ given by

$$r\sin\theta = 0.61\lambda, \tag{29}$$

where r is the radius of the opening. A second point can be separated
from the first if the angular distance between the two is at least so
great that the maximum for the second point falls upon the first
minimum in the image of the other. Thus, for a circular aperture,

$$\sin\theta = \frac{0.61\lambda}{r}. \tag{30}$$

The “limit of resolution” thus defined cannot be influenced by
increasing the magnification; its value, however, is inversely
proportional to r, and this is the reason for the large dimensions of
telescope objectives.
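A short numerical illustration of the limit of resolution follows. The Python sketch assumes the criterion $r\sin\theta = 0.61\lambda$; the function name and the example (an objective of radius 0.5 m, wave-length 550 nm) are hypothetical.

```python
import math

def resolution_limit(r, lam):
    """Smallest resolvable angular separation in radians, from r sin(theta) = 0.61 lam."""
    return math.asin(0.61 * lam / r)

# Assumed example: objective of radius 0.5 m, green light.
theta = resolution_limit(r=0.5, lam=550e-9)
theta_arcsec = math.degrees(theta) * 3600.0
```

For this objective the limit is about 0.14 second of arc, and doubling the radius halves it.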

Ex. 108. Calculate the resolving power of a prism in terms of its dimensions 
and the dispersive power $dn/d\lambda$ of the substance of which it is made.


5. The Fundamentals of Interference Optics. Interference Fringes. 

In the construction of a certain class of optical instruments, the 
so-called “ interference ” instruments, the wave nature of fight 
is taken into consideration in so far as several coherent pencils 
are combined, this being done in such way as to allow for their dif- 



ference in phase. The diffraction phenomena occurring at the 
boundaries are not of essential importance in this type of device. 
If a plane wave be reflected from a wedge of small angle (fig. 10), the
superposition of the waves reflected from the front and back sides of
the wedge will yield an increased or decreased resulting disturbance,
depending upon the phase difference of the emerging waves. But the
phase difference depends only upon the optical path between the two
faces of the wedge; thus there will be places where the disturbance is
intensified and others where it is weakened. If the incident beam is
exactly parallel we shall not be able to see these places at all. As a
matter of fact, it is physically impossible to produce such a beam,
since it would require a source consisting of a single point of light
(whose surface brightness would therefore have to be infinite) placed
at the focus of a lens. In reality a source of light always has a finite
extension, and since every point of the focal plane corresponds to a 
given parallel bundle of rays, the resultant beam always has a certain 
divergence. We shall assume, however, that this divergence is so 
small and that the thickness of the wedge is also so small that the path 
difference at a given point on the wedge is the same for all rays. The 
rays incident at this point then give a resulting illumination whose 
intensity depends upon whether the path difference at the front sur- 
face is an integral number of whole wave-lengths or half wave-lengths. 
In the former case, the resulting illumination is intense; in the latter 
case, weak. The eye or a photographic objective focussed on the
upper surface of the wedge will observe a system of light and dark
bands, or “interference fringes”. These are the “curves of equal
thickness” of the wedge. As the angle of the wedge is increased, or as
its thickness is made greater, even the rays at a given point have
different path differences and the visibility of the fringes decreases
considerably even for monochromatic light.

On the other hand, if we allow a divergent beam to fall upon a
plane parallel plate, the difference in inclination gives rise to a
difference in path of the rays reflected from the two surfaces. If the index
of refraction of the plate or film is the same as that of the surrounding
medium, as in the case of the Fabry-Pérot interferometer, the path
difference for an angle of incidence α is (cf. fig. 11)

$$AC + CD - AB = \frac{2d}{\cos\alpha} - 2d \tan\alpha \sin\alpha = 2d \cos\alpha. \qquad (31)$$

In those directions for which 2d cos α = mλ there is light, while for
those given by 2d cos α = (2m + 1)λ/2 there is darkness. All rays
making an angle α with the normal to the surface yield a circle in the
focal plane of an objective focussed for infinity, on account of the
complete symmetry about the normal, so that the fringe system consists
of concentric circles. The rings become narrower if the thickness of 
the plate is increased, but — unlike the previous case — their visibility 
is limited only to the degree that the light source is not strictly mono- 
chromatic. If we assume that a spectral line consists of, say, two 
strictly monochromatic components, there will be a condition where 
the interference fringes of one component fall between those of the 
second, making the resultant pattern indistinct. In reality, no 
spectral line is strictly monochromatic; the fact that a radiating 
atom sends forth a succession of finite wave trains means that the 
spectral line actually consists of a continuous distribution having a 
sharp maximum. This may be seen by considering the finite wave 
train analysed into an equivalent sum of Fourier components. The 
result is that the interference effects due to the different wave-length 
regions overlap when the path difference is large. 
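
The ring condition 2d cos α = mλ can be turned into numbers with a short sketch. The function name and the plate data (d = 0.25 cm, λ = 5 × 10⁻⁵ cm) are assumptions made for illustration; the small guard terms only protect against floating-point rounding.

```python
import math

def bright_ring_angles(thickness: float, wavelength: float, count: int) -> list[tuple[int, float]]:
    """Return (order m, angle alpha in radians) for the innermost bright rings,
    using 2 d cos(alpha) = m * wavelength. Lengths in the same unit."""
    ratio = 2.0 * thickness / wavelength
    m_max = math.floor(ratio + 1e-9)      # highest order, guarded against rounding
    rings = []
    for m in range(m_max, max(m_max - count, 0), -1):
        cos_alpha = min(1.0, m / ratio)   # clamp tiny floating-point overshoot
        rings.append((m, math.acos(cos_alpha)))
    return rings

# Assumed illustrative plate: d = 0.25 cm, wavelength 5e-5 cm.
rings = bright_ring_angles(0.25, 5e-5, 4)
for m, alpha in rings:
    print(m, f"{math.degrees(alpha):.3f} deg")
```

The highest order lies at the centre of the pattern, and successive rings of lower order lie at increasing angles that crowd together, since cos α changes slowly near α = 0.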

If we have but two interfering rays, the vector diagram indicates 
a decline of intensity according to the formula 

$$I = \cos^2\frac{\varphi}{2}, \qquad (32)$$

where φ is the phase difference. If we allow a larger number of rays
to interfere we obtain a considerably steeper decline of intensity, 
and hence greater resolving power for wave-lengths which are close 
together (cf. p. 52). This condition may be realized by lightly 
silvering the two surfaces of a plane parallel plate, so that these 
surfaces still transmit part of the light falling on them. Light 
transmitted by the system consists of a large number of partial rays 
(in reality infinitely many, with decreasing intensity) which have 
experienced various numbers of reflections. This arrangement is used 
as a spectroscope of very high resolving power, and is the familiar 
interferometer of Fabry and Perot. If we had a finite number of partial 
beams of equal intensity we could apply the results obtained for the 
grating by means of the vector diagram. We found the resolving power 
of a grating to be (p. 384) 

R = Nm (33) 

The difference here is that the number of rays of a given intensity
is small, while the order, i.e. the path difference for neighbouring
rays, is large in the present case. If the thickness of the plate is
0.25 cm. and we have green light, $\lambda = 5 \times 10^{-5}$ cm., incident normally,
$m = 2d/\lambda = 10^{4}$, while m is usually less than 5 for the grating. The
fact that we really have infinitely many rays whose intensity
decreases geometrically modifies the considerations somewhat. There
is no longer a sidewise minimum, and we must impose a criterion
which determines the distance of a second maximum which just may
be perceived separately. In practice, an interferometer of this type
is characterized by the equivalent number of rays of equal intensity,
a number around 20. Thus in our case this implies a resolving power
of about $2 \times 10^{5}$. To obtain an idea of what this means, let us
consider the two D-lines of sodium, λ = 5890, 5896 Å. The fractional
wave-length difference is about $10^{-3}$, so that the above interferometer would
be capable of resolving two lines whose separation is about one
two-hundredth that of the sodium lines.
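
The arithmetic of this example may be checked with a minimal sketch, assuming that the resolving power is taken as R = Nm with m = 2d/λ and N the equivalent number of rays (20, as quoted above). The function and variable names are illustrative only.

```python
def interferometer_resolving_power(thickness: float, wavelength: float,
                                   equivalent_rays: float) -> float:
    """R = N * m with order m = 2 d / wavelength (normal incidence)."""
    order = 2.0 * thickness / wavelength
    return equivalent_rays * order

# Values from the text: d = 0.25 cm, wavelength 5e-5 cm, about 20 equivalent rays.
R = interferometer_resolving_power(0.25, 5e-5, 20)

# Sodium D-lines (angstroms): fractional wave-length difference.
lam1, lam2 = 5890.0, 5896.0
fractional = (lam2 - lam1) / ((lam1 + lam2) / 2.0)

# How many times finer than the D-line separation the instrument can resolve.
margin = R * fractional
print(f"R = {R:.3g}, fractional difference = {fractional:.2e}, margin = {margin:.0f}")
```

The margin comes out close to 200, in agreement with the statement that lines one two-hundredth as far apart as the sodium lines could still be separated.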

One disadvantage of this type of interference spectroscope, how- 
ever, is its small region of dispersion, which makes it necessary to 
use this instrument by crossing it with another type of dispersive 
instrument, e.g. a grating. If the incident light embraces a consider- 
able wave-length region, the resulting interference pattern will be 
hopelessly complicated. The “region of dispersion” Δλ is that
difference in wave-length for which the m-th order interference pattern
of one wave-length coincides with the (m + 1)-th order pattern of the
other. Then

$$m(\lambda + \Delta\lambda) = (m + 1)\lambda, \qquad \text{i.e.} \qquad \Delta\lambda = \frac{\lambda}{m}.$$





Thus the region of dispersion in the example above is one-tenth the 
separation of the sodium lines. 
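
A short check of this statement, assuming the relation Δλ = λ/m that follows from the coincidence condition, with m = 2d/λ for the plate of the example (d = 0.25 cm, λ = 5 × 10⁻⁵ cm). The names are illustrative.

```python
def region_of_dispersion(wavelength: float, thickness: float) -> float:
    """Delta-lambda = wavelength / m with m = 2 d / wavelength (same length unit)."""
    order = 2.0 * thickness / wavelength
    return wavelength / order

# Plate of the example, all lengths in cm.
delta_lambda = region_of_dispersion(5e-5, 0.25)

# Separation of the sodium D-lines, 6 angstroms = 6e-8 cm.
sodium_separation = 6e-8
ratio = delta_lambda / sodium_separation
print(f"region of dispersion = {delta_lambda:.1e} cm, ratio to D-line separation = {ratio:.3f}")
```

The region of dispersion is about half an ångström, roughly one-tenth of the separation of the D-lines, which is why a second dispersive instrument must be crossed with the interferometer.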


Ex. 109. Calculate the difference in path of rays reflected from the two
surfaces of a plane parallel plate of glass if the rays are parallel and enter the plate
at an angle of incidence α. The outside medium is air.




PART IV


The Theory of Electricity 
II. The Atomistic Nature of Electrical Phenomena 

In the preceding part of this work we considered a theory which found its
complete formulation, for stationary bodies, in the Maxwell Field Equations.
This theory made use of a number of experimentally determinable material
constants: the conductivity σ, the dielectric constant K and the magnetic
permeability μ, as well as the index of refraction n which is derivable from these
quantities. But, as mentioned in several instances, these quantities are not
invariable material constants, but depend upon temperature, pressure, and (for
rapidly varying fields) upon the frequency. By its very nature, the field theory
is powerless to give any information concerning this variability of σ, K and μ;
rather, we must try to construct a picture, based upon suitable assumptions as
to the structure of matter and of electrical charge, which furnishes the correct
functional dependence of these quantities upon the several variables. Thus, in
contrast with the field theory, which uses only macroscopic quantities, we must
employ concepts of a microscopic nature which, in part, are not capable of direct
verification and whose validity can be tested only by examining the conclusions
obtained from their use.




CHAPTER XXII 


Electrolytic Conduction 


1. The Fundamental Phenomena of Electrolytic Conduction and their
Interpretation.


We begin with the consideration of a phenomenon the experimental 
laws of which are not explicable by the Maxwell equations, but have 
been known for a long time. These empirical relationships led, at an 
early period, to hypotheses concerning the nature of electricity. The 
phenomenon referred to is the conduction of electricity in solutions 
of acids, bases, or salts — the so-called electrolytes. 

Electrolytic conduction follows Ohm's Law at all field strengths from 
the weakest to the strongest, i.e. at constant temperature the conductivity 
is independent of the field strength. Only recently have small departures 
from Ohm’s Law been found for the highest fields. This type of
conduction is accompanied by a transport of matter. The constituents
of the substance in solution are deposited at the electrodes in
accordance with the familiar Faraday Law, which states that the
electrolytic liberation of one chemical equivalent * of a substance always requires
the same quantity of electricity, viz.:


$$F = 2.893 \times 10^{14}\ \text{e.s.u./gm. mol} = 9649 \times 10^{4}\ \text{amp. sec./kg. mol}.$$

A chemical equivalent is understood to mean that mass of an element
or group of elements (chemical radical) which could displace one gramme
of hydrogen, i.e. 107.9 gm. of univalent silver, (1/2)(63.6) gm. of
bivalent copper, (1/3)(138.9) gm. of trivalent lanthanum, &c.

If we look upon the atomic nature of matter as a proven fact,
as we now have every reason to do, the Faraday Law makes the
atomic character of electricity seem extremely probable. One
equivalent of silver contains $L = 6.02 \times 10^{23}$ silver atoms
($6.02 \times 10^{26}$ for the kilogramme equivalent). The transfer
of this number of atoms is associated with the transfer of a quantity
of electricity F, so that on the average each atom conveys a charge e
given by

$$e = \frac{F}{L} = \frac{9650 \times 10^{4}}{6.02 \times 10^{26}} = 1.60 \times 10^{-19}\ \text{coulombs} = 4.80 \times 10^{-10}\ \text{e.s.u.}$$
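
The division e = F/L is easily verified numerically. In the sketch below the values of F and L are those quoted in the text (per kilogramme equivalent); the conversion factor from coulombs to e.s.u. is the standard one and is not taken from the text.

```python
FARADAY_C_PER_KG_EQUIV = 9650e4   # amp. sec. per kilogramme equivalent (from the text)
AVOGADRO_PER_KG_MOL = 6.02e26     # atoms per kilogramme mol (from the text)
ESU_PER_COULOMB = 2.998e9         # standard conversion factor (assumption, not from the text)

# Average charge conveyed by each univalent atom.
e_coulomb = FARADAY_C_PER_KG_EQUIV / AVOGADRO_PER_KG_MOL
e_esu = e_coulomb * ESU_PER_COULOMB

print(f"e = {e_coulomb:.3e} coulombs = {e_esu:.3e} e.s.u.")
```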


* In the M.K.S. system the kilogramme mol must be used.

Since we always find this value in every experiment, it is natural to
assume that this quantity is not only the average value (which is all
we are justified in concluding thus far) but that every univalent atom
actually carries a charge e, every bivalent atom a charge F/(L/2) = 2e,
&c. The charge e may be called the elementary unit of electrical
charge, the “atom” of electricity. For reasons which we shall
discuss later, it is most commonly called the charge upon the electron.
The existence of this element of charge has been demonstrated 
beyond question by a large number of experiments. On account 
of their behaviour in electrolytes, we call atoms or atom groups 
which carry one or more elementary charges ions (Greek ἰόν, going).
What is the origin of the charged particles? We now know from
X-ray crystallography that a simple crystal of a salt, e.g. an NaCl
crystal, is built up not of neutral atoms, but of positive Na⁺ ions
and negative Cl⁻ ions. When dissolved in water the lattice
structure is destroyed, but it is very probable that the ions retain their
charges. It is, of course, possible that a certain part unite to form 
neutral NaCl molecules. 

In the case of electrical discharge in gases, the carriers of the cur- 
rent are formed by the field itself, as we shall see later. This cannot 
be true in the present case, since we find the same value of the con- 
ductivity for the weakest fields as we do for very strong ones. 

If, in one cubic centimetre of an electrolyte, we have $N_+$ positive
ions of valence $z_+$, and $N_-$ negative ions of a substance of valence $z_-$,
and if the speeds of the two kinds of ions are $v_+$ and $v_-$ respectively,
then the charge flowing through unit cross-section per second is

$$i = (z_+ N_+ v_+ + z_- N_- v_-)\,e, \qquad (1)$$

for as many positive ions pass through the unit cross-section each
second as are contained in a prism of length $v_+$, i.e. $N_+ v_+$. Similarly,
$N_- v_-$ ions pass in the opposite direction; this corresponds to a stream
of positive charges in the direction of the field. Thus i is the value of
the current density. Since the solution as a whole is neutral,

$$N_+ z_+ = N_- z_-, \qquad (2)$$

so that

$$i = N_+ z_+ e\,(v_+ + v_-). \qquad (1')$$

If Ohm’s Law is to hold, i.e. if i is to be proportional to the field
strength E, then according to equation (1), $v_+$ as well as $v_-$ must be
proportional to E and independent of the time. One might at first
imagine that a constant force zeE, which acts on an ion of valence
z, would produce an acceleration, i.e. a velocity varying with time.
But in addition to the field, the ion is subject to the frictional
resistance of its motion through the solvent. If we make the somewhat
bold assumption that Stokes’s Law (p. 222) holds even for spheres
of atomic size, we have a resistance f proportional to the speed:

$$f = 6\pi\eta R v, \qquad (3)$$

where R is the radius of the sphere and η is the viscosity of the
fluid. After a short period of acceleration there is equilibrium between
the force exerted by the field and the resisting force, so that

$$z e E = 6\pi\eta R v. \qquad (4)$$

The velocity attained in unit field is called the mobility of the ion:

$$u = \frac{v}{E} = \frac{ze}{6\pi\eta R}. \qquad (5)$$

The mobility is usually referred to a field of 1 volt/cm. Introducing
the mobility into equation (1) (p. 418), we have

$$i = (z_+ N_+ u_+ + z_- N_- u_-)\,eE = N_+ z_+ eE\,(u_+ + u_-), \qquad (6)$$

and the conductivity becomes

$$\sigma = (z_+ N_+ u_+ + z_- N_- u_-)\,e = N_+ z_+ e\,(u_+ + u_-). \qquad (7)$$


Thus the value of the conductivity alone does not give the mobility 
of each kind of ion, but only the sum of the two. In addition, we 
must know another relation between the two mobilities, e.g. that they 
are equal, or that their ratio has a given value. This may be deter- 
mined experimentally (cf. below). We see also from equation (5) that 
the mobility is greater the smaller the viscosity of the solvent. Now
the latter decreases, for all liquids, with rising temperature, so that
we have an explanation of the increase of electrolytic conductivity
with increasing temperature.

The ratio of the mobilities may be determined by measuring the
changes in concentration occurring at the electrodes when a current
passes. The natural diffusion tending to cancel these differences in
concentration is prevented by inserting a porous diaphragm into the
vessel. We shall compute the mobility ratio in this way for an
electrolyte whose ions are both univalent. There are deposited at the cathode
not only the cations which have migrated there, but also those set
free by the moving away of their oppositely-charged partners. That
is, only one kind of ion need have appreciable mobility in order that
there may be deposits at both electrodes. If it were otherwise, the
electrolyte would have an excess charge at the electrodes; this charge
would equalize itself by deposition of the superfluous ions. Fig. 1 is
a schematic diagram of an electrolytic cell in which we assume the
ratio of mobility of the cation to that of the anion to be 3 : 2. Only
the cations which are set free by the departure of the anions have
any effect on the change in concentration at the cathode. Hence the
change in concentration at the cathode, expressed in gramme molecules
of the solute, is

$$\Delta n_K = \frac{N_- u_- E}{L}, \qquad (8)$$

where E is the field strength and L is Avogadro’s Number, the
number of molecules per mol. The change in concentration at the
anode is

$$\Delta n_A = \frac{N_+ u_+ E}{L}. \qquad (9)$$

Since $N_- = N_+$ when both ions are univalent, we have

$$\frac{\Delta n_K}{\Delta n_A} = \frac{u_-}{u_+}. \qquad (10)$$

[Fig. 1: schematic diagram of an electrolytic cell, showing the ions deposited at the two electrodes]

The losses in concentration at the respective electrodes, expressed in gramme
molecules of the solute, are inversely proportional to the mobilities of the
ions arriving there. This relation was given by Hittorf.

In this form, the law holds also for an electrolyte whose ions have
valences $z_+$ and $z_-$. The chemical formula of the solute is then $K_{z_-}A_{z_+}$,
where K represents the cation, A represents the anion, and $z_+$ and $z_-$
are relatively prime. It is true that we now have $z_- N_- = z_+ N_+$ in
place of simply $N_- = N_+$, but the decrease in concentration at the
anode, expressed in gramme molecules of the solute, is calculated from
the decrease in the number of cations by dividing by $z_- L$ (cf.
the chemical formula), so that the valences cancel in forming the ratio.

The ratio of mobilities gives also the ratio of the parts of the total 
charge carried by each kind of ion. Denoting the fraction of the total 
charge carried by the cations by q+, that carried by the anions by 
q_ = 1 — q+, we have by equation (6) (p. 419), 

$$q_+ i = z_+ N_+ u_+ eE,$$
$$q_- i = z_- N_- u_- eE,$$

whence

$$\frac{q_+}{q_-} = \frac{q_+}{1 - q_+} = \frac{u_+}{u_-}. \qquad (11)$$


Following Hittorf, $q_+$ and $q_-$ are called the transport numbers.
According to equation (6) (p. 419) and equation (11) immediately
above, the individual mobilities of the ions may be determined from
measurements of conductivity and transport ratios. These mobilities are
very small. The following are several values in cm. sec.$^{-1}$/volt cm.$^{-1}$:

    Li+     $36.8 \times 10^{-5}$
    Na+     $46.0 \times 10^{-5}$
    K+      $67.6 \times 10^{-5}$
    Mg++    $47.6 \times 10^{-5}$
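
The determination of the individual mobilities from the conductivity and a transport number can be sketched as follows. The relations used are σ = N₊z₊e(u₊ + u₋) and q₊ = u₊/(u₊ + u₋); the function names, the assumed ion concentration (a 0.01 molar uni-univalent salt) and the anion mobility are hypothetical values chosen only to generate a consistent pair of "measurements".

```python
E_CHARGE = 1.602e-19  # elementary charge in coulombs

def conductivity(n_plus: float, z_plus: int, u_plus: float, u_minus: float) -> float:
    """sigma = N+ z+ e (u+ + u-); N+ in ions/cm^3, u in cm^2/(volt sec)."""
    return n_plus * z_plus * E_CHARGE * (u_plus + u_minus)

def transport_number_cation(u_plus: float, u_minus: float) -> float:
    """q+ = u+ / (u+ + u-), which follows from q+/(1 - q+) = u+/u-."""
    return u_plus / (u_plus + u_minus)

def mobilities_from_measurements(sigma: float, q_plus: float,
                                 n_plus: float, z_plus: int) -> tuple[float, float]:
    """Invert the two relations: the conductivity gives the sum u+ + u-,
    the transport number gives the share of each ion."""
    total = sigma / (n_plus * z_plus * E_CHARGE)
    return q_plus * total, (1.0 - q_plus) * total

# Hypothetical 0.01 molar uni-univalent salt: 6.02e18 ions of each sign per cm^3.
n_ions = 6.02e23 * 0.01 / 1000.0
u_cation = 46.0e-5   # Na+ value from the table above
u_anion = 68.0e-5    # assumed anion mobility, for illustration only

sigma = conductivity(n_ions, 1, u_cation, u_anion)
q_plus = transport_number_cation(u_cation, u_anion)
u_plus_found, u_minus_found = mobilities_from_measurements(sigma, q_plus, n_ions, 1)
print(f"sigma = {sigma:.3e} ohm^-1 cm^-1, q+ = {q_plus:.3f}")
print(f"recovered u+ = {u_plus_found:.3e}, u- = {u_minus_found:.3e}")
```

The conductivity alone fixes only the sum of the mobilities; the transport number supplies the second relation needed to separate them.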


If we use these values in conjunction with Stokes's Law to determine
the ionic radii, we obtain values of the correct order of magnitude,
$10^{-8}$ cm. But we see that these radii are not the same as those of
the free ions in the gaseous state, for, contrary to well-founded
results of atomic physics, they decrease in the order Li-Na-K, rather
than increase. The reason is that there exists a stronger electrostatic
field at the “surface” of a small ion than at the surface of a large one,
assuming the charge to be symmetrically distributed over a sphere.
This field causes the attachment of molecules of the solvent, the effect
becoming more feeble toward the outside. The observed radius is,
then, the effective radius of the ion together with its shell of molecules
of the solvent. For this reason the mobility of the bivalent Mg++
ion is not very much different from that of the similar univalent Na+
ion, although for a given field, the force on the former is twice as great.
If we omit from consideration the constituent ions of water, H+ and 
OH-,* we may say that the mobilities of all ions are approximately 
equal, for if an ion has a greater charge, and hence a greater driving 
force, it will also have a greater effective radius, and hence experience 
greater resistance to its motion. 
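
The effective radii implied by Stokes's Law, R = ze/(6πηu), may be estimated with the sketch below. The mobilities are those tabulated in § 1 (with the exponent 10⁻⁵ assumed for every entry); the viscosity of water, about 1.0 × 10⁻³ Pa s near room temperature, is an assumed value not given in the text.

```python
import math

E_CHARGE = 1.602e-19   # elementary charge, coulombs
ETA_WATER = 1.0e-3     # viscosity of water in Pa s (assumed, near 20 deg C)

# (valence, mobility in cm^2 per volt sec), values as tabulated in the text
MOBILITIES = {
    "Li+": (1, 36.8e-5),
    "Na+": (1, 46.0e-5),
    "K+": (1, 67.6e-5),
    "Mg++": (2, 47.6e-5),
}

def stokes_radius_cm(valence: int, mobility_cm2_per_volt_sec: float,
                     viscosity: float = ETA_WATER) -> float:
    """Effective radius R = z e / (6 pi eta u), returned in centimetres."""
    u_si = mobility_cm2_per_volt_sec * 1e-4           # convert to m^2/(volt sec)
    radius_m = valence * E_CHARGE / (6.0 * math.pi * viscosity * u_si)
    return radius_m * 100.0

radii = {ion: stokes_radius_cm(z, u) for ion, (z, u) in MOBILITIES.items()}
for ion, r in radii.items():
    print(f"{ion:5s} effective radius = {r:.2e} cm")
```

All four radii come out at a few times 10⁻⁸ cm, and they decrease from Li+ to K+, which is the behaviour attributed above to the shell of attached solvent molecules.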


2. Dependence of Electrolytic Conductivity upon Concentration. The
Theory of Debye-Hückel and Onsager.

Let there be an electrolyte in which ρ gramme equivalents are
dissolved in each cubic centimetre of solution, and let the fraction
ionized be α, while the fraction 1 − α remains in the form of
undissociated molecules, and so has no effect on the transport of electricity.
Then, by equation (7), the conductivity is

$$\sigma = \alpha \rho L e\,(u_+ + u_-) = \alpha \rho F\,(u_+ + u_-). \qquad (12)$$

The equivalent conductivity, Λ = σ/ρ, would be independent of
the concentration if

1. α were independent of the concentration, and if
2. the mobilities $u_+$ and $u_-$ were also independent of this quantity.

*■ oSrs.'r assir'x-sass ixxvz* m * «■* 
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The observed very pronounced dependence of A upon the concen- 
tration can be explained only by discarding one or both of the 
above assumptions. The first alternative, rejection of 1 and
retention of 2, corresponds to the classical theory of electrolytes
connected principally with the name of Arrhenius.* At first sight the
fundamental assumption of this theory seems very plausible: the 
frictional forces are assumed to result wholly from interaction with 
the molecules of the solvent. This is quite probable; for example, in 
a molar solution of NaCl the ratio of the number of ions per c.c. to 
the number of water molecules is about 18 : 1000. This hypothesis, 
however, overlooks the enormous electrostatic forces exerted by the 
elementary charge within a range of a few atomic radii. 

The presence together of molecules and their products of dis- 
sociation represents a type of chemical equilibrium governed by a 
thermodynamic law known as the Law of Mass Action (cf. p. 542 
et seq.). For the case at hand (degree of dissociation α, equivalent
concentration ρ) this rule takes the form

$$\frac{\alpha^2 \rho}{1 - \alpha} = K(T),$$

where the mass action constant K depends only upon the temperature
T. From this we see that α approaches the value unity as the
concentration is decreased. For small values of the concentration we
may set α = 1 in the numerator, in which case we see that the
degree of association 1 − α is proportional to the equivalent
concentration. If the classical formulation is correct, the ratio of the
equivalent conductivity Λ for finite concentration to that extrapolated to
zero concentration, $\Lambda_0$, must be equal to the degree of dissociation
α, which in turn must follow the law of mass action. This is found
to be true for the greater number of weak electrolytes, i.e. substances
(chiefly organic acids and salts) having low values of Λ at moderate
concentrations. On the other hand, for strong electrolytes (the
commoner inorganic acids, bases and salts) we find great deviations.
In some instances the expression

$$\frac{(\Lambda/\Lambda_0)^2\,\rho}{1 - \Lambda/\Lambda_0}$$



varies by several powers of ten. For this reason theories based on the
alternative point of view were proposed from the very beginning. These
speculations were based upon the assumption that dissociation is
complete even for relatively concentrated solutions (up to perhaps
1/100 molar), but that the ions exert mutually hindering forces upon
each other.

* S. Arrhenius, Zeitschr. phys. Chem., 1, p. 631 (1887); Lehrbuch der Elektrochemie,
Leipzig, Quandt & Händel (1901).

It was Debye and Hückel who first succeeded in
overcoming the extreme mathematical difficulties attendant upon the
computation of these inter-ionic forces. Their theory was considerably
improved later by L. Onsager. We are able to give only the main
features of the calculation here; the original papers should be
consulted for details.*
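
The behaviour of the degree of dissociation in the classical theory can be illustrated numerically. The sketch assumes the law of mass action in its usual form α²c/(1 − α) = K (the Ostwald dilution law), with c the concentration and K the mass action constant; the numerical value of K is hypothetical.

```python
import math

def degree_of_dissociation(concentration: float, k: float) -> float:
    """Positive root of alpha^2 * c / (1 - alpha) = K.

    Written as 2K / (K + sqrt(K^2 + 4 K c)), which is algebraically equal to
    (-K + sqrt(K^2 + 4 K c)) / (2 c) but avoids cancellation at small c.
    """
    return 2.0 * k / (k + math.sqrt(k * k + 4.0 * k * concentration))

K = 1.0e-3  # hypothetical mass action constant (same units as the concentration)
for c in (1e-6, 1e-5, 1e-4, 1e-3, 1e-2):
    alpha = degree_of_dissociation(c, K)
    print(f"c = {c:.0e}: alpha = {alpha:.4f}, 1 - alpha = {1 - alpha:.3e}, c/K = {c / K:.3e}")
```

For the smallest concentrations 1 − α is very nearly c/K, i.e. proportional to the concentration, and α tends to unity as the solution is diluted.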

If we consider a given positive ion and note the number of ions 
of both kinds in its neighbourhood, there will be, when averaged over 
a long time or over many positive ions, a preponderance of negative 
ions in its immediate vicinity. If we compute the average distribution
of charge in this region, we find it to consist of a spherically sym- 
metric negative charge whose density falls off rapidly with distance 
from the centre. Similarly, a negative ion is surrounded by a cloud 
of positive charges. The manner in which these clouds of charge come 
about may be seen by considering a crystal lattice (fig. 1, p. 573).
Here also, the nearest neighbours of each ion are ions of opposite sign.
We may picture the structure of an electrolyte in a similar way except
that the lattice structure is replaced by one which is more “plastic”.
The computation is made possible by imagining the charge in the 
neighbourhood of a single ion to be replaced by the average continuous 
volume charge obtained by averaging over the actual ion configurations 
from instant to instant. The density of this average ion cloud is indi- 
cated in fig. 2, Plate I, by different degrees of shading. 

* P. Debye and E. Hückel, Phys. Zeit., 24 (1923). L. Onsager, ibid., 27,
p. 388 (1926).

Besides the Stokes resistance, there is a retarding force of the
following nature: it requires a certain time for the cloud of ions to
attain its spherically symmetric equilibrium configuration. If the
central ion is moved forward a distance ds in the time dt, the
surrounding charges tend to retain their former configuration. This means that
there is a deficiency of charge of opposite sign on the forward side and
an excess on the rearward side. The electrostatic forces then retard
the motion of the ion. In addition to this “force of relaxation”, there
is a further retarding effect, the electrophoretic force. According to
p. 218, when a sphere moves through a viscous fluid, the velocity of
the fluid at the surface of the sphere is the same as that of the sphere,
and decreases outward. If, then, negative ions travelling in the
opposite direction are present in the neighbourhood of the positive ion,
the resultant effect of these ions is to oppose the motion of the positive
ion; otherwise stated, this ion moves through a region of the solvent
which is moving in the opposite direction. The results obtained
by quantitative investigation of these ideas are:

1. The mobility of a given ion depends not only upon the nature
of the ion and the solvent, but also upon the nature of the other ions
present. The independence of ionic motion is valid only for infinite dilution.
The numbers given in § 1 (p. 421) are limiting values of the mobilities
at zero concentration.

2. The two retarding forces are, like the Stokes resistance,
proportional to the velocities of the ions, to a first approximation. Thus
Ohm’s Law remains valid, although only to a first approximation for
low field strengths. As a matter of fact, M. Wien found
considerable departures from Ohm’s Law for very strong fields ($10^{6}$ volts/cm.).
He observed an increase in conductivity with increasing field, in good
agreement with the theory.* In the limit of infinitely strong fields the
added retarding forces vanish entirely, so that the remaining depar- 
ture from the limiting value of the conductivity for infinite dilution 
is correctly ascribed to association. 

3. For very dilute solutions the departures from the limiting value
of the conductivity, i.e. the quantities $1 - \Lambda/\Lambda_0$, are no longer
proportional to the concentration itself, as in the law of mass action,
but to the square root of the concentration. This rule was advanced 
as an empirical fact by Kohlrausch long before the theory was pro- 
posed. 

4. For a number of equally concentrated solutions of the same 
solute in different solvents, the departures are greater for the solvents 
of smaller dielectric constant. To see that the inter-ionic forces must 
increase with decreasing dielectric constant we need only recall that 
the expression for the force between two point charges has K in the 
denominator. 
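
The difference between the square-root law of result 3 and the linear law of the classical theory shows up as the slope of a double-logarithmic plot of 1 − Λ/Λ₀ against the concentration. The sketch below estimates that slope by least squares. The data are synthetic, generated from the two laws themselves with arbitrary, hypothetical coefficients; they are not measurements.

```python
import math

def fitted_exponent(concentrations: list[float], departures: list[float]) -> float:
    """Least-squares slope of log(departure) against log(concentration)."""
    xs = [math.log(c) for c in concentrations]
    ys = [math.log(d) for d in departures]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den

concentrations = [1e-4, 2e-4, 5e-4, 1e-3, 2e-3]

# Synthetic departures 1 - Lambda/Lambda_0 (hypothetical coefficients):
square_root_law = [0.5 * math.sqrt(c) for c in concentrations]   # Kohlrausch form
mass_action_law = [c / 0.05 for c in concentrations]             # 1 - alpha ~ c / K

print("square-root law exponent:", round(fitted_exponent(concentrations, square_root_law), 3))
print("mass action exponent:    ", round(fitted_exponent(concentrations, mass_action_law), 3))
```

With real conductivity data the same fit would indicate which of the two laws the electrolyte follows in the dilute range.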

The development of the cloud of ions about a given ion is imper- 
fect for rapidly alternating fields as well as for very strong fields. As 
a result, the added retarding forces are absent when high-frequency 
alternating currents, corresponding to a wave-length of 10 m. or less, 
are used. Going to still shorter waves, the electrolytic conductivity 
stops altogether when the wave-length reaches a few centimetres, as 
has been known for a long time both experimentally and theoretically. 
The reason is that the ions, on account of their inertia, are no longer 
able to follow the alternations of the field. This provides us with the 
connexion with optics. In the optical range, solutions of simple salts 
like NaCl are to be considered insulators, as is shown by their
transparency.

Ex. 110. In order to gain an idea of the magnitude of the electrostatic forces
calculate (a) the attraction between two elementary charges $e = 4.80 \times 10^{-10}$
e.s.u. which are at a distance of $10^{-8}$ cm. apart; (b) the attraction between the
positive and negative ions in 1 mol of rock salt which are separated and placed
at a distance apart equal to the earth’s diameter (12,700 km.); (c) the surface
pressure necessary to hold together a sphere of 10 cm. radius into which all the
ions of one sign are packed.

* M. Wien, Ann. d. Phys., 83, p. 327 (1927).




CHAPTER XXIII


The Conduction of Electricity in Gases


1. Direct Determination of the Elementary Electrical Charge by the 
Millikan Oil Drop Method. 

Before we treat the theory of electrical discharge in gases we wish 
to decide whether the value of the elementary charge on a univalent 
ion found in electrolysis is merely an average value (this is all we 
can infer so far) or the true value of a smallest unit of charge. The 
experimental proof of the correctness of the latter assumption was 
given by Millikan. The arrangement used by him was essentially 
that described on p. 263 . The procedure is to observe the motion, in 
an electric field, of small test bodies carrying a few elementary charges. 
Millikan used droplets of oil formed by atomization. Their motion 
was observed in a microscope, the droplets being illuminated from the
side. The droplets are in a vertical electric field. With the field off,
the terminal velocity of fall $v_0$ is determined by the equilibrium
between gravity and the Stokes resistance to motion through the air.
If we assume (as we may for oil drops) that the droplet is spherical
and of radius a, and if we call the density of the oil σ, that of the air
$\sigma_A$, and η the coefficient of viscosity of the air, then we have the
equation

$$6\pi\eta a v_0 = \frac{4\pi}{3}\, a^3 (\sigma - \sigma_A)\, g. \qquad (1)$$

From a measurement of $v_0$ we obtain the radius of the droplet and
thus its mass. If we now apply an electric field E which, with gravity,
urges the drop downward, and if we again determine the velocity v,
we have

$$6\pi\eta a v = e_i E + \frac{4\pi}{3}\, a^3 (\sigma - \sigma_A)\, g. \qquad (2)$$

The charge on the drop is then given by

$$e_i = \frac{6\pi\eta a\,(v - v_0)}{E}. \qquad (3)$$

The result of Millikan’s experiment is that the charges $e_i$ are small *
integral multiples of a unit whose magnitude is

$$e = 4.80 \times 10^{-10}\ \text{e.s.u.} = 1.60 \times 10^{-19}\ \text{coulombs}.$$

For the exact calculation it is necessary to introduce a kinetic theory 
correction to Stokes’s Law, since the size of the drop is no longer large 
compared with the mean free path of the air molecules. Several in- 
vestigators have expressed doubt as to the justification of some of the 
assumptions embodied in the experiment, viz. the accurately spherical 
form of the droplets, the identity of their density with that of the oil 
in bulk, and the decrease in mass due to evaporation during the period 
of observation. It is fairly certain, however, that any errors attributable 
to these causes are very small. 
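
The evaluation of an oil-drop measurement may be sketched as follows, without the kinetic-theory correction mentioned above. The radius is taken from the rate of fall without field, a² = 9ηv₀/[2(σ − σ_A)g], and the charge from the change in velocity, e_i = 6πηa(v − v₀)/E. All numerical values (drop radius, densities, viscosity, field strength) are hypothetical, and the "observed" velocities are generated from an assumed charge of three units so that the recovery can be checked.

```python
import math

E_CHARGE = 1.602e-19   # coulombs
G = 9.81               # m/s^2

def drop_radius(v0: float, eta: float, rho_oil: float, rho_air: float) -> float:
    """Radius from the terminal velocity of fall without field (Stokes's Law)."""
    return math.sqrt(9.0 * eta * v0 / (2.0 * (rho_oil - rho_air) * G))

def drop_charge(v0: float, v: float, field: float, eta: float,
                rho_oil: float, rho_air: float) -> float:
    """Charge from the velocities without (v0) and with (v) the field."""
    a = drop_radius(v0, eta, rho_oil, rho_air)
    return 6.0 * math.pi * eta * a * (v - v0) / field

# Hypothetical drop and apparatus (SI units).
eta, rho_oil, rho_air = 1.8e-5, 900.0, 1.2
field = 1.0e5            # volts per metre
a_true, n_true = 1.0e-6, 3

# Synthetic "observations" computed from the assumed radius and charge.
v0 = 2.0 * a_true**2 * (rho_oil - rho_air) * G / (9.0 * eta)
v = v0 + n_true * E_CHARGE * field / (6.0 * math.pi * eta * a_true)

q = drop_charge(v0, v, field, eta, rho_oil, rho_air)
print(f"radius = {drop_radius(v0, eta, rho_oil, rho_air):.3e} m, charge = {q / E_CHARGE:.3f} e")
```

In an actual experiment the charges found for many drops are compared, and the unit e is the largest quantity of which all of them are integral multiples.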

The Millikan experiment not only proves the existence of the 
quantum of electricity e, but also gives the most accurate value of this 
constant so far obtained. This quantity also furnishes a value of L, 
since the Faraday equivalent charge F = Le is directly measurable 
with great accuracy. There results 

$$L = 6.02 \times 10^{23}\ \text{mol}^{-1} = 6.02 \times 10^{26}\ (\text{kg. mol})^{-1}.$$

2. The Nature of the Cathode Ray Particles. The Electron. 

If a glass tube provided with two electrodes is pumped down to a 
pressure of a few thousandths of a millimetre of mercury, and if an 
electrical potential of several thousand volts is applied, one observes 
a green fluorescence on the wall of the tube opposite the cathode, what- 
ever may be the position of the anode. The fluorescence is even stronger 
if observed on a screen of zincblende placed within the tube opposite 
the cathode. The fluorescence is said to be caused by “ cathode rays ”. 
The fact that the fluorescence spot may be deviated by allowing the 
rays to pass through an electric or magnetic field is of prime impor- 
tance in obtaining a knowledge of the nature of these rays. The direc- 
tion of deviation is such as to indicate that the rays are composed 
of negatively charged particles. We obtain the same rays if we use 
as a cathode a wire filament heated to incandescence by an electric 
current. Moreover, with this cathode, we still obtain the rays when 
the tube has been pumped down to pressures at which it would no
longer pass a current with a cold cathode. This method of generating
cathode rays is to be preferred, as we shall see presently, and we shall
assume such an arrangement to be used in the experiments discussed
below. Let the anode be a plane metal plate having a small hole in it
and placed opposite the cathode. The greater part of the rays from
the cathode strikes the anode. A current-measuring instrument
connected between the electrodes indicates the passage of a current; thus
the cathode rays transport electric charges. Let the narrow beam which
passes through the hole in the anode strike a fluorescent screen some
distance away. The path of the ray between anode and screen may
be influenced by applying electric or magnetic fields. In analogy with
electrolysis, we assume the cathode rays to consist of particles of mass
m (still undetermined) and charge e. We note at once one important
difference between the two phenomena: if we reverse the potential
applied to the tube no current flows. The particles effecting the
transport of electricity in this instance are all of one charge, negative.

* Naturally, on account of unavoidable errors of measurement, only small multiples
can be demonstrated.

We can calculate at once the velocity of the particles from the con- 
servation of energy: 


$$eV = \tfrac{1}{2} m v_0^2, \tag{4}$$

or

$$v_0 = \sqrt{\frac{2eV}{m}}. \tag{4'}$$


The values of e and m are still unknown; we seek to obtain the neces- 
sary relationships from experiments on electrical and magnetic de- 
flection of the particles. The deflection by an electric field may be 
calculated very simply. Let the field be normal to the original direction
of motion, which we take as the x-axis of a rectangular system of
co-ordinates. Each particle is then subject to a constant force eE 
in the y-direction. The problem is thus identical with the computation
of the trajectory of a particle thrown horizontally in a gravitational
field. Starting with

$$m\ddot{x} = 0, \qquad m\ddot{y} = eE,$$

we integrate twice and apply the initial conditions, obtaining

$$x = v_0 t, \qquad y = \frac{eE}{2m}\,t^2.$$

Eliminating the time, the trajectory is found to be the parabola

$$y = \frac{eE}{2mv_0^2}\,x^2. \tag{7}$$

The electrical deviation after passing a distance a through the field
of strength E is then

$$y_{el} = \frac{eEa^2}{2mv_0^2} = \frac{Ea^2}{4V}, \tag{8}$$

i.e. by inserting the value of $v_0$ from (4'), e/m drops out of the equation.
Hence the electrical deflection alone gives only the potential V, which
is directly measurable in our case.
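That e/m drops out of the electrical deflection can be seen numerically. The following is a minimal sketch in SI units; the apparatus values and the specific charge used for the hydrogen ion are assumptions chosen only for illustration, and the function name is hypothetical.

```python
def electric_deflection(e_over_m, V, E, a):
    """Deflection after a path a in a transverse field E, using (4') and (8)."""
    v0_squared = 2.0 * e_over_m * V                   # from eV = (1/2) m v0^2
    return e_over_m * E * a**2 / (2.0 * v0_squared)   # y = e E a^2 / (2 m v0^2)


# Hypothetical apparatus values in SI units.
V = 2000.0   # accelerating potential, volts
E = 1.0e4    # deflecting field, volts per metre
a = 0.05     # length of path in the field, metres

y_light = electric_deflection(1.759e11, V, E, a)   # cathode-ray particle
y_heavy = electric_deflection(9.58e7, V, E, a)     # hydrogen ion (assumed value)
print(y_light, y_heavy, E * a**2 / (4.0 * V))
```

Both particles give the same deflection, equal to Ea²/4V, although their specific charges differ by a factor of about 1840.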

We seek to obtain a further equation from the magnetic deflection.
In order to simplify the arrangement we apply the field along the
z-axis in a direction normal to the initial direction of motion of the
particles. On p. 309 we calculated the force exerted by a magnetic field
on an element of a conductor carrying a current in a vacuum to be 

$$\mathbf{F} = \frac{1}{c}\,[\mathbf{i}\,\mathbf{H}]\,d\tau.$$

But a volume element containing charge of density ρ and moving with
velocity v is equivalent to a portion of a convection current of density
ρv (cf. p. 418). The charge e is equivalent to ρ dτ, so the force on the
element of current becomes

$$\mathbf{F} = \frac{\rho}{c}\,[\mathbf{v}\,\mathbf{H}]\,d\tau = \frac{e}{c}\,[\mathbf{v}\,\mathbf{H}].$$

It follows that the force is always normal to the vector v along the 
orbit, hence the magnetic field can do no work: it can change the 
direction of v but not its magnitude, so that the kinetic energy is 
not changed. Further, since F is also normal to H, no acceleration
occurs along the z-axis. Thus the z-component of the velocity remains
zero if— as we assume — it was zero when the particle entered the 
field. On the other hand, the field imparts a y-component of velocity 
to the particle. We have 


$$\mathbf{v} = \frac{dx}{dt}\,\mathbf{i} + \frac{dy}{dt}\,\mathbf{j}$$

and

$$\mathbf{H} = H\mathbf{k},$$

so that the component forms of the equation of motion are 

$$m\,\frac{d^2x}{dt^2} = \frac{e}{c}\,H\,\frac{dy}{dt}, \tag{10}$$

$$m\,\frac{d^2y}{dt^2} = -\frac{e}{c}\,H\,\frac{dx}{dt}. \tag{11}$$


Integrating once and applying the initial condition that the velocity 
has the direction of the x-axis and the magnitude v 0 when x = y = 0, 
we obtain 


$$\frac{dx}{dt} = v_0 + \frac{e}{mc}\,Hy, \qquad \frac{dy}{dt} = -\frac{e}{mc}\,Hx. \tag{12}$$


Squaring and adding these two equations and noticing that the magnitude
of the velocity is constant, we obtain the following equation
of the trajectory:

$$x^2 + y^2 + \frac{2mcv_0}{eH}\,y = 0. \tag{13}$$

This is the equation of a circle touching the x-axis at the origin and
of radius

$$\rho = \frac{mcv_0}{eH}. \tag{14}$$


The paths of charged particles are bent into circular arcs in a magnetic 
field normal to the initial direction of motion. These circles lie in a plane 
normal to the field and their radii are directly proportional to the initial 
velocity and inversely proportional to the strength of the field. If the 
magnetic field is of limited extent such that only the segment b of the
x-axis is in the field, and if H is not too great, $y^2$ may be neglected as
compared with y in the equation of the path, and we find the magnitude
of the magnetic deflection to be given by

$$y_{mag} = \frac{eHb^2}{2mcv_0} = \frac{Hb^2}{2c\sqrt{2V}}\sqrt{\frac{e}{m}}, \tag{15}$$

where $v_0$ is replaced by its value from equation (4') (p. 427). This
approximation is equivalent to neglecting the change in the components
of v caused by the field itself in the right members of (11),
i.e. to setting dy/dt = 0 and dx/dt = $v_0$. It is not possible to determine
e, m and $v_0$ separately by two equations, but equation (4') and
equation (15) above represent two equations for
$v_0$ and e/m. The numerical value of e/m thus found is

$$\frac{e}{m} = 5.273 \times 10^{17}\ \text{e.s.u./gm.} = 1.759 \times 10^{11}\ \text{coulomb/kg.},$$

i.e. 1840 times as great as the ratio of charge to mass for the lightest
electrolytic ion, the H ion. Since we always find the same value of
e/m regardless of whether we use a heated cathode or a cold one and
regardless of the nature of the residual gas in the tube, [...] the
magnitude of the electrolytic element of charge and thus conclude that
the particles in the cathode rays are 1840 times smaller in mass than
the hydrogen atom. This interpretation is the only logical one, for if we
assume the particles to have a mass equal to that of the [...]
charge and Avogadro's constant, the mass of the [...]

$$m_0 = 9.11 \times 10^{-28}\ \text{gm}.$$

The mass of the hydrogen atom is $1.67 \times 10^{-24}$ gm.

[...] measurements show the ratio e/m [...] decrease
as the velocity of the particles increases. [...]
succeed in demonstrating this variability [...]
same direction but with various velocities in such a way that the electric
and magnetic deviations were at right angles to each other. Take
a co-ordinate system in the plane of the observing screen, the origin
being at the point where the undeviated beam meets the screen, with
the x- and y-axes in the directions of electric and magnetic deviation
respectively. Then, according to equation (8) (p. 427) and equation
(15) (p. 429),

$$x = \frac{eEa^2}{2mv_0^2}, \qquad y = \frac{eHb^2}{2mcv_0}.$$

If we look upon v 0 as a continuously variable parameter, these are the 
parametric equations of a curve traced upon the screen by the elec- 
tronic rays of various initial velocities. Elimination of v 0 yields 

$$x = \frac{2mc^2Ea^2}{eH^2b^4}\,y^2,$$


a parabola passing through the origin and having as axis the x-axis. 
Even if the experimental curve does not extend quite to the origin, 
it is possible to determine the tangent at the origin (which corresponds 
to infinite velocity) by reversing the electric field. The result of the 
experiment was, however, that the tangent formed a small but un- 
mistakable angle with the y-axis. In addition to e/m, the equation of 
the curve contains only constants of the apparatus. The discrepancy
is, then, interpretable only by assuming that e/m varies, and in such
a way that it decreases as $v_0$ increases. Now the relativity theory
provides for an increase of inertial mass with increasing velocity
according to the formula (cf. p. 254)

$$m = \frac{m_0}{\sqrt{1 - v_0^2/c^2}},$$

while there is no reason whatsoever for assuming any variation of e.
As a matter of fact, Kaufmann’s results are in good accord with this 
formula. Abraham deduced a formula for the variability of mass, based 
upon the concept of a fixed ether, which gives slightly different de- 
partures. More exact experiments, however, gave results decidedly 
in favour of the relativity formula (see p. 478). The relationship be- 
tween potential drop V and velocity must also be modified from the 
relativistic point of view. The correct energy equation replacing 
equation (4) (p. 427) is

$$m_0c^2\left(\frac{1}{\sqrt{1 - v_0^2/c^2}} - 1\right) = eV. \tag{16}$$

Finally, all e/m determinations must be reduced to zero velocity; this
has already been done for the numerical values given above. 
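The size of the relativistic correction may be judged from a short calculation. The sketch below assumes the usual relativistic energy relation, in which the kinetic energy m₀c²(1/√(1 − v²/c²) − 1) is set equal to eV; the value of c and the function names are assumptions introduced for illustration, while e/m is the zero-velocity value quoted above.

```python
import math

C = 2.998e8            # speed of light, m/s (assumed value)
E_OVER_M0 = 1.759e11   # specific charge at zero velocity, coulomb/kg (from the text)


def classical_velocity(V):
    """Equation (4'): v0 = sqrt(2 (e/m0) V)."""
    return math.sqrt(2.0 * E_OVER_M0 * V)


def relativistic_velocity(V):
    """Solve m0 c^2 (1/sqrt(1 - v^2/c^2) - 1) = e V for v."""
    gamma = 1.0 + E_OVER_M0 * V / C**2
    return C * math.sqrt(1.0 - 1.0 / gamma**2)


for V in (1.0e3, 1.0e4, 1.0e5, 1.0e6):
    v_cl = classical_velocity(V)
    v_rel = relativistic_velocity(V)
    print(f"V = {V:9.0f} volts  classical {v_cl:.3e}  relativistic {v_rel:.3e} m/s")
```

At a few thousand volts the two values differ by well under one per cent; at a million volts the classical formula gives a velocity exceeding that of light, whereas the relativistic value remains below it.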

Ex. 111. In deflection experiments the screen is generally not exactly at 
the end of the field, but there is a field-free space between it and the edge of the 
field. Show that one may treat the deflections observed on the screen as if the 
rays were bent at the middle of the field by the angle which the tangent to their 
path at the end of the field makes with the original direction of motion. 

Ex. 112. Theory of the “Magnetic Lens”: A slightly divergent beam of
electrons emanates from a point source, the vertex angle of this cone of rays being
2a. Assume all the electrons to have the same speed v. Show that a magnetic
field parallel to the axis of the cone will re-unite the rays approximately at a 
point a certain distance away from the source. 


3. Survey of the Possible Methods of Generating Carriers of Electricity 
in Gases. 

Preliminary remarks concerning the structure of matter 
and the interaction of light and matter 

At the present time we are certain beyond doubt that the atom
is a complicated configuration of electric charges. The entire mass
of the atom, with the exception of the negligible mass of the outer
electrons, is concentrated in the positively charged “nucleus”,
whose diameter ($10^{-12}$ cm.) is vanishingly small compared with that
of the whole atom (kinetic theory radius $10^{-8}$ cm.). This nucleus is
surrounded by a shell of electrons which fill a space of about the size
of the kinetic theory volume. In an electrically neutral atom the
number of electrons outside the nucleus is equal to the resultant number
of positive elementary charges in the nucleus, and this number is
identical with the atomic number, i.e. the number giving the place of
the element in the periodic table. An atom which has lost one or more
of its outer electrons as a result of some external influence represents
a positive ion, while an atom with an extra electron is a negative ion.
A system having several nuclei is called a molecule. A molecule, as
well as an atom, may become either a positive or a negative ion. It
is chiefly the halogens, which come just before the noble gases in the
periodic system, which tend to form negative ions. These elements
form the singly-charged anions F⁻, Cl⁻, Br⁻, I⁻ [...]

To remove a single electron from an atom or from the surface of a
metal requires a definite amount of energy which is characteristic of
the particular substance. This work of escape is [...] specified by
giving the potential difference through which an [...]
ionization potential for free atoms varies from 3.88 volts for caesium to
24.48 volts for helium. Once an electron has attained the critical speed
it is able to dislodge an electron from the electron shell of an atom 
with which it collides. This happens relatively frequently as soon as 
the electrons have reached the critical speed, while the ionizing col- 
lisions of fast electrons are less frequent. The impact of a positive 
ion can also result in ionization of an atom, but the impinging particle 
must have a considerably greater speed, for according to the laws of 
impact, there is a considerable transfer of kinetic energy of trans- 
lation when two approximately equal masses collide. But even for 
speeds theoretically sufficient for ionization, the number of effective 
collisions is minimal. In contrast with electronic impact, the number
does not increase until the velocities become greater. Instead of a 
positive ion, a neutral atom may impinge, and again ionization will 
occur under certain conditions. Since neutral atoms usually possess 
velocity as a result of thermal energy of agitation, this process, which 
is of importance at temperatures of 3000° C. and beyond, is called 
thermal ionization. There are thermodynamic formulae which make 
possible the calculation of the degree of ionization of a gas at a given 
temperature and pressure. 

Finally, radiation of short wave-length (<3000 A.) also causes
ionization. In all energy exchanges between atoms and radiation, light
acts as if it were composed of discrete quanta of energy, of amount
proportional to the frequency of the light, which are transferred as units.
The factor of proportionality is denoted by h and is Planck’s celebrated
quantum of action, whose numerical value is

$$h = 6.62 \times 10^{-27}\ \text{erg sec.}$$

In order that there be ionization by light (photo-ionization) the
incident “light quantum” hν must be at least equal to the work of
ionization. Calling the latter $eV_i$, we have

$$h\nu \geq eV_i. \tag{17}$$

If we characterize the light by its wave-length instead of by its
frequency, insertion of the numerical values yields the easily-remembered
relationship between wave-length and equivalent electron energy

$$\lambda \times V_i = 12380$$

(λ in A.U., $V_i$ in volts).
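As an illustration of this rule, the longest wave-length capable of photo-ionization may be computed for the two extreme ionization potentials quoted earlier in this section (3.88 volts for caesium, 24.48 volts for helium). The sketch below uses only the product 12380 given above; the function name is hypothetical.

```python
LAMBDA_TIMES_VOLTS = 12380.0   # wave-length (A.U.) times potential (volts), from the text


def threshold_wavelength(potential_volts):
    """Longest wave-length, in A.U., whose quantum equals the given potential."""
    return LAMBDA_TIMES_VOLTS / potential_volts


for name, v_i in (("caesium", 3.88), ("helium", 24.48)):
    print(f"{name}: {threshold_wavelength(v_i):.0f} A.U.")
```

Caesium is thus ionized by light of wave-length below roughly 3200 A.U., helium only below roughly 500 A.U.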

Atoms struck by electrons whose speed is not sufficient to cause 
ionization may nevertheless be made to emit radiation provided that 
the electrons have fallen through a potential exceeding a value called 
the excitation potential $V_e$. Then equation (17) holds in reverse, i.e.

(17')

All the agents here described can also dislodge electrons from the
surface of a metal. Thus the impact of electrons or ions on a metallic
surface releases “secondary electrons”; light incident on a metal
releases “photo-electrons”; heat causes “thermal emission of
electrons”. The last-mentioned phenomenon is of particular interest.
Just as an absorbed gas escapes from the absorbing body when heat
is applied, electrons may be “evaporated” from a metal by heating
it to around 2000° C. The emission of electrons takes place at a lower
temperature (600° C.) if the metal is coated with a thin layer of
alkaline earth oxides.

For completeness we must mention the possibility of the for- 
mation of ions in the course of chemical reactions. An example is the 
ionization of air by slow oxidation of phosphorus or other substances. 

4. The Separately-sustained Electrical Discharge. Spark Discharge. 

A self-sustaining discharge is one in which the passage of the cur- 
rent itself continues to produce the necessary carriers of the charge, 
while a discharge in which these agents are produced entirely or in 
part by an external source of ionization, so that the discharge ceases 
when the source is turned off, may be referred to as separately sustained. 

The simplest example of a separately-sustained discharge is pure
electronic conduction in a high vacuum. The electrons are liberated
at the cathode either by radiation (photoelectric cell) or by heat, and
move to the anode under the influence of the applied field. The
maximum value of the current is attained when all the electrons liberated
from the cathode each second reach the anode. This current, which
cannot be increased by raising the potential difference between anode
and cathode, is called the saturation current. On the other hand, for
small negative values of the potential difference between the cold
electrode and the hot one, a very few high-speed electrons will succeed
in crossing the tube against the field. The voltage-current curve must
therefore have the general appearance shown in fig. 1. The rigorous
derivation for the linear case (extended parallel plate electrodes) was
given by Gans. The theory is somewhat complicated, since it is not
correct to assume a linear variation of potential from cathode to anode
(which would, of course, be valid if the cathode were cold) on account
of the distortion of the field by the cloud of electrons surrounding the
hot cathode. The fact that no appreciable current passes when the
potential difference is negative permits the use of the hot-cathode
tube as an electrical valve (rectifier). If a wire grating or grid through
which most of the electrons pass is interposed between anode and
cathode, the electron current arriving at the anode depends very
strongly on any difference of potential applied between the grid and
the cathode. As a result, even a small alteration in grid potential
causes the anode current to change considerably. This property,
the ability of the tube to act as a regulator and amplifier, makes the
three-electrode tube one of the most important devices in wireless
telegraphy and telephony.

[Fig. 1: voltage-current curve of the hot-cathode tube]

* The work of escape from a metal is less than that required to free an electron from
the atom.

We now consider the case in which the gas between the electrodes 
is artificially ionized, say by X-rays, while nothing unusual is assumed 
to occur at the electrodes. In this case, too, we may expect a satura- 
tion current, viz. when all the ions formed in the gas in unit time reach 
the electrodes without previously recombining with oppositely- 
charged ions. If q ion pairs are formed in each cubic centimetre per
second, and if the electrodes are l centimetres apart, the density of
the saturation current is $i_s = qle$. This seems paradoxical at first sight:
the greater the separation of the electrodes, the greater the current.
It is to be remembered, however, that this result refers to the highest
attainable current, and that the potential must be raised when the
electrode separation is increased in order to maintain the saturation
current.

We find experimentally that a new current increase occurs when 
the potential is raised further. This is due to the formation of new 
secondary ion pairs by the primary ions and electrons. Instead of 
treating this somewhat more complicated case, we shall investigate 
the simpler instance in which the primary electrons are formed at the 
cathode only, rather than in the gas. By means of light or some other 
agent, let $n_0$ electrons be liberated from each square centimetre of
the cathode every second. Then the saturation current will have
the density $i_s = n_0 e$. Let each electron form α secondary ion pairs
per centimetre of its path. Assume the velocity of the positive ions
to be insufficient for ionization. If, in addition to the $n_0$ primary
electrons, $n_1(x)$ secondary electrons pass through a cross-section at x
each second, a total of n(x) electrons, then between x and x + dx

$$dn = n\alpha\,dx$$

additional electrons are released each second. Integrating between
x = 0 and x = l, we obtain the total number of electrons arriving at
the anode

$$n = n_0 e^{\alpha l}, \quad \text{i.e.} \quad i = en_0 e^{\alpha l}. \tag{19}$$

This same value of the current density is equal to the total charge
on the positive secondary ions arriving at the cathode plus that of the 
departing negative primary electrons. The form of equation (19) 
shows that the photoelectric current may be increased enormously 
by filling the cell with gas. 
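The exponential growth expressed by equation (19) can be illustrated by adding up the contributions dn = nα dx layer by layer and comparing the sum with the closed form. This is a minimal sketch; the values of α and l are hypothetical and the function names are introduced only for illustration.

```python
import math


def avalanche_gain(alpha, l):
    """Electrons reaching the anode per primary electron, equation (19)."""
    return math.exp(alpha * l)


def avalanche_gain_numeric(alpha, l, steps=100_000):
    """Layer-by-layer summation of dn = n alpha dx, starting from n = 1."""
    n = 1.0
    dx = l / steps
    for _ in range(steps):
        n += n * alpha * dx
    return n


alpha = 5.0   # hypothetical: ion pairs per centimetre of path
l = 1.0       # hypothetical: electrode separation in centimetres
print(avalanche_gain(alpha, l), avalanche_gain_numeric(alpha, l))
```

With these assumed values each primary electron is multiplied more than a hundredfold, which indicates how strongly the gas filling can raise the photoelectric current.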

We observe no sharp limiting current because there is a second
possibility of ionization when the potential is increased, viz. ionization
by collision of positive ions with atoms. Assume each positive ion to
form β ion pairs per centimetre of path. Let $n_0$ primary electrons be
released from each square centimetre of the cathode per second by
some external agent, and let the total number of electrons arriving at
the anode be $n_1$. Since no positive ions are liberated at the anode,
$n_1$ is also the measure of the total current, which, at the cathode,
consists of the electron current plus the ion current. Considering again
a thin layer between x and x + dx (fig. 2), there are n pairs of ions
formed to the left of x per second and, neglecting infinitesimals,
($n_1 - n$) pairs formed on the right side. For a saturation current at
equilibrium we must also have n electrons crossing the left face of
the layer from the left, per second, and ($n_1 - n$) positive ions crossing
the right surface from the right. Hence the number
of electrons released on the path dx per second amounts to

$$dn = [n\alpha + (n_1 - n)\beta]\,dx = [n(\alpha - \beta) + n_1\beta]\,dx, \tag{20}$$

whence, by integration,

$$n(\alpha - \beta) + n_1\beta = Ce^{(\alpha - \beta)x}. \tag{21}$$

[Fig. 2]


We must have $n = n_0$ when x = 0, so that

$$C = n_0(\alpha - \beta) + n_1\beta. \tag{22}$$

The single constant of integration is thus determined. The boundary
condition for x = l leads to an equation for $n_1$:

$$n_1(\alpha - \beta) + n_1\beta = [n_0(\alpha - \beta) + n_1\beta]\,e^{(\alpha - \beta)l}. \tag{23}$$

Solved for $n_1$, this is

$$n_1 = \frac{n_0(\alpha - \beta)\,e^{(\alpha - \beta)l}}{\alpha - \beta e^{(\alpha - \beta)l}}. \tag{23'}$$

It is interesting to notice that the denominator becomes zero if

$$\alpha = \beta e^{(\alpha - \beta)l} \quad \text{or} \quad l = \frac{\log(\alpha/\beta)}{\alpha - \beta}. \tag{24}$$

In this case even a very small value of $n_0$ is accompanied by a very
large (mathematically infinite) value of $n_1$. Then the function of the
primary electrons is only to initiate the process known as electrical
breakdown or spark discharge.
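The approach to breakdown may be followed numerically. The sketch below assumes the solution for the number of electrons at the anode in the form n₁ = n₀(α − β)e^{(α−β)l} / (α − βe^{(α−β)l}), together with the corresponding critical separation l = log(α/β)/(α − β); the coefficients chosen are hypothetical, as are the function names.

```python
import math


def electrons_at_anode(n0, alpha, beta, l):
    """Electrons arriving at the anode when both electrons and ions ionize."""
    g = math.exp((alpha - beta) * l)
    return n0 * (alpha - beta) * g / (alpha - beta * g)


def breakdown_length(alpha, beta):
    """Electrode separation at which the denominator above vanishes."""
    return math.log(alpha / beta) / (alpha - beta)


alpha, beta = 5.0, 0.05   # hypothetical ionization coefficients per centimetre
l_b = breakdown_length(alpha, beta)
for fraction in (0.5, 0.9, 0.99, 0.999):
    l = fraction * l_b
    print(f"l = {l:.4f} cm  n1/n0 = {electrons_at_anode(1.0, alpha, beta, l):.1f}")
```

As l approaches the critical value the ratio n₁/n₀ grows without limit, so that the external agent serves merely to start the discharge. For β tending to zero the expression goes over into the simple exponential law found when only the electrons ionize.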

The ionization coefficients α and β will be proportional to the
number of collisions on unit path and also to the kinetic energy acquired
between two collisions. Since the number of collisions is proportional to
the pressure, while the energy acquired on the free path is proportional
to the ratio of the field strength E to the pressure p,* we can write

$$\alpha \propto p\,\varphi\!\left(\frac{E}{p}\right), \qquad \beta \propto p\,\psi\!\left(\frac{E}{p}\right). \tag{25}$$

Remembering that E = V/l for plane electrodes, and designating by
q a factor of proportionality which does not contain either V or p or
l, equation (24) yields

$$lp = q\,\frac{\log\dfrac{\varphi(V/lp)}{\psi(V/lp)}}{\varphi(V/lp) - \psi(V/lp)}.$$

In place of the ionization coefficients we now have functions of V,
l and p in the relationship for the spark discharge. Considering this
an equation for the spark potential V, we see that for plane electrodes
the spark potential is a function only of the product of electrode
separation and gas pressure, i.e. of the amount of gas between the electrodes.
This law, which was discovered empirically by Paschen, holds also
for spherical electrodes if the radii of the spheres are increased in the
same ratio as their distance apart.

Paschen’s law is experimentally well substantiated, but the starting 
mechanism assumed here is not the only one that leads to this 
result. In fact, according to present knowledge, ionization by positive 
ions plays only a minor role, while indirect release of electrons from the 
metal of the cathode is much more important. The latter process 
comes about in this way: An electron not only causes ionization but
— even before it has enough energy to do this — it excites the emission 
of light from atoms with which it collides. This radiation releases 
electrons from the cathode photoelectrically, since the work of escape 
from the metal is less than that from free atoms. The condition for 
initiation of the discharge is that each primary electron shall liberate 
at least one electron from the cathode. Let γ be the average number of
electrons released indirectly from the cathode by one electron. Then,
since the number of secondary electrons liberated in a distance l by a
single primary electron is, by the above, $e^{\alpha l} - 1$, we must require that

$$\gamma(e^{\alpha l} - 1) \geq 1. \tag{27}$$

Since the same thing is true for the excitation of atoms to produce
radiation as was said above for the coefficients α and β, we see that γ,
too, depends only on V/lp, and Paschen’s law holds for this starting
mechanism also.

* For the energy is proportional to the product of field strength by free path, and
the latter is inversely proportional to the pressure.
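The starting condition for this mechanism, namely that the electrons released indirectly from the cathode at least replace the primary electron, γ(e^{αl} − 1) ≥ 1, is easily evaluated. The sketch below solves it for the smallest electrode separation; the values of α and γ are hypothetical and the function names are introduced only for illustration.

```python
import math


def discharge_starts(alpha, gamma, l):
    """True if gamma (e^{alpha l} - 1) >= 1, i.e. the discharge sustains itself."""
    return gamma * (math.exp(alpha * l) - 1.0) >= 1.0


def minimum_gap(alpha, gamma):
    """Smallest separation l satisfying the starting condition."""
    return math.log(1.0 + 1.0 / gamma) / alpha


alpha = 5.0    # hypothetical: ion pairs per centimetre of electron path
gamma = 0.01   # hypothetical: cathode electrons released per secondary electron
print(f"minimum separation = {minimum_gap(alpha, gamma):.3f} cm")
```

Since γ enters only through a logarithm, even a very small yield at the cathode suffices to start the discharge once the avalanche factor is large.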

This assumed process seems to be valid up to pressures of about 
200 mm. of mercury. Cloud chamber photographs show that at higher 
pressures the discharge is initiated by “ canals ” which propagate 
themselves with the aid of irregularities in the field produced by the 
space charge due to positive ions (see next section). 

5. Self-maintaining Discharge: Glow and Arc Discharges. 

In the foregoing calculations the distortion of the field by space
charge was neglected in examining the initiation of discharge. Whether
this is justified seems doubtful at the present time (see preceding
section). At any rate, the steady forms of discharge, the main types
of which are the glow discharge and the arc, are characterized by
[...] considerably [...] cause a disruptive discharge to [...]

Fig. 3
1. Aston dark space
2. Cathode sheath
3. Crookes dark space
4. Negative glow
5. Faraday dark space
6. Positive column

[...] of the potential variation in a [...] tube
filled with gas at low pressure [...] segments of potential
curve manifest themselves in part by differences of luminosity.
The nomenclature is given in fig. 3. [...]
potential at the cathode [...]
is defined as the difference in potential between the cathode and the
point of minimum field strength, i.e. least potential gradient. This 
potential difference is independent of the current strength and of the 
gas pressure and depends only upon the nature of the gas and of the 
metal of the cathode, provided the current remains below a certain 
value. This limiting condition may be recognized visually by the fact 
that the cathode is just covered by the negative glow at that stage. 
The order of magnitude of the normal cathode drop is 200 volts. The 
cathode drop is the vital feature of a glow discharge; it corresponds to 
approximately the lowest potential with which a discharge can be 
maintained. What, then, is the source of the great lack of symmetry 
between cathode and anode? It is based upon the difference between 
the electron and the positive ion; this difference is responsible for 
quite different conditions for positive and negative space charges. On 
account of the relationship 


$$eV = eEl_f = \tfrac{1}{2}mv^2$$

($l_f$ = mean free path), an electron attains a velocity $\sqrt{1840A}$ times
as large as a positive ion of atomic weight A in falling through the
same potential difference. Moreover, on account of the vanishingly
small size of the electron, its free path $l_f$ is greater than that of the
positive ion. Thus the electron engages in fewer collisions and so can
accumulate more energy. According to the kinetic theory of gases,
the mean free path of an atom of radius $r_0$ projected into a cluster
of stationary atoms of radius $r_1$, of which the number per cubic
centimetre is $n_1$, is given by

$$l_f = \frac{1}{\pi(r_0 + r_1)^2 n_1}.$$

If these atoms are also in motion with velocities approaching that of
the impinging atom, the factor $1/\sqrt{2}$ must be supplied in the right
member. The velocity attained by the ions is of the order of magnitude
of the gas-kinetic velocity, so that the second assumption is valid
for them; the electrons, on the other hand, move so much faster that
we can consider the gas atoms relatively at rest. Hence for the
electron ($r_0 = 0$) the mean free path is

$$l_e = \frac{1}{\pi r_1^2 n_1},$$

while for an ion,

$$l_i = \frac{1}{4\pi r_1^2 n_1\sqrt{2}}.$$

In a given field the velocity of the electrons is to that of the ions as
$\sqrt{1840A \cdot 4\sqrt{2}} : 1$, or as $102\sqrt{A} : 1$.

The region of the discharge which is free of space charge is
characterized by linear potential rise, for Poisson’s equation (p. 268)
demands that $d^2V/dx^2$ vanish where there is no space charge. Hence,
at such places,

$$n_+ = n_-,$$

$$i = i_+ + i_- = en_+v_+ + en_-v_-,$$

so that

$$\frac{i_-}{i_+} = \frac{v_-}{v_+} = 102\sqrt{A}. \tag{32}$$

In the region of no space charge, i.e. essentially in the positive 
column, the portion of the current carried by the electrons is at least 
one hundred times that carried by the positive ions. 
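The factor of one hundred follows from the two ratios already obtained. Since the energy gained on a free path is eE l_f = ½mv², the velocity varies as the square root of l_f/m; the free path of the electron is 4√2 times that of the ion, and the mass of an ion of atomic weight A is 1840A times that of the electron. A minimal sketch of the arithmetic, with a hypothetical function name:

```python
import math

MASS_RATIO_HYDROGEN = 1840.0   # hydrogen atom mass / electron mass, from the text


def velocity_ratio(A):
    """Electron velocity : ion velocity in the same field, for atomic weight A."""
    free_path_ratio = 4.0 * math.sqrt(2.0)   # l_e / l_i from the two free-path formulas
    mass_ratio = MASS_RATIO_HYDROGEN * A     # M / m
    return math.sqrt(mass_ratio * free_path_ratio)


for A in (1.0, 4.0, 20.0):
    print(f"A = {A:4.0f}  velocity ratio = {velocity_ratio(A):.0f}")
```

For A = 1 the ratio is about 102, and it increases as the square root of A for heavier ions, so that the electrons always carry by far the greater part of the current where the two densities are equal.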

If we now assume for simplicity that the current at the cathode is
carried only by positive ions, that at the anode by electrons alone,
and that the field is the same on both sides, then, since i = Nev,
the ratio of the space charges Ne before cathode and anode is at least
$102\sqrt{A} : 1$. Therefore, on account of Poisson’s equation

$$\frac{d^2V}{dx^2} = -4\pi\rho = -4\pi Ne, \tag{33}$$


the potential curve must turn sharply downward at the cathode and 
slightly upward at the anode. Naturally, these considerations are only 
qualitative; in reality, part of the current at the cathode is carried 
by electrons dislodged from the metal by the impact of ions which 
have attained considerable speed in traversing the cathode fall. Pre- 
cisely this liberation of electrons is necessary for the maintenance of 
the discharge. 

In the absence of an external resistance for limiting the current 
the glow discharge can go over into the arc form, in which the current 
is several hundred times greater. The arc is distinguished from the
glow by the fact that the heating of the cathode causes it to release a
much greater electron current than would be furnished by positive
ion impact alone. Thus the arc type of discharge presumes the use [...]

In many arcs the temperature of the gas is so high that thermal
ionization is of considerable importance in addition to ionization by
electron impact. It follows from these remarks that the number of
charge-bearing particles increases considerably as the current is made
greater, since both the thermal emission of the cathode and the
thermal ionization of the gas increase exponentially with the
temperature. This causes the arc to have a so-called “falling characteristic”:
the resulting potential difference V across the arc and the current I
are related by an equation of the form

$$V = a + \frac{b}{I}, \tag{34}$$


which is the exact opposite of the ohmic relationship V = IR. 



A conductor having a falling characteristic V = f(I) must not be connected
directly to a source of potential, as may be seen by considering fig. 4. If, through 
some cause, the current should increase, the E.M.F. opposing the battery potential 
diminishes; the current then increases, causing the potential to drop further, 
resulting in an additional increase in current. On the other hand, for a conductor 
with a rising characteristic (e.g. an ohmic resistance), the required potential 
rises along with the current, so that the current returns to the equilibrium value, 
since the battery is incapable of giving the required E.M.F. when the current 
increases. One must therefore make certain, by inserting a conductor having 
a rising characteristic, that the characteristic of the entire system is an increas- 
ing one. 
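The stabilizing effect of a series resistance can be made concrete. The sketch below assumes, purely for illustration, a falling characteristic of the form V = a + b/I in series with an ohmic resistance R and a battery of potential U, so that U = a + b/I + RI; an operating point is taken as stable where the characteristic of the whole system rises, i.e. where R − b/I² > 0. All numerical values and function names are hypothetical.

```python
import math


def operating_points(U, R, a, b):
    """Currents satisfying U = a + b/I + R I, smaller root first."""
    disc = (U - a) ** 2 - 4.0 * R * b
    if disc < 0.0:
        return []   # the battery cannot maintain the arc at all
    root = math.sqrt(disc)
    return [((U - a) - root) / (2.0 * R), ((U - a) + root) / (2.0 * R)]


def is_stable(I, R, b):
    """Stable if d/dI (a + b/I + R I) = R - b/I^2 is positive."""
    return R - b / I**2 > 0.0


U, R, a, b = 100.0, 10.0, 40.0, 20.0   # hypothetical: volts, ohms, volts, watts
for I in operating_points(U, R, a, b):
    print(f"I = {I:.3f} amp  stable: {is_stable(I, R, b)}")
```

Of the two currents that satisfy the circuit equation, only the larger lies on the rising part of the combined characteristic; at the smaller one any chance increase of current grows further, as described above.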

6. The Origin of Cathode, Canal, and Positive Rays. The Mass Spectro- 
graph. 

Having obtained a broad view of the phenomena of the glow dis- 
charge, we are able to understand the origin of the various rays which 
are found in a discharge tube under favourable conditions. It must 
be remembered that there is a strong field near the cathode, and that
the positive ions attain a considerable velocity in this field. The cathode 
rays, recognizable at low pressures by their fluorescent effects, are 
a part of the electrons released from the cathode by positive ion impact, 
viz. those which have traversed the entire potential fall without ap- 
preciable energy loss by collision. But in order to maintain the dis- 
charge, it is necessary that a certain number of the liberated electrons 
produce new electrons and ions by impact in the gas.

The positive ions which have attained high speeds in the cathode
drop can be made visible in the form of rays by providing the cathode 
with small holes and making the space behind the cathode field-free. 
Part of the ions then pass through the holes in the form of “ canal 
rays ”, where they gradually lose their speed by impacts with the 
randomly-moving particles of the gas. The principal difference be- 
tween canal rays and cathode rays is that the positive ions consti- 
tuting the former are self-luminous, while the electrons are capable 
only of causing other substances to radiate. The Doppler Effect 
(p. 235) may be perceived in the light of the canal rays, as first recog- 
nised by J. Stark. As in the case of cathode rays, the velocity v and 
specific charge e/M may be determined by electric and magnetic de- 
flection. The results show — as may be expected — that the speeds are 
not alike, since the distances traversed by the various particles vary 
considerably. 

Rays of positive ions can be generated at the anode also if con- 
ditions are so arranged that a greater anode fall results, and if the 
anode is made of a material capable of emitting positive ions, e.g. a 
metallic salt. As mentioned above, the smallness of the anode fall is 
due to the high speed of the light electrons. If now the inertia of the 
negative charges is artificially increased by attaching them to atoms, 
the resulting field at the anode must be similar to that before the 
cathode. This loading of the electrons may be promoted by intro- 
ducing electronegative atoms into the gas, i.e. such atoms as the 
halogens, which have a tendency to form negative ions. This is done 
in practice by using a halogen salt as anode; the cations of the salt 
pass off as positive rays while the halogen evaporates as a neutral atom. 

The determination of e/M for canal rays and for positive rays is 
of particular importance, for it makes possible the determination of 
the atomic weight of individual ions. It is assumed, of course, that 
the charge on the particle is known; this must consist of an integral 
number of elementary charges, and there is usually no doubt as to 
the exact number. We see from equation (8) (p. 427) and equation (15) 
(p. 429) that the electric deflection is the same for all particles which 
have traversed the same potential drop, while the magnetic deflection 
depends upon e/M as well as upon the potential difference V. This 
circumstance may be used to overcome the troublesome lack of homo- 
geneity of the canal ray particles which, of course, have fallen through 
various potential differences. By means of an electric field, the par- 
ticles are first sorted according to V; then, by means of a slit $S_2$ (fig. 5), 
a narrow region containing only ions which have traversed the same 
potential drop is isolated; finally, by means of a magnetic field, the 
particles are sorted according to mass. This is the scheme of F. W. 
Aston's “mass spectrograph”. The sharpness of separation depends 
upon the homogeneity of the rays, which, in turn, depends upon the 
narrowness of the slit; this is limited by the fact that the intensity of 
the beam must not be made too low by undue reduction of the slit 
width. Aston’s idea was to focus the rays in such manner that rays 
within a given range of potential difference would be united at the 
photographic plate which serves to record them. Integration of equa- 
tion (5) (p. 427) gives the angular deviation in an electric field of length 
a, provided the angle is small: 

    $\alpha = \frac{dy}{dx} = \frac{eEa}{M v_0^2} = \frac{Ea}{2V}$,    (35) 

and the magnetic angular deviation in a field of length b is, by (15), 
p. 429, 

    $\beta = \frac{dy}{dx} = \sqrt{\frac{e}{M}}\,\frac{Hb}{c\sqrt{2V}}$.    (36) 

Call the distance between the mid-points of the electric and magnetic 
fields $l_1$ and let the distance travelled by the rays from the middle of 
the magnetic field to the detector be $l_2$. According to Ex. 111 (p. 431), 
we may consider the rays to come in a straight path from the centre 
of the field. The entire deflection, measured in a plane normal to the 
original direction of motion, is then given by 

    $h = (l_1 + l_2)\alpha + l_2\beta$.    (37) 


If it happens that the potential drop traversed by the rays is not 
strictly constant, but varies by an amount $dV$, then the angles of de- 
flection lie in an interval 

    $d\alpha = -\alpha\,\frac{dV}{V}, \qquad d\beta = -\frac{\beta}{2}\,\frac{dV}{V}$,    (38) 

as may be seen by logarithmic differentiation of equations (35) and 
(36). This means that rays within this interval which have fallen 
through different potential differences but have the same e/M form a 
pencil of cross-section 

    $dh = -\left[(l_1 + l_2)\alpha + \tfrac{1}{2}\,l_2\beta\right]\frac{dV}{V}$.    (39) 

This cross-section is reduced to zero by making the expression in 
brackets vanish. This can happen only when $\alpha$ and $\beta$ have opposite 
signs, i.e. when the magnetic deflection is opposite in direction to the 
electric. Set $\beta = -\gamma$. Since $l_1$ is fixed, and since $\alpha$ also has a fixed 
value for a given potential, the only remaining variable is the distance 
$l_2$ from the mid-point of the magnetic field to the point where the rays 
are received. The bracketed expression set equal to zero gives the 
equation of the focal curve in polar co-ordinates $l_2$, $\gamma$, i.e. the curve 
on which the pencils corresponding to various values of e/M have 
zero cross-section. This equation is 

    $l_2 = \frac{2\alpha l_1}{\gamma - 2\alpha}$.    (40) 

The polar axis is the direction of the magnetically undeviated ray 
(see fig. 5). If we choose a new axis making an angle $2\alpha$ with this one, 
the angle measured from the new axis is $\theta = \gamma - 2\alpha$, and the equation 
of the focal curve takes the simple form 

    $l_2\theta = l_2\sin\theta = 2 l_1\alpha$.    (41) 



This is, in fact, a straight line parallel to the new polar axis and dis- 
tant from it $2\alpha l_1$. This line is the intersection of the photographic 
plate with the plane of the figure when the plate is properly placed to 
receive the rays of a given e/M in sharp focus. The magnetic field has 
a dual function: it separates the rays corresponding to various masses 
and unites (focusses) the rays corresponding to particles having the 
same mass but originating at different points in the cathode drop. 
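
The focusing condition can be checked numerically. The following is only a sketch under stated assumptions: the function names and the numerical values (angles in radians, lengths in centimetres) are illustrative and are not taken from the text. It uses the scaling of the electric deflection as 1/V and of the magnetic deflection as 1/sqrt(V), and compares the width of the pencil at the focal distance of equation (40) with the width at an arbitrary other distance.

```python
import math

def focal_distance(alpha, gamma, l1):
    """Distance l2 of eq. (40) at which rays of one e/M come to a focus."""
    if gamma <= 2.0 * alpha:
        raise ValueError("focusing requires gamma > 2*alpha")
    return 2.0 * alpha * l1 / (gamma - 2.0 * alpha)

def deflection(x, alpha0, gamma0, l1, l2):
    """Total deflection h of eq. (37) with beta = -gamma.

    x = V/V0 is the potential drop relative to its nominal value;
    alpha scales as 1/V (eq. 35) and gamma as 1/sqrt(V) (eq. 36).
    """
    alpha = alpha0 / x
    gamma = gamma0 / math.sqrt(x)
    return (l1 + l2) * alpha - l2 * gamma

def spread(alpha0, gamma0, l1, l2, dx=0.01):
    """Width of the pencil for a relative potential spread of +/- dx."""
    return abs(deflection(1.0 + dx, alpha0, gamma0, l1, l2)
               - deflection(1.0 - dx, alpha0, gamma0, l1, l2))

# Illustrative (assumed) values, not data from the text.
alpha0, gamma0, l1 = 0.02, 0.10, 10.0
l2_focus = focal_distance(alpha0, gamma0, l1)
width_at_focus = spread(alpha0, gamma0, l1, l2_focus)
width_elsewhere = spread(alpha0, gamma0, l1, 10.0)
print(l2_focus, width_at_focus, width_elsewhere)
```

At the focal distance the first-order dependence of h on V cancels, so only a much smaller higher-order width remains.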

This “classical” example of a mass spectrograph, since surpassed 
by other designs, is one of the first instances of the use of electron optics. 
The term “focussing” has been current ever since. Electron optics is 
based on the formal correspondence between light paths in various 
refracting media and particle orbits in fields of force (cf. p. 787 et seq.). 
Following this analogy through, individual fields of force may be com- 
puted as lenses (as in Ex. 112, p. 431), which are the basic elements 
of electron-optical instruments such as the well-known electron micro- 
scope. 



CHAPTER XXIV 

The Fundamental Ideas of the Theory of 

Metallic Conduction 

1. Electrons as Carriers of Current in Metals. 

There are only a few solids which, like Cul, exhibit good electro- 
lytic conductivity. The current in such substances is carried by 
material ions, as may be seen by the deposits formed at the electrodes. 
The conductivity of that class of substances known as metals is many 
orders of magnitude greater — in fact, an essential characteristic of 
a metal is high electric conductivity unaccompanied by any transport 
of matter. It is therefore likely that the current is carried by elec- 
trons which pass through the meshes of the atomic or ionic lattice. 
From the validity of Ohm’s Law for metals we conclude, as in the case 
of electrolytes, that the reaction between the electrons and the lattice 
particles may be looked upon as a frictional force rv proportional to 
the velocity, so that the relationship 

— eE = rv (1) 

is again valid.* The direct proof that the current is carried by electrons 
in metals was given by Tolman in his determination of the value of 
e/m for the conduction electrons. Since an electron has inertial mass, 
this ratio must be determined from experiments in which the inertia 
of the electrons manifests itself. One might imagine, for example, an 
arrangement in which a metallic conductor is freely suspended. If 
the conductor is at rest while a current is flowing, and if the current 
is suddenly interrupted, the electrons continue to possess a certain 
amount of momentum which is destroyed by friction within the body. 
According to the principle of the Conservation of Momentum, the body 
as a whole must then experience a displacement. The opposite effect 
shows more promise of verification, for practical reasons: mechanical 
acceleration of the conductor should cause relative motion of the 
electrons, and therefore a current. Since we have very sensitive in- 
struments for measuring feeble currents, this method is easier to 


* The negative sign arises from the negative charge of the electron. 

apply; Tolman actually determined e/m in this way (1914-23). We 
examine the case of an infinite straight cylindrical conductor which 
experiences a constant acceleration a in the direction of its length. 
According to p. 231, each electron is subject to an inertial force 
-ma, and under the influence of this force and friction it attains 
a velocity 


    $v = -\frac{ma}{r}$.    (2) 



If there are N (negative) electrons per cubic centimetre, the density of 
the resulting current will be 




    $i = \frac{Nem}{r}\,a$.    (3) 



Now, on account of the relationship 

    $-eE = rv, \qquad i = -Nev$,    (4) 

or 

    $i = \frac{Ne^2}{r}\,E$,    (5) 

since we have always $i = \sigma E$, we find 

    $\sigma = \frac{Ne^2}{r}$.    (6) 


Hence the current which results from the acceleration is given by 
the equation 

    $i = \frac{\sigma m}{e}\,a$.    (7) 

Suppose now that in place of a straight conductor we have one of 
circular form that we accelerate by oscillating it around an axis normal 
to the plane of the circle. While it is true that the acceleration is no 
longer constant, (7) may be applied at any instant, understanding a 
to be the instantaneous acceleration. The direction of a, and hence of 
i, is understood to be along the tangent. We then have circular cur- 
rents, and the oscillation results in a magnetic field which varies with 
time. Its induction effects on a secondary coil may be computed by 
the results of p. 312, exactly as for a normally produced alternating 
current. Since (7) contains e/m as the only unknown, Tolman was able 
to calculate its numerical value. Within the limits of error, it agreed 
with that for free electrons. 
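
To give a feeling for the magnitudes involved, equation (7) may be evaluated numerically. This is a minimal sketch, assuming SI units (the form of (7) is the same in any consistent system); the conductivity of copper and the acceleration are assumed illustrative values, and the function names are hypothetical.

```python
E_CHARGE = 1.602176634e-19   # elementary charge, C
E_MASS = 9.1093837015e-31    # electron mass, kg

def tolman_current_density(sigma, a, e_over_m=E_CHARGE / E_MASS):
    """Current density i = (sigma * m / e) * a of eq. (7)."""
    return sigma * a / e_over_m

def specific_charge(sigma, a, i):
    """Invert eq. (7): e/m from measured sigma, a and i."""
    return sigma * a / i

sigma_cu = 5.96e7   # S/m, assumed conductivity of copper near room temperature
a = 100.0           # m/s^2, assumed acceleration of the conductor
i = tolman_current_density(sigma_cu, a)
print(i, specific_charge(sigma_cu, a, i))
```

The resulting current density is very small, which is why the experiment depends upon sensitive instruments.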


The next question is: “What is the state of the conduction elec- 
trons in metals?” The simplest and most fruitful assumption is that the 
electrons move about through the framework of atoms, like atoms of 
a gas. We have reason to believe that the number of free electrons is 
of the same order of magnitude as the number of atoms, so that the 
obstacles between which the electrons swarm are atoms with positive 
charges. This concept of an electron gas was used in Chap. XXIII, 
§ 3 (p. 431) to obtain a picture of the process of thermal emission 
of electrons. With this concept as a basis, we shall endeavour to 

obtain more information about the formally introduced frictional 
constant r. 


2. Derivation of Ohm’s Law for Metals. 

As a result of the heat content of the metal, the conduction elec- 
trons within the metal are in a state of chaotic thermal agitation, 
exactly as if they were atoms of an ordinary gas. Let the mean velocity 
of this motion be $v_w$. For simplicity in the calculations we deal with 
a single average velocity, while actually the velocities have a certain 
distribution about this mean value, as in the case of a gas. If an ex- 
ternal electric field is applied, a component which is along the field 
and which is the same for all electrons is added to the disorderly 
motion. The electron will not continue to take energy from the field 
indefinitely, but will soon lose the acquired kinetic energy by collision 
with an atom. We shall refer to a collision in which an electron gives 
up all its acquired energy to an atom as an effective collision. In 
addition, there may be elastic collisions which we need not consider. 
We call the distance traversed between two successive inelastic impacts 
the mean free path l. As long as we may regard the additional velocity 
imparted by the field as small compared with the thermal motion, 
the time between two inelastic impacts is given by 


    $\tau = \frac{l}{v_w}$.    (8) 


By Newton’s second law, the velocity attained in the field after $\tau$ sec is 

    $v_\tau = \frac{eE}{m}\,\tau$.    (9) 


Hence the average directed velocity component between two col- 
lisions is 

    $\bar{v} = \frac{eE\tau}{2m} = \frac{eEl}{2 v_w m}$.    (10) 


On account of the complete disorder of the thermal motion, that motion 
effects, on the average, no transport of charge. In computing the current 
density, then, only the additional velocity acquired in the field need 
be taken into account. If there are N free electrons per cubic centi- 
metre, the magnitude of the current density is 

    $i = Ne\bar{v} = \frac{e^2 N l}{2 m v_w}\,E = \sigma E$.    (11) 

Since none of the factors composing $\sigma$ depends upon the field E, we 
have here the proportionality between current density and field 
strength, which is equivalent to Ohm’s Law. 

With regard to the temperature dependence of N, l and $v_w$, we may 
say the following: According to our ideas concerning the structure of 
metals, N must in any case be temperature-independent. Further, con- 
sidering the electrons to form a monatomic gas, we have, according to 
the statistical law of equipartition (cf. p. 596), on the average, at a 
temperature T 

    $\frac{m}{2}\,v_w^2 = \frac{3}{2}\,kT$,    (12) 

where k is the Boltzmann constant, or “gas constant per molecule”, 
R/L. Thus $v_w$ increases with $\sqrt{T}$. If we make use of the experimental 
fact that the conductivity of all pure metals is inversely proportional 
to the absolute temperature, we can conclude that l decreases with 
$1/\sqrt{T}$, so the product $l v_w$ is independent of the temperature. 
Qualitatively, we can see at once that the temperature affects each of 
these quantities oppositely: an increase in temperature causes the 
lattice elements to vibrate with greater amplitude. This increases the 
number of collisions between electrons and atoms, causing a reduction 
of the mean free path. On the other hand, the mean thermal velocity 
increases. However, according to the newer theory (cf. next section), 
$v_w$ must be essentially independent of temperature and l must decrease 
as 1/T. 
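
The classical estimate of this section can be put into numbers. The sketch below assumes SI units, the equipartition velocity of equation (12) and the conductivity of equation (11); the electron density and conductivity used in the example are assumed round values for copper, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23           # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19   # elementary charge, C
E_MASS = 9.1093837015e-31    # electron mass, kg

def thermal_velocity(T):
    """Mean thermal velocity v_w from (m/2) v_w^2 = (3/2) k T, eq. (12)."""
    return math.sqrt(3.0 * K_B * T / E_MASS)

def conductivity(N, l, T):
    """sigma = e^2 N l / (2 m v_w), eq. (11)."""
    return E_CHARGE**2 * N * l / (2.0 * E_MASS * thermal_velocity(T))

def mean_free_path(sigma, N, T):
    """Invert eq. (11) for the mean free path l."""
    return 2.0 * E_MASS * thermal_velocity(T) * sigma / (E_CHARGE**2 * N)

# Assumed illustrative values: about one free electron per copper atom.
N_cu = 8.5e28       # electrons per m^3
sigma_cu = 5.96e7   # S/m
l_cu = mean_free_path(sigma_cu, N_cu, 293.0)
print(thermal_velocity(293.0), l_cu)
```

With N and l held fixed, the conductivity in this picture falls only as the inverse square root of T, so the observed 1/T law must be carried by l, as stated in the text.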


3. Conduction of Heat in Metals; the Law of Wiedemann and Franz. 

The fact that the thermal conductivity of metals is many orders 
of magnitude greater than that of solid dielectrics suggests that this 
property, too, is determined by the free electrons. If this is so, we can 
compute the thermal conductivity of metals on the same basis of an 
electron gas and bring it into relation with the electrical conductivity. 
For the sake of simplicity, we again ignore the statistical distribution 
of velocities and of free paths and use the single magnitude v, corre- 
sponding to the quantities denoted by $v_w$ above, and the single value 
l. We are, however, not justified in assuming that the com- 
ponents of v are equal for all electrons. Assume a temperature gradient 
in the direction of the z-axis. Consider the electrons passing in both 
directions through a cross-section at a height z. We assume that the 
energy of these electrons corresponds to that of the layer in which 
they experienced their last collision. If we denote the energy of an 
electron at height $z_0$ by $u_0$, and call the temperature $T_0$, then 


    $u(z) = u_0 + \left(\frac{du}{dz}\right)_0 z = u_0 + \frac{3}{2}\,k\,\frac{dT}{dz}\,z$.    (13) 


The electrons passing through each square centimetre of the section 
$z_0$ every second transport a total amount of energy 

    $S = \sum n_{v_z} v_z \left[u_0 + \left(\frac{du}{dz}\right)_0 z\right]$,    (14) 


where $n_{v_z}$ is the number of electrons per cubic centimetre which have 
the z-component of velocity $v_z$. The summation is to be extended 
over all $v_z$ which occur, and over all possible layers z. These, however, 
are not independent. If we have uniform velocity v and if we replace 
the distance between the plane z = 0 and the place of the last col- 
lision by the mean free path, we have 

    $\cos(z, v) = \cos\theta = \frac{v_z}{v} = \frac{z}{l}$.*    (15) 


Again, the expression $\sum n_{v_z} v_z$ vanishes; otherwise there would be an 
electric current $-e\sum n_{v_z} v_z$ flowing through the section $z = z_0$. The 
thermal energy transported each second is 

    $S = \sum n_{v_z} v_z\,z\,\frac{du}{dz} = \frac{3}{2}\,\frac{kl}{v}\sum n_{v_z} v_z^2\,\frac{dT}{dz} = \frac{3}{2}\,kNvl\,\overline{\cos^2\theta}\,\frac{dT}{dz}$.    (16) 

Now the spatial average of $\cos^2\theta$ is 1/3; thus the energy flux is given by 

    $S = \frac{1}{2}\,kNlv\,\frac{dT}{dz}$.    (17) 


The thermal conductivity is defined by 

    $S = \sigma_w\,\frac{dT}{dz}$, 

and so 

    $\sigma_w = \frac{1}{2}\,kNlv$.    (18) 


* It seems at first remarkable that we may put the mean free path l for the distance 
between the planes z = z and z = $z_0$, since all the electrons do not experience the 
next collision in the plane z = z. This will be discussed in more detail on p. 570. 

Here again we do not know the manner in which the separate factors 
depend upon the temperature. However, if we form the ratio to the 
electrical conductivity, a simple expression results: 

    $\frac{\sigma_w}{\sigma} = \frac{\frac{1}{2}\,kNlv \cdot 6kT}{e^2 Nlv} = 3\left(\frac{k}{e}\right)^2 T$.    (19) 

This expresses the law of Wiedemann-Franz, which states that the 
ratio of thermal to electrical conductivity is proportional to the absolute 
temperature. This law is well substantiated experimentally, and the 
factor of proportionality agrees, within certain limits, with the value 
$3(R/F)^2$. 
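
The cancellation of N, l and v in this ratio can be verified numerically. The following sketch assumes SI units and the classical expressions for the two conductivities derived above, together with the equipartition value of v; the input numbers are arbitrary assumed values and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23           # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19   # elementary charge, C
E_MASS = 9.1093837015e-31    # electron mass, kg

def electrical_conductivity(N, l, v):
    """sigma = e^2 N l / (2 m v)."""
    return E_CHARGE**2 * N * l / (2.0 * E_MASS * v)

def thermal_conductivity(N, l, v):
    """sigma_w = (1/2) k N l v, eq. (18)."""
    return 0.5 * K_B * N * l * v

def wiedemann_franz_ratio(T):
    """sigma_w / sigma = 3 (k/e)^2 T, eq. (19)."""
    return 3.0 * (K_B / E_CHARGE) ** 2 * T

T = 300.0
v = math.sqrt(3.0 * K_B * T / E_MASS)   # equipartition velocity
N, l = 8.5e28, 5.0e-9                   # assumed values; they cancel in the ratio
ratio = thermal_conductivity(N, l, v) / electrical_conductivity(N, l, v)
print(ratio, wiedemann_franz_ratio(T))
```

The factor of proportionality 3(k/e)^2 comes out at roughly 2.2 x 10^-8 in SI units (watt ohm per kelvin squared), whatever values of N and l are assumed.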


4. Objections to the Theory Developed Above. The Electron Theory of 
Pauli and Sommerfeld. 

Although the concept of an electron gas accounts for many pheno- 
mena of metallic conduction, the fact that some of its consequences 
lead to gross contradictions of experience must not be overlooked. 
Primarily, there is the question of the contribution of the electrons to 
the energy content, and hence to the specific heat, of the metal. 
According to the Law of Equipartition of Energy (p. 596), there 
corresponds to every degree of freedom which contributes to the 
energy of a system an amount of energy kT /2 per molecule. Assuming 
as above that the number of free electrons is of the same order of 
magnitude as the number of atoms, the energy content of the metal 
must be increased by (3/2)RT per mol, corresponding to the three 
components of the translatory motion of the electrons. Then the 
specific heat per mol, i.e. the temperature derivative of this energy 
content, does not have the value 6R/2 found for higher temperatures (Law of 
Dulong and Petit), but amounts to 9R/2, which is completely at 
variance with experience. This was first cleared up by recognizing 
that the electron gas has an enormous number of particles per unit 
volume— one electron per atom, or as many individuals per unit 
volume as there are atoms of the solid metal. Thus it is not legitimate 
to treat an electron gas by ordinary statistics. The theory developed 
by Pauli and Sommerfeld will be discussed later (Chap. XXXVIII). 


CHAPTER XXV 


Electron Theory of the Dielectric Constant, 

Index of Refraction and Magnetic Permeability 

1. The Origin of Electrical and Magnetic Polarization. 

While the dielectric constant and the permeability (or the electric 
displacement vector D and the magnetic induction vector B) are the 
quantities used in the field theory, they have no immediate physical 
interpretation in the atomistic theory of electrical phenomena. It is 
the polarization, i.e. the resultant dipole moment per cubic centimetre, 
whose interpretation can be given by the atomistic theory. The 
polarization P — in the magnetic case, the magnetization M — depends 
upon D and E or upon B and H according to the relationships given on 
pp. 280, 298: 

    $D = E + 4\pi P, \qquad B = H + 4\pi M$. 

The polarization vector is parallel to the field vector for isotropic 
bodies; for anisotropic substances (crystals not belonging to the 
regular system) P is a linear vector function of E. We restrict our 
present considerations to isotropic substances, for which the con- 
nexion between P and E or between M and H is given by a simple 
scalar magnitude, the susceptibility $\kappa$. 

A resultant dipole moment can originate in two ways in either the 
electrical or the magnetic case. Firstly, the molecules of the sub- 
stance may themselves possess a moment. In this instance the axes 
of the dipoles are randomly oriented in the absence of a field, so that 
there is no resultant moment. Application of an external field causes the particles to assume 
the direction of minimum energy, i.e. to set themselves parallel to the 
field. This adjustment, however, is opposed by the thermal agitation, 
so that the alignment can become perfect only for an infinitely strong 
field or when the temperature is reduced to absolute zero. It is evident 
that this kind of susceptibility depends strongly upon the temperature. 
In the magnetic case the term “ paramagnetic ” has been applied for 
a long time; in the electrical case the differentiation is less obvious (see 
below), and so this distinction has come to be applied only recently. 
The corresponding term “paraelectric” or “parelectric” is there- 
fore not yet in universal use. 

The other possibility is that the dipole moment of an individual 
molecule is induced by the field itself. This process is easy to compre- 
hend in the electrical case. The electronic shells are not rigidly attached 
to the core, so that an electric field causes a relative displacement of 
the two. On account of the fact that the charge on the electron is 
negative, the shells are moved in a direction opposite to the field, but 
since the dipole moment is defined as the limit of the product of charge 
by separation, multiplied by the unit vector drawn in the direction 
from the negative toward the positive charge, the moment itself has 
the direction of E. Hence the resulting “ dielectric polarization ” is 
in the direction of E. In the electrical case the dielectric and par- 
electric polarizations have the same sign. However, since the dielectric 
polarization does not involve molecular orientation, it does not depend on 
the temperature. Dielectric polarization occurs also for parelectric 
substances, since a relative displacement of positive and negative 
charges can occur here also. The distinction between the two forms 
will become clearer in the following paragraph. 

The origin of “diamagnetism” is somewhat more complicated. 
As shown on p. 664, an applied magnetic field imparts to the electrons 
an additional rotation about the direction of the field. This so-called 
Larmor precession takes place with angular velocity $\omega = eH/2mc$. 
This motion of the electrons gives rise to a magnetic moment which, 
as exact calculation shows, is in a direction opposite to the field. Thus 
the diamagnetic and paramagnetic susceptibilities are opposite in 
sign, and so the distinction between the two forms in the magnetic 
instance is the well-known one. The quantum theory requires that the 
paramagnetic moment of an atom be a multiple of a certain magnitude, 
the magneton; this is so large that for paramagnetic substances the 
diamagnetic susceptibility which is always present is relatively in- 
significant. 


2. Theory of Dielectric Polarization, Optical Index of Refraction and 
Dispersion. 

Assume that the substance under consideration contains N atoms 
or molecules per cubic centimetre, these particles themselves having 
no dipole moment. Suppose further that the distances between par- 
ticles are so great that their mutual influences may be neglected. In 
a field E every particle assumes a moment, which we may take as pro- 
portional to the field, for we are dealing with small displacements of 
the electronic systems, and may take the restoring forces to be pro- 
portional to the displacements. These forces hold the system in equili- 
brium in the field; hence the elongation, and thus the dipole moment, 
is proportional to the field strength. We then have 

    $p = \alpha E$,    (1) 

and the polarization becomes 

    $P = \kappa E = N\alpha E$.    (2) 

The dielectric constant is then given by 

    $K = n^2 = 1 + 4\pi N\alpha$,    (3) 

where the Maxwell relationship $K = n^2$ is assumed (p. 331). Inasmuch 
as we shall limit our present considerations to matter of very low 
density, we may replace $(n^2 - 1)$ by $2(n - 1)$ and obtain an approxi- 
mate formula for the refractive index: 

    $n - 1 \approx 2\pi N\alpha$.    (3') 

Since we assume a quasi-elastic restoring force, the application of an 
alternating field will cause displacements of the electron shells which 
depend upon the frequency. Assuming further that the polarization 
of each atom is caused by the displacement of an individual electron 
from its position of equilibrium, the equation of motion of the electron 
is that of a forced harmonic vibration: 

    $m\,\frac{d^2 s}{dt^2} + ks = -eE_0 e^{i\omega t}$.*    (4) 

The stationary solution is, according to p. 97, 

    $s = -\frac{eE_0 e^{i\omega t}}{k - m\omega^2}$,    (5) 

or introducing the natural frequency $\omega_0 = \sqrt{k/m}$, 

    $s = -\frac{eE_0 e^{i\omega t}}{m(\omega_0^2 - \omega^2)}$.    (6) 

Hence the dipole moment becomes 

    $p = -es = \frac{(e^2/m)\,E_0 e^{i\omega t}}{\omega_0^2 - \omega^2}$,    (7) 

and the polarizability is given by 

    $\alpha = \frac{e^2/m}{\omega_0^2 - \omega^2}$.    (8) 

If the number of electrons is f instead of one, the value is f times as 
great and the refractive index is given by 

    $n - 1 = \frac{Nf e^2/m}{2\pi(\nu_0^2 - \nu^2)}$,    (9) 

where $\omega$ has been replaced by $2\pi\nu$. 

* The effect of the magnetic vector of the light wave, given by equation (9) (p. 428), 
may be neglected, since the resulting electron velocities are small with respect to c, 
while E and H are of the same order of magnitude in our system of measurement. 

Even this simple dispersion formula contains all the essential 
properties of the optical index of refraction. The characteristic fre- 
quency $\nu_0$ is to be interpreted as the absorption frequency of the atom, 
since for this applied frequency the response of the atom is so great 
that all energy is taken from the primary ray and scattered in every 
direction. Now the absorption lines of free atoms lie almost ex- 
clusively in the ultra-violet region of the spectrum. Hence if we trace 
the course of the index in the visible region, $\nu < \nu_0$, and so the re- 
fractive index is always greater than unity. As we pass to shorter 
wave-lengths the frequency increases and we approach the charac- 
teristic frequency; the difference $\nu_0^2 - \nu^2$ becomes smaller and smaller 
and the index increases toward short waves. This is the usual be- 
haviour for glass and for liquids, and is called normal dispersion. 
Beyond the absorption band the index becomes less than unity accord- 
ing to equation (9), and so the rays in that region are refracted less than 
the longer ones. This reversal of the usual prismatic sequence of 
colours was first observed by Christiansen, using fuchsine, and is 
called anomalous dispersion, although from the point of view of the 
theory there is no anomaly whatsoever. 
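
The behaviour just described can be reproduced from the single-oscillator formula n - 1 = (N f e^2/m) / (2 pi (nu_0^2 - nu^2)). The sketch below assumes Gaussian (c.g.s.) units, as used in this chapter; the number density, the ultra-violet resonance frequency and the function name are assumed for illustration only.

```python
import math

E_ESU = 4.80320e-10    # elementary charge, e.s.u.
M_E_G = 9.10938e-28    # electron mass, g

def n_minus_one(nu, N, nu0, f=1.0):
    """Single-oscillator dispersion formula for a dilute gas.

    nu, nu0 in Hz; N in particles per cubic centimetre; f electrons per atom.
    """
    return N * f * E_ESU**2 / M_E_G / (2.0 * math.pi * (nu0**2 - nu**2))

N_gas = 2.7e19    # cm^-3, assumed density of a gas at normal conditions
nu0 = 3.0e15      # Hz, assumed resonance in the ultra-violet
red = n_minus_one(4.3e14, N_gas, nu0)
violet = n_minus_one(7.5e14, N_gas, nu0)
beyond = n_minus_one(4.0e15, N_gas, nu0)
print(red, violet, beyond)
```

Below the resonance the index exceeds unity and rises toward the violet; beyond it the computed n - 1 is negative, as the formula requires.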

The dispersion formula may be improved in several respects. 
Firstly, the fact that an atom or molecule may possess not one, but 
perhaps a large number of absorption lines must be taken into con- 
sideration. Each one contributes an oscillator whose strength is pro- 
portional to the intensity, so that we write 


    $n - 1 = \frac{Ne^2/m}{2\pi}\sum_i \frac{f_i}{\nu_i^2 - \nu^2}$.    (10) 


Moreover, the oscillations of the quasi-elastically bound electrons 
are not undamped; the radiation resulting from the oscillation (see 
p. 341) involves a certain damping which prevents the index from 
actually becoming infinite at the absorption lines, as in equation (10). 
If the damping is taken into account, the index becomes complex and 
the substance behaves like a metal near the regions of absorption. 
We shall not treat this case in more detail here. 

It is more important to take into consideration the mutual inter- 
actions of the dipoles, which cannot be neglected when we consider 
denser substances, such as liquids or solids. In such cases there is a 
microscopic field F at the place occupied by any selected electron. 
This field is not the same as the macroscopic field E defined, for 
example, by the difference of potential across a condenser. But in 
any case the susceptibility is defined as the ratio of the polarization 
to the macroscopic field strength. The difference between E and F is 
that all the charges contribute to the macroscopic field, while in 
the field acting upon the dipole under consideration, the 
latter is not to be included (cf. footnote, p. 266). In order to calculate 
the field of neighbouring dipoles, which is added to the applied field, 
we circumscribe a small sphere about the point occupied by the atom 
under consideration and examine first the contribution of the dipoles 
lying outside the sphere, i.e. we imagine all material removed from 
within the sphere. There is a surface charge upon the sphere, and the 
additional field has the value $+4\pi P/3$, according to p. 283, where we 
had the same surface charge, but with opposite sign (see also Ex. 75). 
According to a calculation made by Lorentz, the contribution 
of the particles within the sphere vanishes either when there is com- 
plete absence of order or when the atom under observation is a point 
of a regular crystal lattice, i.e. for all isotropic bodies. Then the dipole 
moment of the individual atom or molecule is 

    $p = \alpha\left(E + \frac{4\pi}{3}P\right)$,    (11) 

and we obtain the equation 

    $P = N\alpha\left(E + \frac{4\pi}{3}P\right)$    (12) 

for the polarization. Solving this equation for P yields 

    $P = \frac{N\alpha E}{1 - \frac{4\pi}{3}N\alpha}$.    (13) 


Solving for $N\alpha$ shows that the simple relationship between N, $\alpha$ and 
K (or $n^2$) given by equation (3) (p. 452) is to be replaced by 

    $\frac{4\pi}{3}N\alpha = \frac{K - 1}{K + 2} = \frac{n^2 - 1}{n^2 + 2}$.    (14) 

Hence for denser substances the dispersion formula is 

    $\frac{n^2 - 1}{n^2 + 2} = \frac{Ne^2/m}{3\pi}\sum_i \frac{f_i}{\nu_i^2 - \nu^2}$.    (15) 

As may be verified immediately, this formula becomes identical with 
equation (3) for $n \approx 1$. It is customary to deal with one mol of the 
substance, rather than with 1 c.c. Let the density be $\rho$ and the mole- 
cular weight M. Then 1 gm. contains L/M molecules, and 1 c.c. con- 
tains 

    $N = \frac{L\rho}{M}$.    (16) 

Using this expression in equation (15), we obtain 

    $\frac{n^2 - 1}{n^2 + 2}\cdot\frac{M}{\rho} = \frac{Le^2/m}{3\pi}\sum_i \frac{f_i}{\nu_i^2 - \nu^2}$.    (17) 

The quantity on the left, which has the dimensions of a volume, is 
called the molecular refraction. 
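
The relation between polarizability, refractive index and molecular refraction lends itself to a short numerical illustration. This is a sketch under stated assumptions: Gaussian units, hypothetical function names, and for the example the molecular weight (18.015) and density (1.0 gm. per c.c.) of water, with the index n = 1.33 quoted in Ex. 114.

```python
import math

def lorentz_factor(n):
    """(n^2 - 1)/(n^2 + 2), the left-hand side of eq. (14)."""
    return (n * n - 1.0) / (n * n + 2.0)

def index_from_polarizability(N, alpha):
    """Solve (4 pi/3) N alpha = (n^2 - 1)/(n^2 + 2) for n."""
    x = 4.0 * math.pi * N * alpha / 3.0
    if not 0.0 <= x < 1.0:
        raise ValueError("(4 pi/3) N alpha must lie in [0, 1)")
    return math.sqrt((1.0 + 2.0 * x) / (1.0 - x))

def molecular_refraction(n, M, rho):
    """Molecular refraction (n^2 - 1)/(n^2 + 2) * M/rho, in c.c. per mol."""
    return lorentz_factor(n) * M / rho

water = molecular_refraction(1.33, 18.015, 1.0)
dilute_n = index_from_polarizability(1.0, 1.0e-5)   # assumed N*alpha = 1e-5
print(water, dilute_n - 1.0, 2.0 * math.pi * 1.0e-5)
```

For a dilute substance the exact solution reduces to the approximate formula n - 1 = 2 pi N alpha, which the last two printed numbers compare.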

Extrapolation of equation (15) to infinitely long waves, i.e. to 
v = 0, gives the following equation for the static value of the dielectric 
constant: 

    $\frac{K - 1}{K + 2} = \frac{Ne^2/m}{3\pi}\sum_i \frac{f_i}{\nu_i^2}$.    (15') 

For many substances this does yield the true value, but for others — 
e.g. water — there are large discrepancies. The reason is that this 
method gives only the dielectric part of the polarization. As will be 
seen in the next section, the parelectric portion vanishes for optical 
frequencies, on account of the fact that the charges are not able to 
follow the field at such frequencies. Thus substances for which (15') 
gives the correct value of K are free of dipoles. 

Ex. 113. The refractivity of a substance of mass m and density $\rho$ is, in anal- 
ogy with the molecular refraction, given by the expression 

    $\frac{n^2 - 1}{n^2 + 2}\cdot\frac{m}{\rho}$. 

Show that the refractivity of a mixture is the sum of the refractivities of the 
components. 

Ex. 114. Formerly, the molecular refraction was often taken to be the actual 
volume occupied by the molecules. As a matter of fact, one obtains numerical 
values which agree well with the kinetic theory values. Show that this is so, 
for example, for water (n = 1.33). Show also that the numerical values of polariz- 
ability are those of metallic spheres of the gas-kinetic size (cf. p. 284). 

Ex. 115. Calculate phase velocity and group velocity of electro-magnetic 
waves in a medium containing N free electrons per unit volume (Heaviside Layer 
of the upper atmosphere). 

3. Parelectric Susceptibility. 

As explained in § 1 (p. 450), the application of a field to a substance
whose molecules possess an electric moment generates polarization by
lining up these dipoles. This phenomenon occurs only if the molecules
are free to move, i.e. principally in gases and in liquids, and
if the external field reverses slowly enough to permit the system to
come to equilibrium between reversals. The parelectric contribution,
which is responsible for the high value of the dielectric constant of
water, vanishes at frequencies corresponding to wave-lengths of a few
centimetres. In order to find the extent to which the dipoles are
aligned in terms of the temperature and field strength we make use of
the statistical theorem (p. 588) that the number of particles of a system
which are in a state of energy u is proportional to

    e^{-u/kT},

where k is the Boltzmann constant. Now the potential energy of a
dipole p in a field E is given by

    u = -\mathbf{p} \cdot \mathbf{E} = -pE \cos\theta \qquad (18)

If we denote the direction of the dipole axis by a point on the unit
sphere, the number of these points which lie in the zone θ to θ + dθ
(θ = co-latitude) is proportional both to the area of the zone and to
e^{-u/kT}. We may therefore set

    dN = C e^{pE\cos\theta/kT} \sin\theta \, d\theta, \qquad (19)

the factor 2π being contained in C. The value of this constant is
determined by integrating to obtain the total number N of particles
per unit volume:

    N = C \int_0^{\pi} e^{pE\cos\theta/kT} \sin\theta \, d\theta \qquad (20)

Make the substitution

    \frac{pE\cos\theta}{kT} = w;

then

    N = C \frac{kT}{pE} \int_{-pE/kT}^{+pE/kT} e^{w} \, dw = \frac{2CkT}{pE} \sinh\frac{pE}{kT}. \qquad (21)

Solving for C and using this value in equation (19), we have

    dN = \frac{N pE}{2kT \sinh(pE/kT)} e^{pE\cos\theta/kT} \sin\theta \, d\theta. \qquad (22)


If the axis of a dipole forms an angle θ with the field, its contribution
to the resultant parallel to the field is p cos θ. The components normal
to the field cancel out in summing over all dipoles, and we obtain the
resulting dipole moment per cubic centimetre, i.e. the polarization, by
integrating over all directions:

    P = \frac{N p^2 E}{2kT \sinh(pE/kT)} \int_0^{\pi} e^{pE\cos\theta/kT} \cos\theta \sin\theta \, d\theta. \qquad (23)

Making the same substitution as above, this becomes

    P = \frac{N p \, kT}{2pE \sinh(pE/kT)} \int_{-pE/kT}^{+pE/kT} e^{w} w \, dw
      = \frac{N p \, kT}{2pE \sinh(pE/kT)} \Big[ w e^{w} - e^{w} \Big]_{-pE/kT}^{+pE/kT}
      = N p \left( \coth\frac{pE}{kT} - \frac{kT}{pE} \right) = N p \, L\!\left( \frac{pE}{kT} \right). \qquad (24)

Following Langevin, to whom this theory is due, we have designated the
function coth x − 1/x by a single symbol, say L(x). We see that L(x)
approaches unity for large values of x; a power series development
shows that the function behaves like x/3 for small values of x. Since
Np is the maximum value (saturation value) of the polarization,
attained when all the dipoles are aligned with the field, the limit must
be unity for large values of the argument if the result is to have any
meaning at all. For weak fields and for temperatures which are not
too low (conditions which are satisfied in most experiments) the
approximation x/3 is sufficient and, since P is always in the direction
of E, we obtain

    \mathbf{P} = \frac{N p^2}{3kT} \mathbf{E} \qquad (25)
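
The two limiting statements about L(x) may be checked numerically. The following is a minimal sketch; the function name and the switch to the power series near x = 0 (to avoid loss of significance in coth x − 1/x) are choices made here, not part of the text.

```python
import math


def langevin(x):
    """L(x) = coth(x) - 1/x; near zero the series x/3 - x^3/45 is used instead."""
    if abs(x) < 1e-4:
        return x / 3.0 - x**3 / 45.0
    return 1.0 / math.tanh(x) - 1.0 / x


# Small arguments follow x/3; large arguments approach the saturation value 1.
for x in (0.01, 0.1, 1.0, 10.0, 100.0):
    print(f"x = {x:7.2f}   L(x) = {langevin(x):.6f}   x/3 = {x / 3.0:.6f}")
```

For x = 0.01 and 0.1 the two columns agree closely, which is the regime in which (25) holds; for x = 10 and 100 the function is already close to unity.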


That is, within certain limits, the parelectric susceptibility is also
independent of the field, and we can apply the formulae of § 2
if we replace α by p²/3kT. Taking account of the interactions of the
dipoles, equation (14) (p. 454) then gives for the parelectric part of
the dielectric constant

    \frac{K - 1}{K + 2} = \frac{4\pi N p^2}{9kT} \qquad (26)

and likewise the molecular polarization

    \frac{K - 1}{K + 2} \cdot \frac{M}{\rho} = \frac{4\pi L p^2}{9kT} \qquad (27)


In deducing the formula for the parelectric polarization we made 
the natural assumption that the dipoles could assume all possible 
orientations with respect to the field. According to the principles of 
the Quantum Theory, however, only a discrete set of angles of orien- 
tation occurs. Nevertheless, the strict quantum theory calculation 
yields only small deviations from the classical Langevin formula. Of 
much greater importance is the “directional quantization” in the
magnetic case; this will be treated below.

On account of the fact that the adjustment cannot take place freely in
them, solids exhibit parelectric susceptibility only in the neighbourhood
of the melting-point. Nevertheless, crystals composed of ions of
different signs can show a different kind of polarization which is due
to relative movement of the ions as entities. This polarization also
vanishes at optical frequencies.

Ex. 116. Given the dielectric constant K = 80 for T = 290° and the optical
index of refraction n = 1.33, calculate the dipole moment of the water molecule
assuming that the dielectric and parelectric polarizations are additive.

4. Paramagnetic, Ferromagnetic and Antiferromagnetic Susceptibility. 

Molecules are the only particles which can possess electric moments, 
since for all atoms the centre of gravity of the negative charges coincides 
with that of the positive charges. On the other hand, a free atom can 
have a magnetic moment. There is another point of difference between 
paramagnetism and parelectricity: from the fact that the paramagnetic 
susceptibility of salts does not change much when the salts go into 
solution we may conclude that the solid state offers no hindrance to the 
alignment of the dipoles. 

If we now assume that all orientations are equally allowable, we 
need only translate the formulae of § 3 (p. 457) into magnetic terms, 
with the understanding that p now represents the magnetic moment 
of the atom or molecule. Neglecting the interaction of the dipoles we 
have, according to equation (25) (p. 457), the following expression for 
the magnetization:

    \mathbf{M} = \frac{N p^2}{3kT} \mathbf{H} \qquad (28)

The susceptibility per unit volume is then given by

    \kappa = \frac{N p^2}{3kT} \qquad (29)

Division of κ by the density ρ gives the susceptibility per gramme; if
we then multiply by the molecular weight G we obtain the important
molar susceptibility

    \chi = \frac{\kappa G}{\rho} = \frac{L p^2}{3kT} = \frac{M^2}{3RT} = \frac{C}{T} \qquad (30)

Here M represents the dipole moment per mol, i.e. the moment of one
mol when the dipoles are all completely aligned; and C is called the
Curie constant, after P. Curie, who found the temperature relationship
given by (30).* The experimental determination of C consists in
finding the temperature variation of χ; from this the molar magnetic
moment is found by the equation †

    M = \sqrt{3RC} \qquad (31)

P. Weiss found ‡ that the molar magnetic moments of a large number
of substances determined in this way were all multiples of a certain
value

    M_W = 1125.5 e.m.u.,

now called the Weiss Magneton. Nevertheless, the limits of error in
these experiments are quite wide. Remarkably enough, the Bohr
theory of the atom also yields a magneton whose value referred to
one mol is, within the limits of error, exactly five times the Weiss
magneton. From the standpoint of the quantum theory, upon which
Bohr's theory is based, it is not at all to be expected that we obtain
a whole number of Bohr magnetons when evaluating the observed
Curie constants, for it is a characteristic of the quantum theory that
only a certain set of orientations of the dipole axes with respect to the
field is possible. This will be discussed in further detail in the seventh
part of this work (p. 665). Here we investigate only the especially
important case where a moment is allowed to set itself only exactly
parallel or antiparallel to the field. The distribution is governed again
by Boltzmann's principle. If N_+ is the number of moments aligned
parallel and N_- the number antiparallel to the field, then because
N_+ + N_- = N,

    N_{\pm} = N \frac{e^{\pm pH/kT}}{e^{pH/kT} + e^{-pH/kT}},

and so

    M = N_+ p - N_- p = N p \tanh\frac{pH}{kT}. \qquad (24a)

In place of the function coth x − (1/x) we have tanh x. The trend
is again that of a saturation curve, but the initial slope is 1 instead
of 1/3. In what follows we need only interpret L(x) as tanh x, and the
result corresponds to the quantum theoretical special case of two
allowed orientations.

* P. Curie, Journ. d. Phys., 4, p. 197 (1895).

† Although we now know that this evaluation is not applicable, in order to have
convenient comparison between theory and experiment we quote the results in terms
of “effective moments”, i.e. moments as given by (30).

‡ P. Weiss, Comptes rendus, 152, pp. 79, 187 (1911).
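
The two-orientation case can be sketched numerically as follows. The function names are illustrative only; x stands for pH/kT, and the populations are the Boltzmann weights of the two allowed settings, normalized so that their sum is N.

```python
import math


def two_state_populations(n_total, x):
    """Numbers of moments parallel and antiparallel to the field, x = pH/kT."""
    z = math.exp(x) + math.exp(-x)
    return n_total * math.exp(x) / z, n_total * math.exp(-x) / z


def two_state_magnetization(n_total, p, x):
    """M = N_+ p - N_- p, which should reproduce N p tanh(x) of eq. (24a)."""
    n_plus, n_minus = two_state_populations(n_total, x)
    return (n_plus - n_minus) * p


# Initial slope of M/(N p) against x: close to 1, against 1/3 for the Langevin case.
slope = two_state_magnetization(1.0, 1.0, 1e-4) / 1e-4
print(f"initial slope = {slope:.6f}")
```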

The interaction of the dipoles may be taken into account by setting
the effective microscopic field F equal to the sum of the macroscopic
field and a second quantity proportional to the magnetization; in this
case, however, the factor of proportionality 4π/3 derived on p. 454
does not give a correct representation of the experimental data.
We therefore insert a general empirical constant ν instead. With
F = H + νM, equation (28) then gives

    \mathbf{M} = \frac{N p^2}{3kT} (\mathbf{H} + \nu \mathbf{M}) \qquad (32)

for the magnetization. Solving for M, we obtain

    \mathbf{M} = \frac{N p^2 \mathbf{H}}{3k \left( T - \frac{\nu N p^2}{3k} \right)} = \frac{N p^2 \mathbf{H}}{3k (T - \Theta)}, \qquad (33)

where the quantity νNp²/3k, which has the dimensions of temperature,
is denoted by Θ. The molar susceptibility then follows the
Curie-Weiss law

    \chi = \frac{M^2}{3R (T - \Theta)} \qquad (34)
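
Since (34) makes 1/χ a linear function of T, the characteristic temperature may be read off from a straight-line fit. The sketch below does this by ordinary least squares; the data are synthetic, generated from (34) with made-up values C = 2.0 and Θ = 50 (C standing for M²/3R), and the function name is hypothetical.

```python
def fit_curie_weiss(temps, chis):
    """Fit 1/chi = (T - Theta)/C by least squares; returns (C, Theta)."""
    ys = [1.0 / chi for chi in chis]
    n = len(temps)
    t_mean = sum(temps) / n
    y_mean = sum(ys) / n
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(temps, ys))
    sxx = sum((t - t_mean) ** 2 for t in temps)
    slope = sxy / sxx                   # equals 1/C
    intercept = y_mean - slope * t_mean # equals -Theta/C
    return 1.0 / slope, -intercept / slope


# Synthetic "measurements" (not experimental data): chi = C/(T - Theta).
temps = [100.0, 150.0, 200.0, 250.0, 300.0]
chis = [2.0 / (t - 50.0) for t in temps]
c_fit, theta_fit = fit_curie_weiss(temps, chis)
print(f"C = {c_fit:.4f}, Theta = {theta_fit:.4f}")
```

The fitted Θ is the intercept of the straight line with the axis of abscissae.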


If we make a graph of 1/χ as a function of the temperature, we obtain
a straight line whose intersection with the axis of abscissae gives the
characteristic temperature Θ. If a positive value is found for Θ, there
must be a real temperature at which the susceptibility attains an
extremely large value. This is actually the case for a small group of
substances, the ferromagnetic metals. The characteristic temperature
above which these substances behave like simple paramagnetic
substances is called their Curie point. However, for the high values of
ferromagnetic susceptibilities we are no longer justified in applying
our approximate formula, but must turn to the Langevin function.
We have the following equations for M and H from which F is to be
eliminated:

    |\mathbf{M}| = N p \, L\!\left( \frac{pF}{kT} \right) \qquad (35)

    |\mathbf{M}| = \frac{F - H}{\nu} \qquad (36)

This may be done graphically as follows: Let the quantity x = pF/kT
be the abscissa and take y = |M|/Np as ordinate (fig. 1). The value
of the magnetization corresponding to a given field strength H is the
ordinate of the point of intersection of the curve y = L(x) with the
straight line

    y = \frac{kT}{\nu N p^2} x - \frac{H}{\nu N p}

The unit along the axis of y is the saturation value Np of the
magnetization. It is remarkable that under certain circumstances we
obtain, for H = 0, two points of intersection besides O. These are
symmetric about O and correspond to a high value of the magnetization.
This happens when the slope of the line is less than the initial slope
of the Langevin curve, i.e. less than 1/3. This again gives exactly the
Curie point for T. With ferromagnetic substances, a spontaneous
magnetization is possible below the Curie point even in the absence of a
macroscopic field H. Thus we have traced the phenomenon of
remanence to the internal field. The high value of the constant ν,
however, remains a puzzle; instead of the value 4π/3 it may be of the
order of magnitude of 10,000 for ferromagnetic substances.
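
The graphical construction for H = 0 can be imitated numerically. In the sketch below the straight line is written as y = (T/Θ) · s · x, where s is the initial slope of the curve (1/3 for the Langevin function, so that the line has the slope kT/νNp² = T/3Θ); the non-zero intersection is then located by bisection. The function names, the bracketing interval and the number of iterations are choices made here for illustration.

```python
import math


def langevin(x):
    """L(x) = coth(x) - 1/x, with the series x/3 - x^3/45 near zero."""
    if abs(x) < 1e-4:
        return x / 3.0 - x**3 / 45.0
    return 1.0 / math.tanh(x) - 1.0 / x


def spontaneous_magnetization(t_reduced, curve=langevin, initial_slope=1.0 / 3.0):
    """y = |M|/(N p) at H = 0 for the reduced temperature t = T/Theta."""
    if t_reduced >= 1.0:
        return 0.0  # line steeper than the curve: only the intersection at O
    if t_reduced <= 0.0:
        return 1.0  # complete saturation
    line_slope = t_reduced * initial_slope
    lo, hi = 1e-6, 1.0 / line_slope + 1.0  # curve above line at lo, below at hi
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if curve(mid) - line_slope * mid > 0.0:
            lo = mid
        else:
            hi = mid
    return curve(0.5 * (lo + hi))


for t in (0.2, 0.5, 0.8, 0.95, 1.1):
    print(f"T/Theta = {t:4.2f}   |M|/Np = {spontaneous_magnetization(t):.4f}")
```

Passing `curve=math.tanh` with `initial_slope=1.0` gives the corresponding result for the case of two allowed orientations.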

Quantum mechanics traces ν back to “exchange energies” (p. 712).
For some substances, however, this may lead to negative values of ν. In
this instance, neighbouring spins set themselves in opposite directions
as a result of the forces between them. Such antiferromagnetic
materials (e.g. many compounds of manganese) are paramagnetic at
higher temperatures. With decreasing temperature, the paramagnetism
decreases in the usual way until the Curie point (here sometimes called
the Néel point) is reached, at which time the alignment sets in. This is
accompanied by a sudden decrease in the paramagnetism.

Even with the origin of the constant ν explained, there are still
many difficulties connected with an understanding of the actual
magnetization curve. It is well known that this curve has the form of a
loop called the hysteresis loop (fig. 2). It follows that very different
values of M correspond to a given value of H, depending on the
previous treatment of the sample. One of the first questions arising is:
why does the magnetization not jump to one of the two points of
intersection of the curve and straight line in fig. 1? Considerations of
stability show that these points represent stable states while the
condition of zero magnetization is an unstable one. In reality the
spontaneous magnetization deduced from fig. 1 corresponds to the
condition of saturation attained with the highest fields. The external
fields ordinarily applied are so small in comparison with the internal
field that they produce only slight displacements of the straight lines
in fig. 1 when represented on the correct scale. These external fields
obviously serve only to overcome certain constraints that limit the
value of the magnetization, which is large even for zero field. Inasmuch
as these constraints depend strongly on the material and on its previous
treatment, a great variety of magnetization curves is obtained.

Fig. 2

The saturation condition may be changed by lowering the temper- 
ature, which corresponds to rotation of the straight lines. Measure- 
ments of the temperature-dependence of the spontaneous magnetiza- 
tion of iron and nickel are in accord with theory if the L curve in fig. 1 
is represented by tanh x. As will be seen later (p. 667), together with the
gyromagnetic anomaly factor 2 discussed in the next section, this 
means that the electrons responsible for ferromagnetism have no 
orbital moment. However, it is not correct to conclude that these are 
conduction electrons, for in the salts of the entire iron group the mutual 
disturbance of the electron orbits makes these substances behave
magnetically as though the electrons were free (see p. 669 et seq.).

The question raised above, as to why a considerable field is needed 
to produce spontaneous magnetization, is to a great extent answered 
by investigations on single crystals. The curves for various directions 
of magnetization in iron are given in fig. 3. These curves may be under- 
stood on the basis of the following considerations : There are apparently 
preferred directions of magnetization in a crystal. For the cubic lattice 
of iron, for example, this is the [100] direction — that of the edges of 
the cube. No distinction exists between a given direction and its 
opposite. Apparently at zero field there are sizable regions or domains 
oriented in one of the six preferred directions, but because of the com- 
pensating effects of opposite senses no external magnetic effects are 
noticed. When the field is applied, the domains snap into the field 
direction to the extent that it is along an edge. Since the directions 
are energetically equal, no expenditure of energy is required and the 
lining-up proceeds very readily as shown by the steepness of the curve 
for this direction. Of course, there may be a potential barrier between 
the two opposite directions, i.e. the lining-up process proceeds along 
directions corresponding to greater energy. As a result, the mag- 
netization curves are not strictly vertical at the start; also, many 
domains remain in their former orientation after removal of the field 
because the thermal energy is not large enough to surmount the poten- 
tial barrier. This is the way remanence is explained. If the field is in 
a different direction— for instance, that of the cube diagonals [111]— 
all domains line up in the directions of the edges lying nearest the 
field direction (fig. 4). Further increase of the resultant magnetization 
can occur only by the turning of domains away from the directions of
the edges, and this requires energy. As a result, the process of
magnetization goes easily up to M_s/√3 (M_s denoting the saturation
value), but beyond this point the curve must rise much less steeply.
That this really is the case is shown by fig. 3. Upon removal of the
field the rotated domains will promptly go back to the edge directions,
which correspond to minimum potential energy. This means that the
turning process, unlike the reversal process, is free of hysteresis. For
fields in the direction of the face diagonal the curve begins to bend
over at M_s/√2.
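
The factors 1/√3 and 1/√2 are simply the cosines of the angles between the field direction and the nearest cube edges. A minimal sketch, under the assumption stated above that every domain lies along the edge nearest the field (the function name is illustrative):

```python
import math


def knee_fraction(field_direction):
    """Fraction of saturation along the field when all domains lie on the nearest cube edge.

    The cosine of the angle between a direction and the nearest cube edge is the
    largest component (in magnitude) of the unit vector along that direction.
    """
    norm = math.sqrt(sum(c * c for c in field_direction))
    return max(abs(c) / norm for c in field_direction)


for name, direction in (("[100]", (1, 0, 0)), ("[110]", (1, 1, 0)), ("[111]", (1, 1, 1))):
    print(f"{name}: knee at {knee_fraction(direction):.4f} of saturation")
```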

For technical polycrystalline materials the actual curve of
magnetization is not given simply by superposition of the various crystal
effects. Rather, an additional circumstance, internal stress, exists,
and this also determines a preferred direction. The preferential
direction in a stretched iron wire is that of the tension. In a region of
uniform internal stress the actual magnetization vector orients itself
in such a way that the sum of the field energy, crystal energy and tensile
energy is a minimum. Since the stresses vary from place to place,
neighbouring domains differ in energy and the external field has the
additional effect of making individual domains grow at the expense of
others. Quantitative calculation of these ideas is quite troublesome and
the details will not be given here. It is sufficient to note that such
considerations have in recent years led to an almost complete
understanding of what seemed at one time to be a hopelessly confused
subject.

Fig. 3

Fig. 4


5. Magnetism Induced by Revolving Electrons. The Magnetomechanical 
Parallelism. Theory of Diamagnetic Susceptibility. 

We now inquire into the cause of the magnetic moment of an atom.
Basing our considerations on our present concepts of atomic structure,
we arrive at a vivid interpretation of the “elementary currents” which
Ampère assumed were the origin of atomic magnetism. According to
our modern point of view, these currents are to be identified with the
orbital motion of electrons about the nucleus. If the period of
revolution of an electron is τ sec., its orbit is equivalent to a circular
current of strength −e/τ. Taking the orbit to be plane and calling
the enclosed area S, the magnetic moment becomes

    \mathbf{p} = -\frac{e}{c} \frac{\mathbf{S}}{\tau}. \qquad (37)

Since the motion of the electron takes place under a central force, the
Law of Areas (p. 89) is valid, and we may replace S/τ by dS/dt. Now
the areal velocity is given in direction and magnitude by the
mechanical moment of momentum d divided by 2m; we then have for
the magnetic moment

    \mathbf{p} = -\frac{e}{2mc} \mathbf{d}. \qquad (38)
This important relationship is called the magnetomechanical paral- 
lelism. To every magnetic moment there corresponds a mechanical 
moment of momentum whose existence is due to the inertial mass of 
the electrons. This can be experimentally verified. Since the con- 
servation of angular momentum requires this quantity to remain con- 
stant when there are no external forces, a change in the magnetic state 
of a body must impart to it a rotational motion whose angular mo- 
mentum is equal and opposite to that of the electrons. This pheno- 
menon is known as the Richardson-Einstein-de Haas Effect, and is 
actually observed. The magnetic state of the body may be altered by 
magnetizing it or by destroying its magnetism by heating it above 
the Curie point. Formula (38) may be checked by observing the values
of magnetic moment and moment of momentum; surprisingly enough,
the factor on the right comes out double the value e/2mc for all
ferromagnetic substances; for salts of the rare earths, for which the effect
is so small that it is just measurable, a value between one and two
times this factor results. We shall take up this significant anomaly in
detail later (p. 667).
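
To give an idea of the magnitudes involved, the factor e/2mc of (38) may be evaluated in Gaussian units. The numerical constants below are rounded standard values supplied here as assumptions; they do not appear in the text.

```python
# Rounded constants in c.g.s. (Gaussian) units; assumed values, not taken from the text.
E_CHARGE = 4.803e-10      # elementary charge, e.s.u.
E_MASS = 9.109e-28        # electron mass, gm.
LIGHT_SPEED = 2.998e10    # velocity of light, cm./sec.

# Ratio of magnetic moment to moment of momentum for orbital motion, eq. (38).
orbital_ratio = E_CHARGE / (2.0 * E_MASS * LIGHT_SPEED)

# The ferromagnetic substances give double this value.
anomalous_ratio = 2.0 * orbital_ratio

print(f"e/2mc         = {orbital_ratio:.3e}")
print(f"double e/2mc  = {anomalous_ratio:.3e}")
```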

The inverse effect — magnetization by rotation — was first observed 
by S. J. Barnett in 1914, shortly before the discovery of the direct 
effect. 

If an atom has several electrons, it is possible for the mechanical 
angular momenta — and hence the magnetic moments — to mutually 
compensate each other. In such an atom, the field gives rise to a moment 
opposite in direction to the field; its magnitude will be calculated 
immediately. Naturally, this induced moment appears also in para- 
magnetic atoms, but is small compared with the moment already 
existing, and so may be neglected in general. According to the theorem 
of Larmor mentioned above and derived on p. 664, the magnetic field
brings about an added rotation of the entire electron system around
an axis through the nucleus parallel to H. The angular velocity of
this motion is given in direction as well as in magnitude by

    \boldsymbol{\omega} = \frac{e}{2mc} \mathbf{H}. \qquad (39)

This rotation gives rise to an additional angular momentum

    \mathbf{d} = m \sum_i [\mathbf{r}_i [\boldsymbol{\omega} \mathbf{r}_i]] = m \sum_i \left( \boldsymbol{\omega} r_i^2 - \mathbf{r}_i (\boldsymbol{\omega} \cdot \mathbf{r}_i) \right). \qquad (40)

The summation is to be extended over all electrons. The radius vector
of the ith electron, referred to the nucleus, is r_i. Take the z-axis of a
rectangular system of co-ordinates in the direction of the field. Then
the z-component of this angular momentum, which is the only
component having a non-vanishing resultant, becomes

    d_z = m\omega \sum_i (r_i^2 - r_i^2 \cos^2\theta_i) = m\omega \sum_i (x_i^2 + y_i^2). \qquad (41)

Since the field has no aligning effect, all orientations of the atom are,
on the average, equally probable, and since x_i² + y_i² + z_i² = r_i²,

    \overline{x_i^2} = \overline{y_i^2} = \overline{z_i^2} = \frac{\overline{r_i^2}}{3}. \qquad (42)

Hence by (38) the average induced magnetic moment is

    \bar{p} = \bar{p}_z = -\frac{e\omega}{3c} \sum_i \overline{r_i^2} = -\frac{e^2}{6mc^2} \sum_i \overline{r_i^2} \, H, \qquad (43)

and the molar susceptibility is then given by

    \chi = -\frac{L e^2}{6mc^2} \sum_i \overline{r_i^2}. \qquad (44)

As one sees, the negative sign of the diamagnetic susceptibility results
correctly; in addition, the mean orbital radii computed from (44)
agree well with the values obtained from other data.
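
The order of magnitude given by (44) is easily estimated. In the sketch below the constants are rounded standard values in Gaussian units and the sum of mean square radii, 10⁻¹⁶ cm.² (one square ångström), is a purely illustrative figure; none of these numbers is taken from the text.

```python
# Rounded constants in c.g.s. (Gaussian) units; assumed values, not from the text.
AVOGADRO = 6.022e23       # L, molecules per mol
E_CHARGE = 4.803e-10      # e.s.u.
E_MASS = 9.109e-28        # gm.
LIGHT_SPEED = 2.998e10    # cm./sec.


def molar_diamagnetic_susceptibility(sum_mean_r2):
    """Eq. (44): chi = -L e^2 / (6 m c^2) * sum of mean square radii (cm.^2)."""
    return -AVOGADRO * E_CHARGE**2 / (6.0 * E_MASS * LIGHT_SPEED**2) * sum_mean_r2


def sum_mean_r2_from_chi(chi):
    """Inverse use of (44): sum of mean square radii from a measured chi."""
    return -chi * 6.0 * E_MASS * LIGHT_SPEED**2 / (AVOGADRO * E_CHARGE**2)


chi_example = molar_diamagnetic_susceptibility(1.0e-16)  # illustrative input only
print(f"chi = {chi_example:.3e} per mol")
```

The second function corresponds to the use mentioned in the text, namely computing mean orbital radii from observed susceptibilities.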



CHAPTER XXVI 

Phenomenological Theory of Superconductivity 

1. The Fundamental Equations. 

The superconducting state is characterized chiefly by the sudden 
drop in the electrical resistance of a conductor when it is cooled to
temperatures near the absolute zero. This effect was discovered by
H. Kamerlingh Onnes in 1911. The decline from the finite resistance
corresponding to the temperature of the conductor to the immeasurably
small value characteristic of this phenomenon takes place in an interval
of only a few hundredths of a degree. The midpoint of this interval is
called the critical transition temperature. For pure metals this lies
between 0.1 and 10 K., while for metallic conducting compounds such
as hydrides and nitrides of metals it falls somewhat higher. Since new 
superconductors are continually being discovered, it is feasible to state 
only the groups of elements among which no indication of super- 
conductivity has yet been found. These are the elements of the first 
column of the periodic table (the alkali metals and Cu, Ag and Au) 
and the ferromagnetic metals. 

The vanishing of the electrical resistance is not the only charac- 
teristic of superconductivity. A second circumstance, discovered by 
Meissner and Ochsenfeld, may be roughly described as the vanishing 
of the permeability. If a sample which is already in the superconducting 
state is brought into a magnetic field, current will be induced in the 
surface layers in such manner as to cancel the field within the sample. 
The external magnetic field behaves as if the material had zero per- 
meability. However, if the sample is cooled below the transition 
temperature while in the magnetic field, nothing should occur according 
to Maxwell’s theory. Meissner found, nevertheless, that exactly the 
same field distribution set in as in the first instance. One might expect 
to deduce the behaviour of superconductors by setting σ = ∞ and
μ = 0 in Maxwell's equations. This does not work.

Although no satisfactory model has yet been suggested, London and
later von Laue found a way to supplement the Maxwell equations so as
to represent correctly the phenomena and to describe them in terms of
a single material constant which depends on the temperature. For the
present, the Maxwell equations will be written as they stand, with ε set
equal to unity for metals. We retain μ for the permeability at the
transition temperature, having in mind the fact that certain phenomena
have not been completely explained, although the most recent results
indicate that μ may always be taken equal to unity.

The Maxwell equations may now be written

    \operatorname{curl} \mathbf{H} = \frac{1}{c} \frac{\partial \mathbf{E}}{\partial t} + \frac{4\pi}{c} \mathbf{i} \qquad (1)

    \operatorname{curl} \mathbf{E} = -\frac{1}{c} \frac{\partial \mathbf{B}}{\partial t} \qquad (2)

    \operatorname{div} \mathbf{E} = 4\pi\rho \qquad (3)

    \operatorname{div} \mathbf{B} = 0 \qquad (4)

    \mathbf{B} = \mu \mathbf{H} \qquad (5)


The new feature is that i and ρ are each to be broken down into a
normal and a “super” part. The behaviour of superconductors in
high-frequency fields shows that there is an ohmic contribution to the
current at temperatures below as well as above the transition
temperature. It appears that only a portion of the electron ensemble
participates in superconduction, while the remainder behaves in normal
fashion. Accordingly we take

    \mathbf{i} = \mathbf{i}_n + \mathbf{i}_s; \qquad \rho = \rho_n + \rho_s.

The following equation of continuity holds for each part independently:

    \operatorname{div} \mathbf{i}_n + \frac{\partial \rho_n}{\partial t} = 0; \qquad \operatorname{div} \mathbf{i}_s + \frac{\partial \rho_s}{\partial t} = 0. \qquad (6)

As before, we put for the normal current

    \mathbf{i}_n = \sigma \mathbf{E}. \qquad (7)

From the fact that for an electron subject to no “frictional”
resistance the acceleration will be proportional to the field intensity,
London assumed

    \frac{\partial}{\partial t} (\Lambda \mathbf{i}_s) = \mathbf{E}, \qquad (8)

where Λ represents a new constant of the material which has a value
dependent upon the temperature.

For the relation of the super-current and the magnetic induction
(field intensity), a relation is set up which, in principle, amounts to
the application of the considerations of p. 465 to the case of
macroscopic circulation of current:

    c \operatorname{curl} (\Lambda \mathbf{i}_s) = -\mathbf{B} \qquad (9)

It should again be emphasized that these depictions by means of
models are meant only to serve as heuristic principles and that it has
not yet been found possible to incorporate them into a definitive theory.

To the field equations must also be added the boundary conditions.*
Using the surface density of charge σ, the boundary conditions may be
written

    \operatorname{Div} \mathbf{B} = 0
    \operatorname{Div} \mathbf{E} = 4\pi (\sigma_n + \sigma_s)
    \operatorname{Div} \mathbf{i}_n + \frac{\partial \sigma_n}{\partial t} = 0
    \operatorname{Div} \mathbf{i}_s + \frac{\partial \sigma_s}{\partial t} = 0 \qquad (10)

At the boundary between two superconductors an additional
boundary condition must be added. From equation (9), because of the
finite nature of B, this may be written

    \operatorname{Curl} (\Lambda \mathbf{i}_s) = 0 \quad \text{or} \quad (\Lambda \mathbf{i}_s)_t = (\Lambda' \mathbf{i}'_s)_t, \qquad (11)

the subscript t denoting the tangential component and the prime the
second superconductor.

2. Steady Fields. 

Let us first apply these equations to the simplest case, that of steady
fields, where all time differentials may be set equal to zero. According
to equation (8), E will then vanish; hence, by (7), i_n will also vanish
and we have i = i_s, so that there is a steady super-current. As in the
case of a normal conductor,

    \operatorname{curl} \mathbf{H} = \frac{4\pi}{c} \mathbf{i} \qquad (12)

However, according to (5) and (9), the magnetic field and the current
are related by another equation

    \operatorname{curl} (\Lambda \mathbf{i}) = -\frac{\mu}{c} \mathbf{H} \qquad (13)

In a superconductor at constant temperature, Λ is independent of
position and we may write

    \operatorname{curl} \mathbf{i} = -\frac{\mu}{c\Lambda} \mathbf{H} \qquad (13')

* Div = surface divergence (see p. 44). 




Taking the curl of (12) and using (13') and remembering that div μH = 0
and div i = 0, we get

    \operatorname{curl} \operatorname{curl} \mathbf{H} = \frac{4\pi}{c} \operatorname{curl} \mathbf{i} = -\frac{4\pi\mu}{c^2 \Lambda} \mathbf{H}. \qquad (14)

Similarly,

    \operatorname{curl} \operatorname{curl} \mathbf{i} = -\frac{\mu}{c\Lambda} \operatorname{curl} \mathbf{H} = -\frac{4\pi\mu}{c^2 \Lambda} \mathbf{i}. \qquad (15)

Imagine a superconductor of infinite extent bounded by the plane
z = 0 and filling all space on the positive side of the z-axis. Let the
space where z < 0 be a vacuum where there exists a constant magnetic
field H_0. By virtue of (10) and because

    \operatorname{div} \mathbf{H} = \frac{\partial H_x}{\partial x} + \frac{\partial H_y}{\partial y} + \frac{\partial H_z}{\partial z} = 0,

H_z must be constant in both regions. But H_z = const. is not a solution
of (14) except for H_z = 0. This means that H can have only a tangential
component, which we take in the y-direction. Thus

    For z < 0,  H_z = H_x = 0;  H_y = H_0.

    For z > 0,  H_z = H_x = 0;  H_y = f(z).

Because of the continuity of H_y, and by virtue of (14), we shall have in
the superconductor

    H_y = H_0 e^{-\beta z}, \quad \text{where} \quad \beta^2 = \frac{4\pi\mu}{c^2 \Lambda}. \qquad (16)

From (12), the current density is given by

    i_x = -\frac{c}{4\pi} \frac{dH_y}{dz} = \frac{\beta c}{4\pi} H_0 e^{-\beta z}; \quad i_y = 0; \quad i_z = 0. \qquad (17)

In the superconductor, the current as well as the magnetic field thus
falls off exponentially with depth. The penetration depth 1/β is
proportional to √Λ. Measurements always lead to an order of magnitude
of … for a temperature of 0.5° below the transition
point. The Meissner effect is thus accounted for.
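
That (16) and (17) hang together may be verified numerically. The sketch below checks by finite differences that H_y = H_0 e^{−βz} satisfies d²H_y/dz² = β²H_y (the form which (14) takes for this geometry) and that i_x of (17) equals −(c/4π) dH_y/dz. All numbers are in arbitrary units and are chosen for illustration only; in particular β is not a measured value.

```python
import math


def field(z, h0, beta):
    """Eq. (16): H_y(z) = H_0 exp(-beta z) inside the superconductor (z > 0)."""
    return h0 * math.exp(-beta * z)


def current(z, h0, beta, c):
    """Eq. (17): i_x(z) = (beta c / 4 pi) H_0 exp(-beta z)."""
    return beta * c / (4.0 * math.pi) * field(z, h0, beta)


# Illustrative numbers in arbitrary units; none of them is taken from the text.
H0, BETA, C = 1.0, 2.0, 1.0
z, step = 0.7, 1e-4

# Central differences for the first and second derivatives of H_y.
d1 = (field(z + step, H0, BETA) - field(z - step, H0, BETA)) / (2.0 * step)
d2 = (field(z + step, H0, BETA) - 2.0 * field(z, H0, BETA)
      + field(z - step, H0, BETA)) / step**2

print("d2H/dz2 divided by H:", d2 / field(z, H0, BETA), " beta squared:", BETA**2)
print("-(c/4 pi) dH/dz:", -C / (4.0 * math.pi) * d1, " i_x from (17):", current(z, H0, BETA, C))
```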

3. Optical Behaviour of Superconductors. 

We next consider the ratio of super- to normal current in
high-frequency (optical) fields. Because of the connexion of the two
through the electrical field intensity E (equations (7) and (8)) we
always have

    \mathbf{i}_n = \sigma \frac{\partial}{\partial t} (\Lambda \mathbf{i}_s). \qquad (18)

If we put

    \mathbf{i}_s = \mathbf{i}_{s,0} e^{i\omega t}, \qquad (19)

then

    \mathbf{i}_n = \sigma\omega\Lambda \, \mathbf{i}_{s,0} e^{i(\omega t + \pi/2)}. \qquad (20)

The normal current leads the super-current by a phase angle π/2.
The amplitude ratio is given by the product σωΛ. Taking for the order
of magnitude of Λ the depth of penetration mentioned above, the
amplitude of the normal current turns out to be a hundred times that
of the super-current even for the infra-red region. This is in harmony
with the negative result of all experimental attempts to detect optical
changes when superconduction sets in. This circumstance compels us to
assume that a normal current exists in addition to the super-current.
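
The phase and amplitude relations between the two currents follow at once from (18) when complex amplitudes are used, since differentiation with respect to the time multiplies e^{iωt} by iω. A minimal sketch; the numerical values of σ, ω and Λ are arbitrary illustrative numbers, not data from the text.

```python
import cmath
import math


def normal_current_amplitude(sigma, omega, lam, i_s0):
    """Complex amplitude of i_n = sigma d/dt (Lambda i_s) for i_s = i_s0 exp(i omega t)."""
    return 1j * omega * sigma * lam * i_s0


# Arbitrary illustrative values (not taken from the text).
sigma, omega, lam, i_s0 = 3.0, 5.0, 0.2, 1.0

i_n0 = normal_current_amplitude(sigma, omega, lam, i_s0)
phase_lead = cmath.phase(i_n0 / i_s0)   # should be pi/2
amplitude_ratio = abs(i_n0 / i_s0)      # should be sigma * omega * Lambda

print(f"phase lead = {phase_lead:.6f} (pi/2 = {math.pi / 2.0:.6f})")
print(f"amplitude ratio = {amplitude_ratio:.6f}")
```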

Although the Laue-London theory embraces a complete picture of 
the phenomena, elaborate computations have not yet succeeded in 
supplying an atomistic theory of the superconductivity constant Λ
along lines similar to our interpretation of the material constants σ, ε,
and μ as presented in the previous sections. To this extent, the present
topic still belongs to the phenomenological part of electricity. How- 
ever, it was presented here for two reasons: Firstly, certain atomistic 
notions, such as the acceleration of electrons, were used in setting up 
the field equations; secondly, there is the hope that an addendum 
concerned with an atomistic verification may be forthcoming as the 
subject develops. 



CHAPTER XXVII 

The Electrodynamics of Moving Bodies 


1. Electromagnetic Induction in moving Bodies from the Standpoint of 
the Electron Theory. 

An exact formulation of all electromagnetic phenomena in moving 
bodies is possible only with the aid of the Theory of Relativity; in 
fact, consideration of these phenomena formed the starting-point of 
this theory. Nevertheless, the Electron Theory is of such great service 
in representing most of the phenomena that, for the sake of sim- 
plicity of deduction, it seems advisable to consider matters from this 
standpoint first, rather than to begin at once to attack these problems 
with the heavy ordnance of the mathematics of the relativity theory. 
This is particularly so in view of the fact that up to the present time 
only a few experiments are known which are sufficiently accurate to 
detect the small differences in the results of the two theories (p. 476). 
The Electron Theory assumes that the electromagnetic field is located 
in a stationary “ ether ”, i.e. in an absolute space pervading all matter, 
while charge, polarization and magnetization are bound to the material 
bodies and share in their motions. In our consideration of relativistic 
mechanics we pointed out that the existence of an all-pervading ether 
is disproved by the results of three experiments whose accuracy is suffi- 
cient to settle the matter. As a consequence, all results of the follow- 
ing calculations are merely approximations, but very useful ones never- 
theless. If we write the field equations in the form * 

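$$\operatorname{curl}\mathbf{H} = \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t} + \frac{4\pi}{c}\frac{\partial\mathbf{P}}{\partial t} + \frac{4\pi}{c}\,(\rho_+\mathbf{v}_+ - \rho_-\mathbf{v}_-), \qquad (1)$$

$$\operatorname{curl}\mathbf{E} = -\frac{1}{c}\frac{\partial\mathbf{H}}{\partial t} - \frac{4\pi}{c}\frac{\partial\mathbf{M}}{\partial t}, \qquad (2)$$

$$\operatorname{div}\mathbf{E} = 4\pi(\rho_+ - \rho_-), \qquad (3)$$

$$\operatorname{div}\mathbf{H} = 0, \qquad (4)$$

the terms $\partial\mathbf{P}/\partial t$, $\partial\mathbf{M}/\partial t$, $\rho$ and $\rho\mathbf{v}$ are affected by the motions of material
bodies.

* This form differs from the usual one, in which the conduction current
term appears explicitly. The present form is chosen for the reason that, for
example, there is a conduction current even when the space charge in (3) is zero.

In addition to these equations we have the formula for the
force exerted by an electromagnetic field on a moving charge $e$:

$$\mathbf{F} = e\mathbf{E} + \frac{e}{c}[\mathbf{v}\,\mathbf{H}]. \qquad (5)$$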

Thus, while a magnetic field exerts no force on a charge at rest, a force 
comes into being as soon as the charge is set in motion. If we look upon 
every force acting on an electrical charge as an electrical force, we 
have an “ induced electrical field strength ” 

$$\mathbf{E}' = \frac{1}{c}[\mathbf{v}\,\mathbf{H}]. \qquad (5')$$

We now consider a variety of experiments from this point of view. 


(a) Ordinary induction in a moving coil 

The Law of Induction embodied in the second Maxwell equation 
is based upon the assumption that the change in flux consists of a 

time variation of the magnetic field, the path 
of integration being held fixed. Nevertheless, 
the law of induction is also applied without 
further thought to the case in which the flux 
is altered by moving the conductor while the 
field remains constant. With the help of equa- 
tion (4) we shall show that this is really justified. 
If the induced electric field in (5') is integrated 
along a conducting loop there results 



$$\oint \mathbf{E}'\,d\mathbf{s} = \frac{1}{c}\oint [\mathbf{v}\,\mathbf{H}]\,d\mathbf{s}. \qquad (6)$$


[Fig. 1]

Now it may be shown readily that the right member is equal to the
rate of change of the magnetic flux accompanying the motion of the
conductor. Let us consider two neighbouring positions of the loop at 
times t and t + dt. The change in position may also involve a defor- 
mation. Connect corresponding points of the two contours by displace- 
ment vectors v dt. We thus complete a closed surface over which we 
take the integral of the magnetic field strength. Since the divergence 
of $\mathbf{H}$ vanishes in a uniform medium of permeability $\mu$, this surface
integral is zero, by Gauss’s theorem. Now the element of the lateral 
surface (see fig. 1) may be expressed as 

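$$d\mathbf{S}_{\text{side}} = -[\mathbf{v}\,d\mathbf{s}]\,dt,$$

so that

$$0 = -\int_{t}\mathbf{H}\,d\mathbf{S} + \int_{t+dt}\mathbf{H}\,d\mathbf{S} - dt\oint_{\text{side}}\mathbf{H}\,[\mathbf{v}\,d\mathbf{s}].$$

Hence

$$d\Phi = \int_{t+dt}\mathbf{H}\,d\mathbf{S} - \int_{t}\mathbf{H}\,d\mathbf{S} = -dt\oint [\mathbf{v}\,\mathbf{H}]\,d\mathbf{s}, \qquad (7)$$

i.e.

$$\oint [\mathbf{v}\,\mathbf{H}]\,d\mathbf{s} = -\frac{d\Phi}{dt}. \qquad (8)$$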

In a medium of permeability $\mu$ the force on the moving charge is $\mu$
times as large, so that we obtain $\mu$ times the induced potential and the
integral in (8) represents the flux of induction. Thus it amounts to 
the same thing if the change in time of the induction flux takes place 
through a change of field for a fixed loop, or by the motion of the loop 
in a constant field — a fact tacitly accepted in electrical engineering. 
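
Relation (8) can be checked numerically in a simple case. The following is a minimal sketch, not part of the original argument: it assumes a rigid rectangular loop in the xy-plane moving along x through a field directed along z, and compares the circulation of [v H] with the rate of change of flux. The field profile `h_z` and all function names are illustrative assumptions.

```python
# Sketch: check relation (8) for a rigid rectangular loop moving along x.
# Assumptions (not from the text): loop in the xy-plane spanning x0..x0+a and
# 0..b, velocity (v, 0, 0), field H = (0, 0, h_z(x)) in arbitrary units.

def h_z(x):
    """Hypothetical non-uniform field profile."""
    return 2.0 + 0.5 * x + 0.1 * x * x


def flux(x0, a, b, n=2000):
    """Flux of H through the loop, by the midpoint rule in x."""
    dx = a / n
    return b * dx * sum(h_z(x0 + (k + 0.5) * dx) for k in range(n))


def circulation_v_cross_h(x0, a, b, v):
    """Closed line integral of [v H] . ds, traversed counter-clockwise.

    Here [v H] = (0, -v * h_z, 0), so only the two sides parallel to the
    y-axis contribute.
    """
    right_side = -v * h_z(x0 + a) * b   # ds points along +y at x = x0 + a
    left_side = v * h_z(x0) * b         # ds points along -y at x = x0
    return right_side + left_side


def flux_rate(x0, a, b, v, dt=1e-4):
    """Central-difference estimate of dPhi/dt as the loop moves."""
    return (flux(x0 + v * dt, a, b) - flux(x0 - v * dt, a, b)) / (2.0 * dt)


lhs = circulation_v_cross_h(1.0, 2.0, 3.0, 4.0)
rhs = -flux_rate(1.0, 2.0, 3.0, 4.0)
print(lhs, rhs)  # the two numbers should agree closely
```

In a uniform field both quantities vanish for a rigid translation, which is why a non-uniform profile is used here.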

(b) Dielectric polarization induced in insulators

H. A. Wilson performed the experiment of rotating a hollow cylinder 
of a dielectric substance in a magnetic field parallel to the axis. The 
induced electric field acting on the quasi-elastically bound electrons 
of the substance displaces them relative to the nuclei and causes a 
resultant polarization. Neglecting the internal field, this quantity is 

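$$\mathbf{P} = N\alpha\,\frac{1}{c}[\mathbf{v}\,\mathbf{H}], \qquad (9)$$

where $N$ is the number of atoms per unit volume and $\alpha$ their polarizability.
Using the relationship between $N\alpha$ and $K$ (cf. p. 452), we obtain

$$\mathbf{P} = \frac{K-1}{4\pi}\,\frac{1}{c}[\mathbf{v}\,\mathbf{H}]. \qquad (10)$$

In our case $\mathbf{P}$ is radial and has the magnitude

$$P = \frac{(K-1)\,vH}{4\pi c}. \qquad (10')$$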

The rotating dielectric has thus assumed a volume polarization and
is to be compared with a permanently polarized body or “electret”.
There are free charges on the surface, their surface density $\sigma$ being
given by the magnitude of $\mathbf{P}$, according to p. 281. The detection of
these charges is accomplished as follows: the inner and outer surfaces
of the cylinder are covered with metal. For the present case it is
immaterial whether the metal coatings rotate along with the cylinder
or not, for the magnetic effects due to the moving charges are not
considered here. During the rotation, the two metal coverings are
connected together and earthed. Then the total flux emanating from the
surface of the dielectric ends on the coatings, i.e. there is induced upon
them a charge equal and opposite to the free charge on the surface of the
dielectric. If we now break the connexion between the coatings and
annul the volume polarization of the dielectric by stopping the rotation
or cutting off the magnetic field, a surface charge of density $\sigma = |\mathbf{P}|$
remains on the coatings. This charge gives rise to a potential difference
between the coatings which can be measured by an electrometer. This
difference of potential depends on the capacity of the cylindrical condenser
and of the electrometer. The measurements confirm the predictions
of the theory.
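
To give a feeling for the size of the effect, the sketch below evaluates the radial polarization, taken here in the form P = (K - 1) v H / (4 pi c) with v = omega r, in Gaussian units. The numerical values (dielectric constant, speed of rotation, radius, field strength) are hypothetical and are not those of Wilson's apparatus.

```python
import math

C_CM_PER_S = 2.99792458e10  # speed of light in cm/s (Gaussian units)


def wilson_polarization(k_dielectric, omega, radius_cm, h_gauss):
    """Radial polarization (K - 1) v H / (4 pi c) at a given radius, v = omega * r."""
    v = omega * radius_cm
    return (k_dielectric - 1.0) * v * h_gauss / (4.0 * math.pi * C_CM_PER_S)


# Hypothetical values: K = 3.5, 100 revolutions per second, r = 3 cm, H = 1000 gauss.
p = wilson_polarization(3.5, 2.0 * math.pi * 100.0, 3.0, 1000.0)
print(p)  # surface charge density |P| in electrostatic units per cm^2
```

The smallness of the result, due to the factor v/c, indicates why a sensitive electrometer is required.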


(c) W. Wien's experiment

The displacement of charge (i.e. polarization) generated in atoms 
by motion in a magnetic field may be detected in a particularly favourable
optical case. As discovered by J. Stark, the spectral lines of
hydrogen are split up into numerous components in an electric field. 
This is the so-called Stark effect. Wien caused hydrogen canal rays 
to pass through a magnetic field which was normal to their direction 
of motion. This is equivalent to an electrical field normal to the plane 
of v and H and of magnitude vH/c. By observing in the direction of 
the magnetic field one actually sees a pattern identical with the Stark 
effect pattern obtained in transverse observation in an electric field.
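
The equivalent field vH/c is easily put into practical units. The following sketch assumes Gaussian units for the calculation and converts the result to volts per centimetre; the canal-ray speed and the field strength are illustrative values only, not data from Wien's experiment.

```python
C_CM_PER_S = 2.99792458e10          # speed of light, cm/s
VOLT_PER_STATVOLT = 299.792458      # 1 statvolt expressed in volts


def equivalent_field_statvolt_per_cm(v_cm_per_s, h_gauss):
    """Magnitude v H / c of the induced field for v normal to H (Gaussian units)."""
    return v_cm_per_s * h_gauss / C_CM_PER_S


def to_volt_per_cm(e_statvolt_per_cm):
    """Convert a field strength from statvolt/cm to volt/cm."""
    return e_statvolt_per_cm * VOLT_PER_STATVOLT


# Illustrative values: v = 5e7 cm/s, H = 1e4 gauss.
e_gauss = equivalent_field_statvolt_per_cm(5.0e7, 1.0e4)
print(e_gauss, to_volt_per_cm(e_gauss))
```

With these assumed values the equivalent field is of the order of several thousand volts per centimetre, which is comparable with fields used for observing the Stark effect directly.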

2. Magnetic Effects of Moving Charges. 

(a) Rowland's experiment 

The next experiment which suggests itself is the testing of the 
equivalence of a mechanically moved charge with an electric current. 
This is accomplished by imparting a large velocity to a charged body 
and detecting the magnetic effects of this artificial rotational current. 
H. A. Rowland set a charged, insulated metal plate into rapid rotation 
and obtained the magnetic effect given by the third term of the right 
member of equation (1) (p. 471). 

(b) The experiments of Röntgen and Eichenwald

Röntgen and Eichenwald were able to show that the charges
residing on the surfaces of a polarized dielectric ($\sigma = |\mathbf{P}|$) correspond
to a current when the body is set into motion. To demonstrate this,
Röntgen caused the dielectric between the plates of a charged condenser
to rotate; the magnetic effect calculated from the surface
density of charge and the rotational speed was found to agree with 
the observed value. 

Using another arrangement, the same investigators were able to 
show that the polarization could be altered by mechanical motion; 
this change causes a magnetic field corresponding to the second term 
of equation (1) (p. 471). In this experiment we again make use of a 
rotating dielectric disc, but the condenser plates are divided into two
halves along a diameter, and adjacent parts are held at opposite
potentials (fig. 2). Thus the sign of the polarization at any point
reverses when this portion crosses the line of separation, so that a
vertical “displacement current” flows in the separating layer. The
existence of this current was detected by its magnetic effects.

[Fig. 2]

(c) The experiment of Trouton and Noble


Trouton and Noble suspended a parallel plate condenser in such a
manner that it could turn freely about an axis parallel to the plates.*
The effect to be expected is that, as a result of the translatory motion
of the earth, the charged condenser will turn into a position in which
the plates are parallel to the direction of this motion. In order to see
that, if we use the ether concept, the motion of the earth would cause a
torque, it is convenient to imagine the plate charges $e$ concentrated in
two small spheres whose separation is equal to that of the condenser
plates (fig. 3). Let the density of charge in these spheres be $\rho$. The
translatory motion of the charge $+e$ due to the earth's motion corresponds
to a current element $d\mathbf{s}$ for which

$$I\,d\mathbf{s} = e\mathbf{v}. \qquad (11)$$ †

[Fig. 3]

According to the Biot-Savart Law, this current element gives rise to
a magnetic field at the position of the negative charge. This field is
given by

$$\mathbf{H} = \frac{I\,[d\mathbf{s}\,\mathbf{r}]}{c\,r^3} = \frac{e\,[\mathbf{v}\,\mathbf{r}]}{c\,r^3}, \qquad (12)$$

where $\mathbf{r}$ is the vector drawn from the positive to the negative charge.
Thus the field is normal to the plane of $\mathbf{v}$ and $\mathbf{r}$. The force with which
this field acts on the negative charge, which is also moving with
velocity $\mathbf{v}$, is given by

$$\mathbf{F} = -\frac{e}{c}[\mathbf{v}\,\mathbf{H}] = -\frac{e^2}{c^2 r^3}\,[\mathbf{v}\,[\mathbf{v}\,\mathbf{r}]]. \qquad (13)$$

This force, which is normal to the plane of $\mathbf{H}$ and $\mathbf{v}$, is of magnitude

$$F = \frac{e^2 v^2 \sin\theta}{c^2 r^2}, \qquad (14)$$

where $\theta$ is the angle between $\mathbf{v}$ and $\mathbf{r}$.

* F. T. Trouton and H. R. Noble, Phil. Trans., A, 202, p. 165 (1903); Proc. Roy. Soc.,
72, p. 132 (1903).

† If the spheres are divided into slices normal to $\mathbf{v}$, each of thickness $ds$,
each slice furnishes a current element $I\,ds$ equal to $\rho v$ times its volume. Integration
over all the slices yields the volume of the sphere multiplied by $\rho v$, whence (11) follows.

Similarly, an equal and opposite force acts upon the positive charge 
as a result of the field caused by the motion of the negative charge. 
The two forces together furnish a torque parallel to the axis of rotation 
and of magnitude 

$$M = Fr\cos\theta = \frac{e^2}{r}\,\frac{v^2}{c^2}\sin\theta\cos\theta = \frac{e^2}{2r}\,\frac{v^2}{c^2}\sin 2\theta. \qquad (15)$$


This turning moment tends to turn the plates parallel to the “ ether 
wind ”. Now $e^2/r$ is the electrostatic energy $U_{el}$ of the system. Inserting
this value in equation (15) we obtain the correct formula for the
parallel plate condenser, except for a factor 1/2, which is due to the
simplifying assumptions made regarding the distribution of charge.
The correct formula is

$$M = U_{el}\,\frac{v^2}{c^2}\sin 2\theta. \qquad (16)$$

Thus a condenser oriented at an angle $\pi/4$ with the direction of the
earth's orbit should be subject to a torque $U_{el}(v/c)^2$ when charged.
A moment of this magnitude should be easily observable, yet even
the most delicate experiments, especially those conducted by Tomaschek,
failed to give any indication of such a torque. The result of the
Trouton-Noble experiment, which is one of the very few experiments
permitting of observation of quantities of the second order in $v/c$, contradicts
the theory of a stationary ether.
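
The order of magnitude of the expected torque may be illustrated as follows. This is a sketch only: it takes the condenser formula in the form M = U_el (v/c)^2 sin 2θ, which reproduces the value U_el (v/c)^2 quoted for θ = π/4, and uses a round value of 30 km/s for the orbital speed of the earth. The function name and the unit energy are assumptions made for the illustration.

```python
import math

C_KM_PER_S = 299792.458  # speed of light, km/s


def trouton_noble_torque(u_el, v_km_per_s, theta):
    """Torque U_el (v/c)^2 sin(2 theta) expected on the stationary-ether view."""
    beta = v_km_per_s / C_KM_PER_S
    return u_el * beta ** 2 * math.sin(2.0 * theta)


# Earth's orbital speed, roughly 30 km/s; electrostatic energy in arbitrary units.
for theta_deg in (0, 15, 30, 45, 60, 90):
    torque = trouton_noble_torque(1.0, 30.0, math.radians(theta_deg))
    print(theta_deg, torque)
```

The torque is largest at 45 degrees and is then about one hundred-millionth of the electrostatic energy, in keeping with its being an effect of the second order in v/c.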


(d) Variability of the mass of the electron 

The magnetic field of a moving point charge represents a certain 
energy distributed in space, the generation of which requires a certain 
amount of work; i.e. the electron possesses inertia which, as calculation 
shows, increases with increasing velocity. But since the mass is mani- 
fested only by its inertia, this indicates that the inertial mass of an 
electron increases with velocity. The amount of this increase is different 
for acceleration in the direction of motion and normal to this direction.
We therefore speak of longitudinal mass and transverse mass.
According to a calculation made by Abraham, these quantities for a
rigid spherical electron are:

longitudinal mass

$$m_l = m_0\left(1 + \frac{6}{5}\,\frac{v^2}{c^2} + \dots\right), \qquad (17)$$

transverse mass

$$m_t = m_0\left(1 + \frac{2}{5}\,\frac{v^2}{c^2} + \dots\right). \qquad (18)$$


Again, the lowest power of v/c which appears is the second. In addi- 
tion to our previous experiments this phenomenon may also be called 
upon to decide for or against the concept of a stationary ether. As 
mentioned on p. 430, this variability of electron mass has been 
observed for some time. The theory of relativity leads to a simpler 
formula which also contains correction terms in (v/c) 2 and which 
deviates from the above by about twenty per cent. The most exact 
measurements have decided unanimously against the formulae (17) and (18)
derived from the ether concept, and in favour of the relativistic formula. 
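
The deviation of about twenty per cent can be made explicit by comparing the velocity-dependent corrections. The sketch below assumes Abraham's second-order coefficients to be 6/5 (longitudinal) and 2/5 (transverse), the standard expansion for the rigid spherical electron, and compares them with the relativistic expressions m_0/(1 - β²)^{3/2} and m_0/(1 - β²)^{1/2}. The function names are illustrative.

```python
import math


def relativistic_masses(beta):
    """(longitudinal, transverse) mass in units of m0, relativistic formulae."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return gamma ** 3, gamma


def abraham_masses_second_order(beta):
    """(longitudinal, transverse) mass in units of m0 to order beta^2,
    taking Abraham's coefficients as 6/5 and 2/5 (assumed here)."""
    return 1.0 + 1.2 * beta * beta, 1.0 + 0.4 * beta * beta


def correction_ratios(beta):
    """Ratio of the Abraham to the relativistic velocity-dependent correction."""
    rel_l, rel_t = relativistic_masses(beta)
    abr_l, abr_t = abraham_masses_second_order(beta)
    return (abr_l - 1.0) / (rel_l - 1.0), (abr_t - 1.0) / (rel_t - 1.0)


print(correction_ratios(0.01))  # both ratios lie close to 0.8 for small beta
```

For small β both ratios approach 4/5, i.e. the second-order corrections of the two theories differ by one fifth.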


3. Propagation of Electromagnetic Waves in Moving Media. 

(a) Fizeau's experiment 

The classical experiment on the propagation of light in moving 
media is that of Fizeau * in which light was made to pass through 
flowing water. The small difference between the index of refraction 
of moving and stationary water was measured by an interference 
method. Approaching the question quite naively, one might believe 
that the velocity of the flowing water simply would be added to the 
phase velocity as given by the index of refraction for water at rest. 
It is apparent, however, that this view leads one into difficulties, for 
evidently this convection effect must exist to the full extent even for 
very tenuous moving matter (e.g. in a rapidly flowing gas), while in a 
perfect vacuum there can be no effect. This would mean a discon- 
tinuity contrary to all experience. The actual result of the experiment 
showed that the wave velocity $u$ is increased by the velocity of flow
$v$ multiplied by the factor $(1 - 1/n^2)$. The factor $(1 - 1/n^2)$, which
approaches zero when $n$ approaches unity, is called Fresnel's convection
coefficient. This result is obtainable also from the theory of a
stationary ether, but follows much more simply from the relativity
theory as a consequence of the addition theorem for velocities (cf. p. 244).
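
The connexion with the addition theorem can be verified numerically. In the sketch below the phase velocity c/n in the medium is compounded relativistically with the flow speed v, and the resulting increase of the light speed is compared with v(1 - 1/n²). The refractive index and the flow speed are illustrative values, and the function names are assumptions.

```python
C_CM_PER_S = 2.99792458e10  # speed of light, cm/s


def light_speed_in_moving_medium(n, v):
    """Relativistic composition of the phase velocity c/n with the flow speed v."""
    u = C_CM_PER_S / n
    return (u + v) / (1.0 + u * v / C_CM_PER_S ** 2)


def drag_coefficient(n, v):
    """Fraction of v by which the light speed is increased."""
    return (light_speed_in_moving_medium(n, v) - C_CM_PER_S / n) / v


n_water, v_flow = 1.333, 700.0   # illustrative values; v in cm/s
print(drag_coefficient(n_water, v_flow), 1.0 - 1.0 / n_water ** 2)
```

The two printed numbers agree up to terms of order v/c, and the coefficient vanishes for n = 1, as required for continuity with the vacuum.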

* Fizeau, Comptes rendus, 33, p. 349 (1851); Ann. d. Phys. u. Chem., Erg. 3.

(b) The Michelson-Morley experiment

A large number of experiments intended to detect the influence
of the motion of the earth on the propagation of light have been proposed.
For all of these, with one exception (the Michelson-Morley
experiment), the accuracy of observation is not great enough to detect
deviations of higher order than the first power of $v/c$. The stationary
ether theory yields no first-order effect for a common uniform translation
of source, observer and intervening parts of the apparatus.
To that extent all these experiments also confirm the theory of a
stationary ether. This theory, however, does demand a second-order
effect, and the Michelson-Morley experiment is the only optical experiment
sufficiently sensitive to detect this effect. This experiment
has already been discussed in detail on p. 238. The absence of a second-order
effect here, the negative result of the Trouton-Noble experiment and
the departures of the mass variability of the electron from the Abraham
formula (p. 477) are evidence for the fact that the concept of a stationary
ether cannot correspond with reality.

(c) Doppler effect and aberration of light 
These phenomena have been discussed in detail on p. 245. 


(d) The experiment of Sagnac 

In this experiment the source of light, the observer, and the intermediate
apparatus are given a common rotational motion.* Thus,
in spite of the fact that all parts have a common translational motion,
there is a rotational effect to be observed. The experimental arrangement
is shown schematically in fig. 4. The semi-silvered plate divides the
beam into two portions, one traversing the quadrilateral formed by the
mirrors in the direction of rotation, the other in the opposite direction.
An interference pattern is formed when the beams unite, just as in
the Michelson-Morley experiment. The interference fringes are registered
on a photographic plate attached to the apparatus.

[Fig. 4]

* G. Sagnac, Comptes rendus, 157, p. 708 (1913); Journ. de Physique, 4, Ser. 5,
p. 177 (1914).

As in the Michelson-Morley experiment, assume the classical compounding
of velocities and compute the times required for the light to go in both
directions and arrive again at G. Taking the axis of rotation as origin,
suppose the light travels along a curve whose polar equation is $r = r(\varphi)$.
If, at a given instant, an element of the curve makes an angle
$\theta = \cos^{-1}(r\,d\varphi/ds)$ with the direction of its translational motion
arising from the turning of the whole apparatus, then the relative speed
when going around in the same direction as the rotation will be

$$v_- = c - \omega r\cos\theta = c - r^2\omega\,\frac{d\varphi}{ds},$$

and in the opposite direction

$$v_+ = c + r^2\omega\,\frac{d\varphi}{ds}.$$

The difference of the two corresponding time intervals is

$$\Delta t = \oint\frac{ds}{c - r^2\omega\,d\varphi/ds} - \oint\frac{ds}{c + r^2\omega\,d\varphi/ds}.$$

Since the second term in each denominator is small, this becomes

$$\Delta t = \frac{2\omega}{c^2}\oint r^2\,d\varphi = \frac{4\omega S}{c^2},$$

where $S$ is the area enclosed. Then, by p. 239, the displacement of the
fringes amounts to

$$\Delta Z = \frac{4\omega S}{c\lambda}, \qquad (19)$$


where $\omega$ is the angular velocity of rotation, $S$ the area of the surface
enclosed by the light path and $\lambda$ the wave-length of the light. This
displacement is, in fact, observed. Experiments were performed using
widely different values of both $\omega$ and $S$. In one trial Sagnac used a
rapidly rotating disc of about 1 m. diameter; in another trial the
apparatus was taken on board ship; the angular motion was obtained
by sailing over a curved course. In recent years Michelson and Gale *
used the rotation of the earth itself; in order to compensate for the
small value of $\omega$, a light path several kilometres in length was used.
Technically, this experiment differs from that of Sagnac in that the
zero position of the fringes cannot be determined by stopping the
rotation; hence it was necessary to have a second light path, enclosing
a negligible area, so arranged as to give fringes serving as a reference
mark. Further, unless the circuit is at one of the poles, the angular
velocity vector is not normal to the plane of the circuit, and only the
normal component is effective. The result of this experiment was in
complete accord with the calculations. The fact that the stationary
ether concept leads to correct results for rotating systems in optical
experiments also is no more a proof of the existence of the ether than
is the occurrence of the Coriolis and centrifugal forces in mechanics.
The Michelson-Gale experiment is the optical analogue of the
Foucault pendulum. It is of importance from the fact that it makes
all attempts to explain the negative result of the Michelson-Morley
experiment by convection of the ether seem futile, for it seems absurd
that the ether could be carried along by the earth completely in translation
and not at all in rotation. As explained on p. 258, the above-mentioned
mechanical experiments are explicable under the generalized
theory of relativity without using the concept of an ether at rest,
and the same is true of the Sagnac experiment.

* A. A. Michelson and H. G. Gale, Astrophys. Journ., 61, p. 140 (1925).
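
The time difference between the two senses of circulation can be checked by direct summation around a closed light path. The following sketch assumes the classical compounding of velocities used above, a polygonal path in the plane of rotation with the axis at the origin, and the relation r² dφ = x dy - y dx; it compares the sum with 4ωS/c² and evaluates the fringe displacement in the form 4ωS/(cλ). The units and numerical values are arbitrary and chosen only for illustration.

```python
import math


def time_difference(vertices, omega, c, steps_per_side=200):
    """Difference of the circulation times around a closed polygon rotating
    with angular velocity omega about the origin (classical velocity addition).

    vertices must be listed counter-clockwise, i.e. in the sense of rotation.
    """
    total = 0.0
    n = len(vertices)
    for k in range(n):
        (x0, y0), (x1, y1) = vertices[k], vertices[(k + 1) % n]
        dx = (x1 - x0) / steps_per_side
        dy = (y1 - y0) / steps_per_side
        ds = math.hypot(dx, dy)
        for j in range(steps_per_side):
            x = x0 + (j + 0.5) * dx
            y = y0 + (j + 0.5) * dy
            w = omega * (x * dy - y * dx) / ds   # r^2 * omega * dphi/ds
            total += ds / (c - w) - ds / (c + w)
    return total


def sagnac_fringe_shift(omega, area, wavelength, c):
    """Fringe displacement 4 omega S / (c lambda)."""
    return 4.0 * omega * area / (c * wavelength)


# Unit square centred on the axis, c = 1, omega = 1e-3 (arbitrary units).
square = [(-0.5, -0.5), (0.5, -0.5), (0.5, 0.5), (-0.5, 0.5)]
print(time_difference(square, 1e-3, 1.0), 4.0 * 1e-3 * 1.0 / 1.0 ** 2)
```

For a circuit on the earth at latitude φ, only the normal component of the angular velocity enters, so `omega` would be replaced by the earth's angular velocity multiplied by sin φ.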


4. Relativistic-invariant Form of the Electromagnetic Equations.

The mathematical formulation of the theory of a stationary ether 
involves the application of formulae (1) to (5) (p. 471), with the 
derivatives d/dt ... for the moving bodies formed according to the 
Galilean transformation; i.e. we equate the total time rate of change 
of a quantity $u$ to the sum of the pure time rate $\partial u/\partial t$ for a fixed point
and the spatial change $\mathbf{v}\cdot\operatorname{grad} u$ due to displacement of this point, as
in hydromechanics. But, as already indicated, this theory leads to 
conflicts with experience for effects of second order. Small as are the 
differences between the theoretical and experimental results, they 
nevertheless compel us to abandon the theory. Now it is very re- 
markable that, as will be shown below, the equations of the electron 
theory may be set down at once as equations connecting world vectors, 
which means that they are invariant under the Lorentz transformation 
and not under the Galilean. One might at first believe this to be a 
matter of course, since the Lorentz transformation was introduced in 
such a way that the wave equation remained invariant under it. It is 
to be noted, however, that while the wave equation is a consequence 
of the electromagnetic field equations, the electromagnetic equations 
are not, reciprocally, consequences of the wave equation, since the 
latter also occurs elsewhere, e.g. in the Theory of Elasticity. Again, 
the invariance of the electromagnetic equations under the Lorentz 
transformation — which is, as a matter of fact, identical with the 
Galilean to a first approximation — shows that the stationary ether 
theory also fails to indicate first order effects when all parts of an electro- 
magnetic system share a common motion. This fact was stated in the 
preceding section, but without proof. 

In order to obtain a simple form of equations (1) to (4) (p. 471), 
we restrict our considerations to weakly magnetic substances, so that 
the term $\partial\mathbf{M}/\partial t$ in equation (2) may be omitted. In addition, let us
combine the displacement current $\partial\mathbf{P}/\partial t$ with the conduction current,
i.e. let the terms $(4\pi/c)\rho\mathbf{v}$ consist of the sum of a convection current
of free charges and a part due to the displacement of the bound charges
from their equilibrium positions. The term $\partial\mathbf{P}/\partial t$ then appears to be
absent, but actually $\rho\mathbf{v}$ includes both the conduction current and the
contribution of the displacement. Writing $l/ic$ in place of $t$, the co-ordinate
forms of equations (1) and (3) become


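$$\begin{aligned}
\frac{\partial H_z}{\partial y} - \frac{\partial H_y}{\partial z} - \frac{\partial (iE_x)}{\partial l} &= \frac{4\pi}{c}\,(\rho_+ v_{+x} - \rho_- v_{-x}),\\
-\frac{\partial H_z}{\partial x} + \frac{\partial H_x}{\partial z} - \frac{\partial (iE_y)}{\partial l} &= \frac{4\pi}{c}\,(\rho_+ v_{+y} - \rho_- v_{-y}),\\
\frac{\partial H_y}{\partial x} - \frac{\partial H_x}{\partial y} - \frac{\partial (iE_z)}{\partial l} &= \frac{4\pi}{c}\,(\rho_+ v_{+z} - \rho_- v_{-z}),\\
\frac{\partial (iE_x)}{\partial x} + \frac{\partial (iE_y)}{\partial y} + \frac{\partial (iE_z)}{\partial z} &= \frac{4\pi}{c}\,ic\,(\rho_+ - \rho_-),
\end{aligned} \qquad (20)$$

while equations (2) and (4) are

$$\begin{aligned}
\frac{\partial (iE_z)}{\partial y} - \frac{\partial (iE_y)}{\partial z} - \frac{\partial H_x}{\partial l} &= 0,\\
\frac{\partial (iE_x)}{\partial z} - \frac{\partial (iE_z)}{\partial x} - \frac{\partial H_y}{\partial l} &= 0,\\
\frac{\partial (iE_y)}{\partial x} - \frac{\partial (iE_x)}{\partial y} - \frac{\partial H_z}{\partial l} &= 0,\\
\frac{\partial H_x}{\partial x} + \frac{\partial H_y}{\partial y} + \frac{\partial H_z}{\partial z} &= 0.
\end{aligned} \qquad (21)$$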

We proceed to show that these are the component forms of two equations
between world vectors, or tensors. First we investigate how the
density of charge $\rho$ changes under the Lorentz transformation. Start
from the assumption that the charge $e = \rho\,\Delta\tau$ is independent of the
state of motion, and so does not change in this transformation. On the
other hand, all volumes are reduced in the ratio $\sqrt{1-\beta^2} : 1$ by the
Lorentz contraction; hence to a moving observer the charge density
$\rho$ appears to be increased in the ratio $1/\sqrt{1-\beta^2} : 1$. Denoting by $\rho_0$
the density of charge as measured by an observer moving with it, we
have

$$\rho = \frac{\rho_0}{\sqrt{1-\beta^2}} \quad\text{or}\quad \rho\sqrt{1-\beta^2} = \rho_0 = \text{invariant}. \qquad (22)$$

According to p. 266, the components of the four-velocity $q$ are

$$\frac{v_x}{\sqrt{1-\beta^2}},\quad \frac{v_y}{\sqrt{1-\beta^2}},\quad \frac{v_z}{\sqrt{1-\beta^2}},\quad \frac{ic}{\sqrt{1-\beta^2}}.$$

If we multiply each of these expressions by the invariant of (22) we
obtain the components of a new world vector which we call the
four-current $s$; the positive charges yield the four-current $s^+$, the
negative, the four-current $s^-$. Since the factor $\sqrt{1-\beta^2}$ cancels out in
the multiplication, the components are the right members of equations
(20), apart from a factor $4\pi/c$. Consequently, the left members must
also be the components of a world vector. This is derived from a world
tensor by an operation which is independent of the system of co-ordinates.
We already applied such an operation in three dimensions
in the Theory of Elasticity without introducing any special designation
for it. If, in the three-dimensional case, say, we have a tensor in the
form

$$\Psi = \mathbf{a}_1\,.\,\mathbf{i} + \mathbf{a}_2\,.\,\mathbf{j} + \mathbf{a}_3\,.\,\mathbf{k},$$

and we use this tensor as a postfactor in multiplying with the nabla
operator, we obtain

$$\nabla\Psi = \nabla\mathbf{a}_1\,.\,\mathbf{i} + \nabla\mathbf{a}_2\,.\,\mathbf{j} + \nabla\mathbf{a}_3\,.\,\mathbf{k} = \operatorname{div}\mathbf{a}_1\,.\,\mathbf{i} + \operatorname{div}\mathbf{a}_2\,.\,\mathbf{j} + \operatorname{div}\mathbf{a}_3\,.\,\mathbf{k},$$

i.e. a vector whose components are $\operatorname{div}\mathbf{a}_1$, $\operatorname{div}\mathbf{a}_2$, $\operatorname{div}\mathbf{a}_3$. We shall refer
to this operation as the formation of the divergence vector of the tensor
$\Psi$, and shall designate the operation by

$$\nabla\Psi = \operatorname{div}\Psi. \qquad (23)$$


The extension to the four-dimensional case involves nothing new. If 
we call the world tensor whose components are 


$$\begin{pmatrix}
0 & H_z & -H_y & -iE_x\\
-H_z & 0 & H_x & -iE_y\\
H_y & -H_x & 0 & -iE_z\\
iE_x & iE_y & iE_z & 0
\end{pmatrix}$$

the electromagnetic field tensor $\Phi$, we see that the components of the
world vector of equation (20) are obtained from this world tensor $\Phi$
precisely by the operation given above. Hence equation (20) may be
written

$$\operatorname{div}\Phi = \frac{4\pi}{c}\,(s^+ + s^-). \qquad (24)$$

But according to equation (65) (p. 251), the system (21) above states
that the antisymmetric world tensor $\Phi$ is derived from a world vector
potential $V$ by taking the curl in four dimensions:

$$\Phi = \operatorname{curl} V. \qquad (25)$$

The formulation of the laws of electromagnetism given by (24) and 
(25) seems, at first, somewhat colourless; nevertheless, compared with 
the usual three-dimensional vector equations, it has the great advantage
that it combines the electric and magnetic fields so as to produce
a single unified field specified by the tensor $\Phi$, the resolution into
electric and magnetic components depending upon the state of motion 
of the observer. A field which appears to an observer at rest to be 
purely magnetic may, for a moving observer, appear to have an elec- 
trical component. Moreover, the change from one system to another 
is reduced to the purely formal operation of transforming four-dimen- 
sional tensor components. For example, in the experiment of W. Wien 
(p. 474), the world tensor $\Phi$ has the following components for the
system in which the apparatus is at rest:

$$\begin{pmatrix}
0 & H & 0 & 0\\
-H & 0 & 0 & 0\\
0 & 0 & 0 & 0\\
0 & 0 & 0 & 0
\end{pmatrix}.$$


On the other hand, consider a canal ray particle moving in the direction
of the x-axis with velocity $v$. Since

$$l' = l\cos\varphi + x\sin\varphi,\qquad x' = -l\sin\varphi + x\cos\varphi,\qquad \tan\varphi = -i\beta,$$

the rule for transforming of tensor components (pp. 34, 249) yields

$$\begin{pmatrix}
0 & H\cos\varphi & 0 & 0\\
-H\cos\varphi & 0 & 0 & -H\sin\varphi\\
0 & 0 & 0 & 0\\
0 & H\sin\varphi & 0 & 0
\end{pmatrix}.$$

We see that an electrical component has made its appearance, viz.:

$$iE'_y = H\sin\varphi = -\frac{i\beta H}{\sqrt{1-\beta^2}} \quad\text{or}\quad E'_y = -\frac{Hv/c}{\sqrt{1-\beta^2}},$$

which is identical with that derived from equation (4), p. 471 as far
as quantities of second order. Likewise, the magnetic component
remains unchanged as far as second order quantities.*

* Derive the Fresnel coefficient from the relativistic velocity addition
theorem.
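
The purely formal character of the transformation may be illustrated by carrying it out numerically. The sketch below is an illustration under stated assumptions: co-ordinates are ordered (x, y, z, l = ict), the field tensor is arranged as in the array of components given above, and the change of system is the rotation in the (x, l) plane with cos φ = 1/√(1 - β²) and sin φ = -iβ/√(1 - β²). All function names are hypothetical.

```python
import math


def field_tensor(e, h):
    """Antisymmetric world tensor for co-ordinates (x, y, z, l = i c t)."""
    ex, ey, ez = e
    hx, hy, hz = h
    return [
        [0, hz, -hy, -1j * ex],
        [-hz, 0, hx, -1j * ey],
        [hy, -hx, 0, -1j * ez],
        [1j * ex, 1j * ey, 1j * ez, 0],
    ]


def boost_matrix_x(beta):
    """Rotation in the (x, l) plane with tan(phi) = -i * beta."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    cos_phi, sin_phi = g, -1j * beta * g
    return [
        [cos_phi, 0, 0, -sin_phi],   # x' = x cos(phi) - l sin(phi)
        [0, 1, 0, 0],
        [0, 0, 1, 0],
        [sin_phi, 0, 0, cos_phi],    # l' = x sin(phi) + l cos(phi)
    ]


def transform(a, phi):
    """Tensor rule: phi'[i][j] = sum over k, m of a[i][k] a[j][m] phi[k][m]."""
    r = range(4)
    return [[sum(a[i][k] * a[j][m] * phi[k][m] for k in r for m in r)
             for j in r] for i in r]


def fields_from_tensor(phi):
    """Read E and H back out of the tensor components."""
    e = tuple((phi[3][k] / 1j).real for k in range(3))
    h = (phi[1][2].real, phi[2][0].real, phi[0][1].real)
    return e, h


# Purely magnetic field H along z in the rest system of the apparatus.
e_new, h_new = fields_from_tensor(
    transform(boost_matrix_x(0.6), field_tensor((0, 0, 0), (0, 0, 2.0))))
print(e_new, h_new)
```

For β = 0.6 and H = 2 the moving particle finds E'_y = -βH/√(1 - β²) and H'_z = H/√(1 - β²), in agreement with the components written out above.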




PART V

The Theory of Heat. Phenomenological Part 


The Theory of Heat, like the Theory of Electricity, may be developed from
two different points of view. The phenomenological method employs only concepts
like temperature, quantity of heat, &c., which are taken from the macroscopic
world of observations and which can be measured directly. The laws thus
obtained have the advantage of being free from hypothetical assumptions. On
the other hand, in the Theory of Heat, e.g. in connexion with the Law of Entropy,
we feel the need of a deeper “explanation”. Such an interpretation, deeper
because more vivid, is furnished by the atomistic, statistical view. Here again,
as in the electrical case, we find that although we develop the theory separately
from the two sides, it will often prove advantageous, while we follow one line of
approach, not to shut our eyes completely to the other.




CHAPTER XXVIII 


Theory of the Conduction of Heat

1. Definition of Temperature, Quantity of Heat, Thermal Capacity and 
Specific Heat. 

The sensory organs of the skin permit us, within certain limits, 
to perceive temperature. In particular, we can decide readily which 
of two objects is the warmer. However, this qualitative measure of 
temperature is not sufficient for physical measurements and we must 
seek a phenomenon which is causally connected in some way with 
the temperature of a body. The increase in volume of a substance, 
e.g. mercury, with rising temperature is a phenomenon of this kind. 
For convenience of observation of the expansion, the mercury is con- 
tained in a bulb and allowed to expand into a fine capillary tube at- 
tached to it. A thermometer of this kind is calibrated by immers- 
ing the bulb in melting ice and marking the position of the mercury 
filament, then immersing the bulb in the steam rising from water 
boiling at standard atmospheric pressure and again marking the 
position of the thread of mercury. The interval is then divided into 
one hundred equal parts. We have thus obtained a provisional scale
of temperature. If we repeat this process with another thermometric
substance, we find that accurate observation reveals a difficulty;
viz. since the volume is not strictly a linear function of temperature,
two thermometers filled with different substances and calibrated
as above will not agree exactly at intermediate points when
immersed in the same substance. We might of course avoid this difficulty
by the arbitrary choice of a standard thermometric substance,
say mercury. There is, however, a better plan: we obtain almost
perfect agreement by using gases which are as far as possible from
their boiling-points. For practical reasons a gas thermometer is usually
arranged so that we measure not the expansion but rather the increase
of pressure occurring when the volume is held constant. Since individual
differences appear when such gases are in the neighbourhood of
their boiling temperatures, we choose as a thermometric substance
helium, which, under ordinary conditions, is farthest from its boiling-point.
Thus exact thermometric measurements are based
upon a temperature scale determined by the increase in pressure of a confined
volume of helium. Moreover, the gas thermometer defines a natural
zero point for our scale — the so-called absolute zero — which is at — 273° 
on the Centigrade scale defined above. The determination of this zero 
point will be discussed later. The temperature scale defined on this 
basis departs but little from one defined on purely theoretical grounds, 
the thermodynamic scale, which will be explained later. We shall
denote temperatures on the Centigrade scale by t , those on the absolute 
scale by T . There will be no danger of confusing t with the time, since 
pure thermodynamics deals only with equilibrium. In the Theory of 
Heat Conduction, however, we shall denote the temperature by $\theta$.
Since, in the latter case, we are concerned only with temperature 
differences, it is immaterial whether we understand this to be Centi- 
grade or absolute temperature. 
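
How the gas thermometer fixes a natural zero may be seen from a small calculation. The sketch below assumes that the pressure at constant volume varies linearly with the Centigrade temperature between the ice point and the steam point and extrapolates to vanishing pressure. The pressure readings are hypothetical and are chosen only to illustrate the procedure.

```python
def absolute_zero_centigrade(p_ice, p_steam):
    """Centigrade temperature at which the linearly extrapolated pressure vanishes."""
    return -100.0 * p_ice / (p_steam - p_ice)


def centigrade_from_pressure(p, p_ice, p_steam):
    """Gas-thermometer temperature from the pressure at constant volume."""
    return 100.0 * (p - p_ice) / (p_steam - p_ice)


# Hypothetical readings (the same units for all pressures).
p_ice, p_steam = 1000.0, 1366.1
print(absolute_zero_centigrade(p_ice, p_steam))
print(centigrade_from_pressure(1183.05, p_ice, p_steam))
```

With a pressure ratio of about 1.366 between the steam point and the ice point, the extrapolated zero falls close to -273° on the Centigrade scale.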

In addition to temperature there is a second important concept — 
that of quantity of heat — which is given by ordinary experience. If 
we drop a 1 gm. mass of iron at 100° C. into 10 gm. of water at 10° C. 
and, in a second experiment, drop a 10 gm. piece of iron at 100° C. 
into the same amount of water at 10° C. as before, we observe, 
after the systems have come to equilibrium, that the rise in temperature
of the water is about nine times as great in the second case as in
the first. This leads to the idea that a body possesses a given heat 
content which is equal to the product of the temperature by a quantity 
proportional to the mass. This latter quantity is called the heat 
capacity. It is to be noticed that we deal only with differences in quantities
of heat, so that the absolute value is undetermined. The process
of coming to equilibrium in the above example may then be
interpreted as the transfer of a given quantity of heat from one body 
to another. Without forming special conceptions of the nature of 
heat, we postulate a conservation of quantity of heat for such processes. 
As a unit of quantity of heat, we choose the amount of heat necessary 
to raise the temperature of one gramme of water from 14.5° C. to
15.5° C., and call this a small calorie (sometimes “gramme calorie”).
There is a larger unit, the large calorie (or kilogram calorie) which is one 
thousand times as great. 

Experimenting further, we find that 1 gm. of aluminium at 100° C.,
when placed in 10 gm. of water at 10° C. imparts to it a greater rise in 
temperature than does 1 gm. of lead at 100° C. Hence we set the heat 
capacity equal to the product of the mass by a constant of the material. 
This constant is taken to be unity for water at 15° C. The quantity 
of heat in calories required to raise the temperature of 1 gramme of a 
substance 1° C. is called the specific heat of the substance, preferably 
called the specific heat per gramme. There is another quantity which 
is important— the specific heat per mol, i.e. the thermal capacity of one 
mol of a substance. The advantage in using this molecular heat is that 
in comparing the molecular heats of various substances, the same
numbers of molecules are involved. We shall denote the thermal
capacity by C, the specific heat per gramme by c_g and the molecular
heat by c. In order to determine the equilibrium temperatures in the
above examples we need only equate the quantities of heat involved, 
before and after mixing. In the example of 1 gm. of iron at 100° C. 
placed in 10 gm. of water at 10° C., the end temperature x is obtained 
from the relationship 

$$1 \cdot c_g \cdot 100 + 10 \cdot 10 = (c_g + 10)\,x,$$

where c_g is the specific heat per gramme of the iron.*
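
The heat balance above can be checked numerically. The following is a minimal sketch that solves it for the end temperature x in the two iron-and-water experiments described earlier. The specific heat of iron, taken here as 0.11 cal/gm. deg., is an assumed round value that the text does not give, and the function name is purely illustrative.

```python
def equilibrium_temperature(m_body, c_body, t_body, m_water, t_water, c_water=1.0):
    """End temperature x from conservation of quantity of heat:
    m_body*c_body*t_body + m_water*c_water*t_water
        = (m_body*c_body + m_water*c_water) * x
    """
    capacity_body = m_body * c_body      # heat capacity of the immersed body
    capacity_water = m_water * c_water   # heat capacity of the water
    total_heat = capacity_body * t_body + capacity_water * t_water
    return total_heat / (capacity_body + capacity_water)


C_IRON = 0.11  # cal/(gm. deg.), assumed illustrative value, not from the text

x_small = equilibrium_temperature(1.0, C_IRON, 100.0, 10.0, 10.0)   # 1 gm. of iron
x_large = equilibrium_temperature(10.0, C_IRON, 100.0, 10.0, 10.0)  # 10 gm. of iron
rise_ratio = (x_large - 10.0) / (x_small - 10.0)

print(f"1 gm. iron:  x = {x_small:.2f} deg. C.")
print(f"10 gm. iron: x = {x_large:.2f} deg. C.")
print(f"ratio of the temperature rises = {rise_ratio:.2f}")
```

With this assumed specific heat the ratio of the two temperature rises comes out close to nine, in agreement with the observation quoted for the two experiments.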

In spite of this simple applicability of the concept of heat content, 
it must be mentioned that these simple relationships exist only when
the bodies under consideration do not undergo other changes. In 
general, the heat capacity is by no means a function of the temperature 
alone (cf. p. 502). 


2. The Differential Equation of Heat Conduction; Initial and Boundary 
Conditions. 

Common experience shows that any temperature difference be- 
tween parts of a body reduces to zero after a time, so that heat must 
flow from the places of higher temperature to those of lower tem- 
perature. It is natural to assume that the density of the thermal 
current, i.e. the quantity of heat passing per second through unit area 
of a surface normal to the direction of flow, is proportional to the 
negative of the temperature gradient. This assumption is confirmed by 
experiment. The factor of proportionality k is called the thermal 
conductivity of the substance. If now we imagine a certain volume of
a body isolated, the rate of decrease of the heat content is 


$$-\frac{dW}{dt} = -\int \rho\, c_g\, \frac{\partial \theta}{\partial t}\, dV, \qquad (1)$$

where θ is the temperature and ρ the density. This decrease is due to

conduction, the heat flowing away through the bounding surface. The 
flow is given by 

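$$-\frac{dW}{dt} = -k \oint \frac{\partial \theta}{\partial n}\, dS, \qquad (2)$$

where ∂θ/∂n is the temperature gradient along the outward normal to the surface element dS.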

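Equating the two, we have

$$\int \rho\, c_g\, \frac{\partial \theta}{\partial t}\, dV = k \oint \frac{\partial \theta}{\partial n}\, dS. \qquad (3)$$

* The same result is obtained by equating the heat given up by the hot body (or bodies) to the heat taken up by the cold one:
$1 \cdot c_g\,(100 - x) = 10\,(x - 10)$,
which is, of course, the same as the equation given above.

Transforming the surface integral into a volume integral by Gauss's theorem, and noting that the result must hold for any volume, we obtain the equation
of heat conduction

$$\frac{\partial \theta}{\partial t} = \frac{k}{\rho\, c_g}\, \Delta\theta = m\, \Delta\theta. \qquad (4)$$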

The constant m = k/(ρ c_g) was named the diffusivity by Kelvin.*

As in the case of all partial differential equations, the solution of 
the problem is not defined unless initial and boundary conditions are 
given. Thus there must be given a definite distribution of temperature 
at time t = 0, and also information as to what is happening at the
boundaries of the body. The surface of the body, for example, may 
be completely insulated. In general this will not be the case; as a rule, 
heat will pass outwards through the surface. It is found that the 
surface density of the flow of heat normally outwards through the 
surface is approximately proportional to the difference in temperature 
between the surface of the body and the surrounding medium. 
The factor of proportionality is called the surface conductivity , or 
emissivity. 
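
To make the role of the initial and boundary conditions concrete, the sketch below integrates the one-dimensional form of the heat conduction equation for a rod whose ends are completely insulated. It uses a simple explicit finite-difference scheme, which is a numerical technique brought in for illustration and not a method of the text; the grid, the diffusivity m = 1 and the initial temperature distribution are all assumed values.

```python
def step_heat_1d(theta, m, dx, dt):
    """One explicit step of d(theta)/dt = m * d2(theta)/dx2 on a rod
    with insulated ends (no heat flow through either boundary)."""
    r = m * dt / dx**2
    if r > 0.5:
        raise ValueError("explicit scheme is unstable for m*dt/dx**2 > 0.5")
    n = len(theta)
    new = theta[:]
    for i in range(n):
        # Mirror the end values so that the temperature gradient at the ends is zero.
        left = theta[i - 1] if i > 0 else theta[i]
        right = theta[i + 1] if i < n - 1 else theta[i]
        new[i] = theta[i] + r * (left - 2.0 * theta[i] + right)
    return new


# Initial condition at t = 0: left half of the rod hot, right half cold.
theta = [100.0] * 10 + [0.0] * 10
initial_mean = sum(theta) / len(theta)

for _ in range(5000):
    theta = step_heat_1d(theta, m=1.0, dx=1.0, dt=0.25)

final_mean = sum(theta) / len(theta)
final_spread = max(theta) - min(theta)
print(f"mean temperature: {initial_mean:.3f} -> {final_mean:.3f}")
print(f"remaining temperature difference: {final_spread:.2e}")
```

Because no heat crosses the insulated ends, the mean temperature is unchanged while the temperature differences within the rod die away, as common experience requires.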


3. A Simple Example of the Integration of the Equation of Heat Con- 
duction: Penetration of the Daily and Yearly Temperature 
Variations into the Interior of the Earth. 


Take the surface of the earth to be plane and assume the surface 
to undergo periodic temperature variations as a result of the fluctuating 
radiation to which it is exposed. The boundary conditions here are 
especially simple, for there is no heat flow through the lateral surface 
of a cylinder whose generators are vertical. This is because the iso- 
thermal surfaces are planes parallel to the earth’s surface, as may be 
seen intuitively; hence the entire flow is normal to the surface of the 
earth. Taking the x-axis along the inward normal to this surface, the 
differential equation becomes 


$$\frac{\partial \theta}{\partial t} = m\, \frac{\partial^2 \theta}{\partial x^2}. \qquad (4')$$



* Clerk-Maxwell called this quantity the thermometric conductivity, but Kelvin’s
terminology is more widely used.

Since θ at the surface is a periodic function of the time, we take

$$\theta = f(x)\, e^{i\omega t}. \qquad (5)$$

Substituting this value in (4') we obtain the following differential
equation for f(x):

$$i\omega f = m\, \frac{d^2 f}{dx^2}, \qquad (6)$$

which we seek to solve by setting f = a e^{λx}. The “characteristic
equation” becomes

$$m\lambda^2 - i\omega = 0, \qquad (7)$$

whence

$$\lambda = \pm\sqrt{\frac{i\omega}{m}} = \pm\sqrt{\frac{\omega}{2m}}\,(1 + i). \qquad (8)$$

The negative sign is the one to be selected, for if we limit our inquiry 
to the penetration of the surface variations, there must be a decrease 
toward the interior. Then 


$$\theta = \theta_0\, e^{-\sqrt{\omega/2m}\,x}\, e^{i\omega\left[t - x/\sqrt{2m\omega}\right]}. \qquad (9)$$

This is the equation of a damped wave propagated with velocity
v = √(2mω). This shows that the velocity is greater for the diurnal
variation than for the annual, but at the same time the damping is 
greater for the former. 
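
The comparison between the diurnal and the annual variation can be put into numbers. The sketch below evaluates the velocity v = √(2mω) and the depth √(2m/ω) at which the amplitude has fallen to 1/e of its surface value. The diffusivity of the soil, 0.005 cm.²/sec., is an assumed illustrative figure and is not taken from the text.

```python
import math


def thermal_wave(m, period):
    """Velocity and 1/e damping depth of the damped temperature wave
    for diffusivity m (cm^2/sec) and period of the surface variation (sec)."""
    omega = 2.0 * math.pi / period
    velocity = math.sqrt(2.0 * m * omega)        # v = sqrt(2 m omega)
    damping_depth = math.sqrt(2.0 * m / omega)   # depth where exp(-sqrt(omega/2m) x) = 1/e
    return velocity, damping_depth


M_SOIL = 0.005        # cm^2/sec, assumed illustrative diffusivity of soil
DAY = 86400.0         # sec
YEAR = 365.0 * DAY    # sec

v_day, d_day = thermal_wave(M_SOIL, DAY)
v_year, d_year = thermal_wave(M_SOIL, YEAR)

print(f"diurnal: v = {v_day:.2e} cm/sec, 1/e depth = {d_day:.1f} cm")
print(f"annual:  v = {v_year:.2e} cm/sec, 1/e depth = {d_year:.1f} cm")
```

Whatever value is assumed for m, both the velocity ratio and the depth ratio equal √365, about 19: the daily wave travels faster but is extinguished at a much smaller depth.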

It is also easy to fulfil the initial time conditions for x = 0. Since
the real as well as the imaginary part of (9) represents a solution, we
may obtain the solution for any values of θ_0 and (∂θ/∂x)_0 from the
general solution containing two constants of integration:

$$\theta = A\, e^{-\sqrt{\omega/2m}\,x} \cos\omega\!\left(t - \frac{x}{\sqrt{2m\omega}}\right) + B\, e^{-\sqrt{\omega/2m}\,x} \sin\omega\!\left(t - \frac{x}{\sqrt{2m\omega}}\right).$$


If, for example, we begin counting time from a moment when
θ at x = 0 is a maximum, we have the solution

$$\theta = \theta_0\, e^{-\sqrt{\omega/2m}\,x} \cos\omega\!\left(t - \frac{x}{\sqrt{2m\omega}}\right),$$

where θ and θ_0 are, of course, the differences reckoned from the average
temperature.



CHAPTER XXIX 

The Equation of State of Thermodynamic Systems 

1. Definition of the Thermodynamic Variables and the Relationships 
between them. 

In thermodynamics, the treatment of which we begin here, the 
word “ system ” is used as the most general and neutral expression for 
an aggregation of bodies considered as a whole and assumed to be 
isolated from their surroundings. A system may consist of several 
distinct physical parts called phases. For example, if we have a 
saturated NaCl solution containing an undissolved crystal of NaCl, 
the whole being covered by a bell jar, then we have a system of three 
phases: a solid phase (the crystal), a liquid phase (the solution) and a 
gaseous phase consisting, in this instance, of a mixture of air 
and water vapour. Since all gases are miscible, a system in equili- 
brium can have but one gaseous phase, but may have several solid or 
liquid phases. For example, we can introduce an additional solid 
phase by placing a crystal of quartz in the solution. The thermo- 
dynamic behaviour of the system is dependent upon the number of 
“independent components” or “substances”. We understand this
to mean the number of freely variable chemical compounds or elements 
contained in the system. To obtain this number, determine first the 
number of elements present (combined or free) and subtract from this 
the number of elements defined chemically in terms of the remaining 
(independent) elements. For example, water and ice constitute a two- 
phase system of one component, since the quantity of oxygen present is 
determined by the quantity of hydrogen. On the other hand, a solution 
of NaCl in water is a single-phase system of two components. The 
state of a system is determined, in the thermodynamic sense, by giving 
the mass and chemical constitution of each phase as well as the pres- 
sure, volume and temperature. These quantities, however, are not 
independent of one another, but are connected by a relationship called
an equation of state.

We limit our present considerations to a single-phase system whose
chemical constitution is invariable, e.g. a system consisting of 10 gm.
of liquid benzene. Since the mass and chemical constitution are fixed,
there remain only the three quantities temperature t (or T), pressure
p and volume V as true thermodynamic variables. Temperature and
methods of measuring it have already been discussed in Chap. XXVIII 
(p. 487). The volume is measured in cubic centimetres and the pressure 
in dynes per square centimetre. In some instances the pressure is 
measured in atmospheres, this being the pressure capable of support- 
ing a 76 cm. column of mercury at 0° C. 

1 atm. = 76 × 13.6 × 981 = 1.013 × 10^6 dynes/cm.^2 = 1.013 × 10^5 newton/m.^2


Now there is generally an equation of state 

F(p, t, V) = 0, (1)

according to which the condition of a simple system is determined 
by fixing the values of two of the variables. An equation of state 
which is valid over large ranges can be given only for gases. For liquids 
and solids these equations are more like interpolation formulae holding
over a limited range. But, as we shall see presently, important con- 
clusions may be drawn from the mere fact that an equation of state 
exists. 

We now consider those changes of state in which each variable in
turn is held fixed. We thus arrive at definitions for three coefficients,
viz. the following:

The isobaric volume coefficient of expansion α is defined as the rate of
change of volume per degree divided by the volume at 0° C., the pressure
being held constant, i.e.

$$\alpha = \frac{1}{V_0} \lim_{\Delta t \to 0} \frac{V(p, t + \Delta t) - V(p, t)}{\Delta t} = \frac{1}{V_0} \left(\frac{\partial V}{\partial t}\right)_p. \qquad (2)$$


The subscript attached to the partial derivative indicates the quantity
which is held constant.

The isometric (or isochoric) pressure coefficient β is defined as the
rate of pressure change per degree divided by the pressure at 0° C.,
the volume remaining constant:

$$\beta = \frac{1}{p_0} \left(\frac{\partial p}{\partial t}\right)_V. \qquad (3)$$

Finally, the isothermal compressibility κ is defined as the rate of
change in volume per unit pressure (or per atmosphere) divided by
the existing volume, the temperature being held constant:

$$\kappa = -\frac{1}{V} \lim_{\Delta p \to 0} \frac{V(t, p + \Delta p) - V(t, p)}{\Delta p} = -\frac{1}{V} \left(\frac{\partial V}{\partial p}\right)_t. \qquad (4)$$

Since the volume decreases with increasing pressure we insert the negative
sign to make κ positive. The reciprocal of κ is called the isothermal
volume modulus of elasticity. It should be noted that
the coefficients are so defined that α and β are referred to the volume
or pressure at 0° C., while κ is referred to the existing volume. The
advantage of this convention becomes apparent in the case of gases.
For liquids and solids it is practically immaterial whether the change
is referred to the volume at 0° C. or the volume at the existing temperature.
Again, it is immaterial whether we write t or T in defining
α and β, since temperature differences only are concerned. The existence
of an equation of state means that the three coefficients are not
independent. If, for example, we imagine the equation solved for p,
then

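$$p = p(V, t)$$

and

$$dp = \left(\frac{\partial p}{\partial V}\right)_t dV + \left(\frac{\partial p}{\partial t}\right)_V dt. \qquad (5)$$

If p is held constant we have, after division by dt,

$$0 = \left(\frac{\partial p}{\partial V}\right)_t \left(\frac{\partial V}{\partial t}\right)_p + \left(\frac{\partial p}{\partial t}\right)_V.$$

Inserting the coefficients defined above in place of the derivatives,

$$\beta = \frac{\alpha}{\kappa} \cdot \frac{V_0}{p_0 V}. \qquad (6)$$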
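
As a check on the relation between the three coefficients, taken here in the form β p_0 V = (α/κ) V_0, the sketch below evaluates α, β and κ by finite differences for a substance obeying pV = A(t + 273.1). The choice of this equation of state, the constant A = 1 and the state p = 2, t = 50° are assumptions made only for the illustration.

```python
T0 = 273.1   # absolute temperature of melting ice
A = 1.0      # constant in p*V = A*(t + T0); arbitrary units, assumed


def volume(p, t):
    return A * (t + T0) / p


def pressure(v, t):
    return A * (t + T0) / v


def coefficients(p, t, h=1e-6):
    """alpha, beta, kappa at the state (p, t), by central differences."""
    v = volume(p, t)
    v0 = volume(p, 0.0)      # volume at 0 deg. C. and the same pressure
    p0 = pressure(v, 0.0)    # pressure at 0 deg. C. and the same volume
    alpha = (volume(p, t + h) - volume(p, t - h)) / (2 * h) / v0
    beta = (pressure(v, t + h) - pressure(v, t - h)) / (2 * h) / p0
    kappa = -(volume(p + h, t) - volume(p - h, t)) / (2 * h) / v
    return alpha, beta, kappa, v, v0, p0


alpha, beta, kappa, v, v0, p0 = coefficients(p=2.0, t=50.0)
lhs = beta
rhs = alpha / kappa * v0 / (p0 * v)

print(f"alpha = {alpha:.6f}, beta = {beta:.6f}, kappa = {kappa:.6f}")
print(f"beta = {lhs:.6f}, (alpha/kappa) * V0/(p0*V) = {rhs:.6f}")
```

For this equation of state both α and β come out equal to 1/273.1 and κ equal to 1/p, so that the two sides of the relation agree.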


Ex. 118. A mercury thermometer whose capillary is just filled at 45° C. is
warmed to 50°. What is the pressure in the capillary if the coefficient of expansion
α = 18 × 10^-6 deg.^-1 and the compressibility κ = 39 × 10^-7 atm.^-1?


2. The Equation of State of an Ideal Gas. 

The following empirical relationships have been found to hold for 
gases far removed from their point of liquefaction: 

1. The Law of Boyle and Mariotte. — At constant temperature the 
volume of a gas is inversely proportional to the pressure, i.e. the pro- 
duct of pressure by volume depends only upon the temperature: 

$$p\,V(p, t) = p_0\, V(p_0, t), \qquad (7)$$

where p_0 denotes some standard pressure, say 1 atm.

2. The Law of Gay-Lussac.— The isobaric volume coefficient has 
the same value α = 1/273.1, independent of the temperature, for all
gases. Thus by the defining equation (2) (p. 493),

$$V(p, t) = V(p, 0)\,(1 + \alpha t), \qquad (8)$$

and in particular,

$$V(p_0, t) = V(p_0, 0)\,(1 + \alpha t). \qquad (8')$$

Combining (7) with (8') we obtain

$$p\,V(p, t) = p_0\,(1 + \alpha t)\, V(p_0, 0). \qquad (9)$$

We now change the zero of the temperature scale from the temperature
of melting ice to the temperature −1/α = −273.1° C., and take

$$t = T - 273.1 = T - \frac{1}{\alpha}. \qquad (10)$$

Then we have, writing V_0 for V(p_0, 0),

$$pV = p_0 V_0 \left(1 + \frac{T - 273.1}{273.1}\right) = \frac{p_0 V_0}{273.1}\, T = AT. \qquad (11)$$


We have already referred to this scale of temperature as the absolute 
scale. The constant A, which is equal to 1/273.1 of the product of the
pressure and volume of the gas at 0° C., varies according to the mass 
and nature of the gas under consideration. However, we obtain a 
universal gas equation by using another fact of experience as follows. 

3. The Law of Avogadro . — The densities of two gases which are at 
the same pressure and temperature are proportional to the molecular 
weights. If M is the mass of the gas, the density is M/V and so 

$$\frac{M_1}{V_1} : \frac{M_2}{V_2} = m_1 : m_2, \qquad (12)$$

where m_1, m_2 are the molecular weights. Equation (12) is equivalent to

$$\frac{m_1 V_1}{M_1} = \frac{m_2 V_2}{M_2}.$$

The quantity

$$v = \frac{mV}{M}$$
is the volume occupied by one mol. This, as well as all symbols referred 
to one mol (gramme molecule), will be denoted by small letters. 

If we write the gas equation for one mol (V = v), the constant on
the right becomes the same for all gases, since by Avogadro’s Law, 

one mol of any gas occupies a given volume v 0 at a given temperature 
and pressure. We then have 


$$pv = \frac{p_0 v_0}{273.1}\, T = RT. \qquad (13)$$

The numerical value of this so-called universal gas constant is
R = 8.314 × 10^7 erg/deg. mol = 8310 int. joule/deg. kg.-mol.
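
The volume v_0 occupied by one mol follows at once from the gas equation pv = RT and the numerical value of R. The short sketch below evaluates it at 0° C. and 1 atm., using the value of the atmosphere in dynes/cm.² quoted earlier; the function name is illustrative.

```python
R = 8.314e7      # erg/(deg. mol), value quoted in the text
ATM = 1.013e6    # dynes/cm^2, one atmosphere as quoted in the text
T_ICE = 273.1    # absolute temperature of melting ice, as used in the text


def molar_volume(T, p):
    """Volume in cm^3 of one mol of an ideal gas, from p*v = R*T."""
    return R * T / p


v0 = molar_volume(T_ICE, ATM)
print(f"v0 = {v0:.0f} cm^3 per mol at 0 deg. C. and 1 atm.")
```

The result is about 22.4 litres, the same for every gas that may be treated as ideal.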

Since the mass of one mol is m grammes, where m is the molecular
weight, multiplication by M/m yields the following result for any mass
M of gas:

$$pV = \frac{M}{m}\, RT = nRT, \qquad (13')$$

where n is the number of mols of gas present.

The product pv vanishes for T = 0. This means that an “ideal”
gas which obeys (13) would have zero volume for a finite pressure, or 
would no longer exert pressure, the volume still being finite. The 
former alternative seems impossible to visualize. Now the atomistic 
view reveals at once the nature of the simplification inherent in the 
definition of an ideal gas which leads to such paradoxical results. 
According to the kinetic theory of gases, the pressure exerted by a 
gas on the walls of its container is due to the innumerable impacts of 
the rapidly-moving molecules. The molecules themselves may be 
assumed to be particles having no extension, so that the entire volume 
v is available to every molecule. The result is that at absolute zero, 
where the molecules no longer are in motion, the volume of the gas 
is zero. In reality we know, however, that the colliding molecules 
behave like elastic spheres whose diameter is of the order of 10^-8 cm.,
so that they occupy a small, but nevertheless finite, volume. Further, 
in deriving the gas equation from the kinetic theory, it is assumed 
that the molecules exert no forces on each other. We know, however, 
that large intermolecular forces of attraction exist in liquids and solids 
(cohesive forces) and it is thus probable that such forces exist in the 
gaseous state also, but that they are much feebler, corresponding to 
the greater average distance between molecules. 

We may now derive the equation of state of a mixture of ideal gases. 
Let the masses M_1, M_2, M_3, ... of gases having molecular weights
m_1, m_2, m_3, ... be contained in a volume V. In order to obtain the
equation of state for this mixture we need the following law. 

4. Dalton's Law of Partial Pressures . — The pressure exerted by a 
mixture of gases is equal to the sum of the partial pressures which each 
gas itself would exert if it filled the entire volume. From the kinetic 
theory standpoint this law is a matter of course, since the total effect 
of the collisions with the walls must be equal to the sum of the effects 
due to the separate kinds of molecules. Thus for the first component gas, 


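$$p_1 V = \frac{M_1}{m_1}\, RT = n_1 RT,$$

and similarly,

$$p_2 V = \frac{M_2}{m_2}\, RT = n_2 RT.$$

Adding all such equations and applying Dalton’s Law, we obtain

$$pV = \left(\frac{M_1}{m_1} + \frac{M_2}{m_2} + \frac{M_3}{m_3} + \dots\right) RT = (n_1 + n_2 + n_3 + \dots)\, RT = nRT \qquad (14)$$

and

$$p_1 = \frac{n_1}{n_1 + n_2 + n_3 + \dots}\, p, \ \dots \qquad (14a)$$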

Using this relationship we can define the apparent molecular weight
of the mixture as the molecular weight m̄ of a gas which, for the same
mass M, would satisfy the same equation of state, viz. the equation

$$pV = \frac{M}{\bar{m}}\, RT = \frac{M_1 + M_2 + M_3 + \dots}{\bar{m}}\, RT. \qquad (15)$$

Comparison of (14) and (15) gives the apparent molecular weight of
the mixture as

$$\bar{m} = \frac{M_1 + M_2 + M_3 + \dots}{\dfrac{M_1}{m_1} + \dfrac{M_2}{m_2} + \dfrac{M_3}{m_3} + \dots}. \qquad (16)$$
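
The apparent molecular weight is simply the total mass divided by the total number of mols. The sketch below applies this to dry air, treated as a mixture of nitrogen, oxygen and argon; the composition by mass and the rounded molecular weights are assumed illustrative values, not data from the text.

```python
def apparent_molecular_weight(masses, molecular_weights):
    """Total mass of the mixture divided by the total number of mols."""
    total_mass = sum(masses)
    total_mols = sum(M / m for M, m in zip(masses, molecular_weights))
    return total_mass / total_mols


# Assumed simplified composition of 100 gm. of dry air: N2, O2, Ar.
masses = [75.5, 23.2, 1.3]               # gm.
molecular_weights = [28.0, 32.0, 39.9]   # rounded values

m_air = apparent_molecular_weight(masses, molecular_weights)

# Ratio of each partial pressure to the total pressure, n_i / (n_1 + n_2 + ...).
mols = [M / m for M, m in zip(masses, molecular_weights)]
pressure_fractions = [n / sum(mols) for n in mols]

print(f"apparent molecular weight of air = {m_air:.2f}")
print("partial pressure fractions:", [round(f, 3) for f in pressure_fractions])
```

With these figures the apparent molecular weight comes out close to 29, and the nitrogen accounts for somewhat more than three quarters of the total pressure.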


3. The Equation of State of a Real Gas. 

The atomistic view shows us how to improve the equation of state 
of a gas: it is necessary only to take account of the quantities which 
were neglected in arriving at the equation of an ideal gas. Firstly, we 
must subtract from the macroscopic volume v a quantity b which 
allows for the space occupied by the molecules. This quantity b is four 
times the volume actually occupied by the molecules, which are assumed 
to be spherical. The reason for this is that in a collision the centres of 
two molecules approach no nearer than a diameter, so that for impact 
we may imagine half the molecules to be mere points, while the re- 
mainder occupy eight times their actual volume. Hence, for the 
average, we use four times the true volume of the molecules. 

The cohesive forces operating on any given molecule in the interior 
of the gas cancel out, since they are essentially equal in all directions; 
not so, however, for a molecule near a wall of the vessel. It will be 
drawn back somewhat into the interior of the gas, if we assume no
attraction between the gas molecules and the wall (adhesion). This 
has the effect of a small inward pressure in addition to the observed 
manometric pressure p. This added pressure must be inversely pro- 
portional to the square of the volume, for the force acting on a given 
colliding molecule is proportional to the number of molecules in the 
interior, which, in turn, is proportional to the density of the gas. But 
the pressure increase is also proportional to the number of molecules 
colliding with the walls, and hence again proportional to the density. 

Thus in place of Boyle’s Law we write the following equation for one
mol of gas:

$$\left(p + \frac{a}{v^2}\right)(v - b) = \left(p_0 + \frac{a}{v_0^2}\right)(v_0 - b). \qquad (17)$$

The product is again set proportional to the absolute temperature,
but the constant R is to be replaced by gas constants R' which differ
somewhat for various gases. The resulting equation of state is named
after van der Waals, to whom most of the above considerations are due:

$$\left(p + \frac{a}{v^2}\right)(v - b) = R'T. \qquad (18)$$



In order to obtain a view of the relationships embodied in this formula 
we draw the lines T = const. in the v-p plane (fig. 1). These lines are
called isothermal lines, or simply isotherms. The isotherms of an ideal 
gas are the rectangular hyperbolas pv = const. The greater the value 
of T , the less important are the correction terms in van der Waals’ 
equation. This means that the isotherms for higher values of T are 
not sensibly different from the hyperbolas of an ideal gas. For small 



values of T, on the other hand, the isotherms are of a different type, 
each one having a maximum and a minimum of p. The transition 
between the two types is a curve having a point of inflection with a 
horizontal tangent at which both extrema coincide. If, now, the 
point representing the state of a gas moves inward from large values 
of v along a low isotherm, the pressure increases as the volume is 
decreased. This is necessary in order to reach equilibrium, since we 
may imagine the volume to be decreased by loading a piston which 
confines the gas in a cylinder. After we reach the maximum of the
curve, however, the pressure begins to fall as the volume is further 
decreased. There is an unstable region from A onward, since the 
internal pressure no longer increases if we seek to decrease the volume 
by loading the piston. Actually, this portion of the curve is not tra- 
versed at all; part of the gas begins to liquefy at B , and the pressure 
remains constant as the volume is further decreased. In the meantime, 
more and more gas is liquefied, until it is all in the liquid state at C,
and the remaining branch of the curve gives the connexion between
volume and pressure of the liquid. We shall find later that the hori- 
zontal line is to be so drawn that the two shaded areas are equal (cf. 
Ex. 124, p. 524). 

Curves having no extrema can have no regions of this kind; thus 
is explained the fact that above a certain critical temperature it is im- 
possible to liquefy a gas, no matter how great the applied pressure. 
The corresponding curve is the one having the point of inflection. 
The co-ordinates T_k, p_k, v_k of this point are called the critical temperature,
critical pressure and critical volume. The following equations
hold for these quantities: 

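$$p_k = \frac{R'T_k}{v_k - b} - \frac{a}{v_k^2}, \qquad (19)$$

and since the tangent is horizontal,

$$\left(\frac{\partial p}{\partial v}\right)_T = 0 = -\frac{R'T_k}{(v_k - b)^2} + \frac{2a}{v_k^3}. \qquad (20)$$

Since there is a point of inflection,

$$\left(\frac{\partial^2 p}{\partial v^2}\right)_T = 0 = \frac{2R'T_k}{(v_k - b)^3} - \frac{6a}{v_k^4}. \qquad (21)$$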


These equations are readily solved for the critical values; for
example, division of (20) by (21) yields

$$v_k = 3b, \qquad (22)$$

whence, from (20),

$$T_k = \frac{8a}{27R'b}, \qquad (23)$$

and from (19),

$$p_k = \frac{a}{27b^2}. \qquad (24)$$

Combining the last three equations gives

$$\frac{p_k v_k}{R'T_k} = \frac{3}{8} \qquad (25)$$

at the critical point. For an ideal gas, the expression pv/RT is equal
to unity.
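
The critical values can be verified numerically. The sketch below computes v_k = 3b, T_k = 8a/27R'b and p_k = a/27b² for one pair of constants and confirms by finite differences that the isotherm T = T_k has a horizontal tangent and a point of inflection at v_k. The constants a and b are assumed illustrative values of the order of those quoted for carbon dioxide, and the gas constant is taken in litre-atmospheres as a convenient unit choice.

```python
R = 0.08206  # litre atm./(deg. mol); unit choice assumed for this example


def critical_constants(a, b, R=R):
    """Critical temperature, pressure and volume of a van der Waals gas."""
    v_k = 3.0 * b
    T_k = 8.0 * a / (27.0 * R * b)
    p_k = a / (27.0 * b**2)
    return T_k, p_k, v_k


def vdw_pressure(v, T, a, b, R=R):
    """Pressure of one mol from (p + a/v^2)(v - b) = R*T."""
    return R * T / (v - b) - a / v**2


# Assumed illustrative constants (litre^2 atm./mol^2 and litre/mol).
a, b = 3.59, 0.0427

T_k, p_k, v_k = critical_constants(a, b)

# Central differences along the critical isotherm at v = v_k.
h = 1e-4 * v_k
p_minus = vdw_pressure(v_k - h, T_k, a, b)
p_mid = vdw_pressure(v_k, T_k, a, b)
p_plus = vdw_pressure(v_k + h, T_k, a, b)
slope = (p_plus - p_minus) / (2 * h)
curvature = (p_plus - 2 * p_mid + p_minus) / h**2
ratio = p_k * v_k / (R * T_k)

print(f"T_k = {T_k:.1f} deg. abs., p_k = {p_k:.1f} atm., v_k = {v_k:.4f} litre")
print(f"slope = {slope:.2e}, curvature = {curvature:.2e}, p_k*v_k/(R*T_k) = {ratio:.4f}")
```

Both derivatives vanish to within the accuracy of the differencing, and the ratio p_k v_k / R'T_k equals 3/8 whatever values are chosen for a and b.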

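It is also possible to express the three constants of the van der
Waals equation in terms of the critical constants:

$$b = \frac{v_k}{3}, \qquad a = 3 p_k v_k^2, \qquad R' = \frac{8}{3}\, \frac{p_k v_k}{T_k}. \qquad (26)$$

With these values the equation of state becomes

$$\left(p + \frac{3 p_k v_k^2}{v^2}\right)\left(v - \frac{v_k}{3}\right) = \frac{8}{3}\, \frac{p_k v_k}{T_k}\, T. \qquad (27)$$

If we now introduce the following “reduced variables”

$$\pi = \frac{p}{p_k}, \qquad \omega = \frac{v}{v_k}, \qquad \tau = \frac{T}{T_k}, \qquad (28)$$

we obtain an equation free of the constants of individual gases:

$$\left(\pi + \frac{3}{\omega^2}\right)(3\omega - 1) = 8\tau. \qquad (29)$$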



The existence of a reduced equation of state depends solely upon 
our being able to represent the facts by a formula containing three 
constants.* This is possible only to a limited extent, since many substances
undergo chemical changes as the density increases (e.g. association).
Such changes are not accounted for by the van der Waals
formula or by any other three-constant equation, since they are entirely 
of an individual nature. The existence of a general equation of the 
type (29) is known as the Theorem of Corresponding States. While 
the above considerations show that this theorem may not be valid in 
some cases, it may usually be employed advantageously in dealing 
with chemically similar substances. 
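
The content of the Theorem of Corresponding States can be illustrated with the van der Waals formula itself. In the sketch below two hypothetical gases with different constants a and b are brought to the same reduced volume and reduced temperature; their reduced pressures are then compared with the value given by the reduced equation (π + 3/ω²)(3ω − 1) = 8τ. All numerical constants are invented for the illustration.

```python
def reduced_pressure(omega, tau):
    """Reduced van der Waals equation solved for the reduced pressure."""
    return 8.0 * tau / (3.0 * omega - 1.0) - 3.0 / omega**2


def vdw_pressure(v, T, a, b, R):
    """Pressure of one mol from (p + a/v^2)(v - b) = R*T."""
    return R * T / (v - b) - a / v**2


def reduced_pressure_of_gas(omega, tau, a, b, R):
    """Reduced pressure of a particular gas, computed from its own constants."""
    v_k = 3.0 * b
    T_k = 8.0 * a / (27.0 * R * b)
    p_k = a / (27.0 * b**2)
    return vdw_pressure(omega * v_k, tau * T_k, a, b, R) / p_k


# Two hypothetical gases; the constants are invented for the illustration.
gas_1 = dict(a=3.6, b=0.043, R=0.082)
gas_2 = dict(a=1.4, b=0.039, R=0.082)

omega, tau = 2.0, 1.2
pi_values = [reduced_pressure_of_gas(omega, tau, **gas) for gas in (gas_1, gas_2)]
pi_reduced = reduced_pressure(omega, tau)

print("reduced pressures of the two gases:", pi_values)
print("from the reduced equation:        ", pi_reduced)
```

Both gases give the same reduced pressure, because the individual constants drop out once volume, temperature and pressure are measured in units of their critical values.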

Ex. 119. Derive expressions for the compressibility (or for the volume modu- 
lus of elasticity) of an ideal gas and of a van der Waals gas. 

* A reduced equation of state is still obtained if an individual gas constant is used 
in place of the universal constant R. The former, too, is eliminated in obtaining the 
reduced equation. 



CHAPTER XXX 


The First Law of Thermodynamics: 
The Conservation of Energy 


1. Formulation of the First Law. 

The First Law makes a statement concerning the nature of heat. 
It asserts that heat is a form of energy , and that in a closed system the 
total store of energy , i.e. the sum of the heat energy and mechanical or 
electrical energy , is constant. This is the Principle of the Conservation of 
Energy , formulated by Robert Mayer in 1842. Thus, in a closed system, 
one form of energy may be transformed into another, but the sum 
must remain constant. In the atomistic treatment of the theory of 
heat, we shall see that heat is a manifestation of the kinetic energy of 
the atoms and molecules of the substance, the difference between this 
and the evident kinetic energy of large bodies being the degree of 
organization. If, for example, a vessel of gas is in motion with a 
certain velocity, we attribute to all the molecules an observable 
mechanical energy — the common kinetic energy of motion of the centre 
of gravity. But in addition to this, there is the kinetic energy of the 
molecules, flying about at random and colliding with the walls, which 
we term heat. Since disorganization is the essential property of that 
part of the energy known as heat, one can apply the concepts of the 
theory of heat only to systems composed of many elementary par- 
ticles. A single atom cannot be said to have a temperature. 

If a quantity of heat represents a definite store of energy, there 
must be a definite relationship between the calorie and the erg. It has 
been found experimentally (Joule) that 


1 calorie = 4.184 × 10^7 erg; 1 kg. cal. = 4184 joules.

In what follows we shall assume all energy quantities to be expressed 
in ergs, so that no conversion factors are required in the formulas. 

We can give another formulation of the First Law — one which is 
of a negative nature and which is important on account of its imme- 
diate intuitive nature and its analogy with the remaining laws. Since 
the store of energy of an isolated system is constant, it is impossible 
to construct a machine which does work without obtaining energy from 
an external source. This form of the energy principle is termed the
theorem concerning the impossibility of constructing a perpetuum
mobile of the first kind.


We must obtain a mathematical expression of the First Law. To
this end we introduce the total energy U as a function of the thermo- 
dynamic variables. Evidently this quantity must be a single-valued 
function of the independent variables, say of T and v, for otherwise 
it would be possible to construct a mechanism which creates energy. 
For example, if the energy content were greater when 1 litre of air is first 
warmed from 0° to 100° C. and then compressed to 10 cm. 3 than when 
the same amount of air is first compressed to 10 cm. 3 and then heated 
to 100° C., it would be possible to gain energy by bringing the substance 
back to its initial state by the second process, which requires the 
smaller amount of energy. We must therefore have 


$$U = U(T, V) \quad \text{or} \quad U(T, p) \quad \text{or} \quad U(p, V).$$

Hence the differential

$$dU = \left(\frac{\partial U}{\partial T}\right)_V dT + \left(\frac{\partial U}{\partial V}\right)_T dV$$

is the total differential of a function of the thermodynamic variables.
In contrast with this, the total quantity of heat supplied to a system
is not a unique function of the variables, for it is not necessary that
the part of the energy of the system which exists in the form of heat
be supplied as heat; it may be supplied in part, for example, as
mechanical energy in compression. As a result, a small quantity of heat
supplied to a system is not the total differential of a function of the
thermodynamic variables; it is therefore denoted by δQ instead of by dQ.

[Fig. 1]

The internal energy of a system may be increased by supplying 
energy from the outside, part as heat and part by doing mechanical 
or electrical work. If we take every quantity of energy supplied to 
be positive, regardless of what form it is in, the First Law is 


$$\delta Q + \delta W = dU.$$



The commonest instance in which work is done is the change of volume
accompanying an external pressure p. If every element dS of the surface
experiences a displacement δs, an amount of work

$$\delta W = -\oint p\, dS\, \delta s = -p \oint dS\, \delta s$$

is done. The negative sign is to be used, since energy is supplied to
the system if δs forms an obtuse angle with dS (cf. fig. 1). Now the
integral ∮ dS δs is merely the change in volume dV resulting from
the displacement. Hence in this special case the First Law may be
written

$$\delta Q - p\, dV = dU, \qquad (5)$$

or

$$\delta Q = dU + p\, dV. \qquad (5')$$


This last form may also be interpreted thus: the energy supplied to 
the system in the form of heat goes in part toward increasing the 
internal energy U and in part to performing external work. 

With regard to the pressure, the following may be added: If the 
outside pressure p on the freely movable piston of a gas-filled cylinder 
is equal to the gas pressure, the piston will remain at rest. If the external
pressure is much diminished to a value p', a sudden expansion
will follow and the gas will acquire energy of flow which, however, is 
soon converted into heat because of turbulence. In this instance, then, 
the useful work obtained is smaller than if there had been only an 
infinitely small difference between inside and outside pressure, and 
hence an infinitely slow expansion. The latter condition would ob- 
viously yield the maximum external work, and we term the process a 
reversible one. Unless otherwise remarked, we shall assume reversi- 
bility for all processes discussed. 
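
The loss of useful work in a sudden expansion can be illustrated for an ideal gas. The sketch below compares the external work of an infinitely slow expansion at constant temperature with the work done when the outside pressure is dropped at once to the final pressure p'. It assumes that the gas is ideal, that it ends at its initial temperature in both cases, and that the work of the sudden expansion is p' times the change of volume; the numerical values are likewise assumed.

```python
import math

R = 8.314e7   # erg/(deg. mol)


def work_reversible_isothermal(n, T, V1, V2):
    """External work (erg) of n mols of ideal gas expanding infinitely
    slowly from V1 to V2 at constant temperature T: integral of p dV."""
    return n * R * T * math.log(V2 / V1)


def work_sudden(n, T, V1, V2):
    """External work (erg) when the outside pressure is reduced at once to
    the final pressure p' = n*R*T/V2 and the piston is pushed against p' only."""
    p_final = n * R * T / V2
    return p_final * (V2 - V1)


# Assumed illustrative values: 1 mol at 273.1 deg. abs., volume doubled.
n, T, V1, V2 = 1.0, 273.1, 1.0e4, 2.0e4   # mols, deg. abs., cm^3, cm^3

w_rev = work_reversible_isothermal(n, T, V1, V2)
w_sudden = work_sudden(n, T, V1, V2)

print(f"reversible expansion: {w_rev:.3e} erg")
print(f"sudden expansion:     {w_sudden:.3e} erg")
print(f"fraction of the maximum work obtained: {w_sudden / w_rev:.2f}")
```

For a doubling of the volume the sudden expansion yields 0.5 nRT against 0.693 nRT for the reversible process, the remainder having been dissipated in the turbulent flow.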


2. Specific Heat at Constant Volume and at Constant Pressure. The 
Energy Function of a Gas. 


According to p. 487*, the specific heat per gramme or per mol repre- 
sents that quantity of heat necessary to impart to one gramme or to 
one mol a temperature increase of one degree. One must differentiate, 
however, between the case where heat is supplied at constant volume 
and that where the pressure is held constant while the heat is sup- 
plied. In the first instance no external work is done, while in the 
second, expansion against the outside pressure does mechanical work. 
This distinction is important chiefly for gases, on account of their 
large coefficients of thermal expansion. If we apply the First Law to
the energy supplied, the definition of specific heat per mol yields the
equation

$$c = \frac{\delta q}{dT} = \frac{du + p\, dv}{dT}. \qquad (6)$$


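Now the energy u is a function of the thermodynamic variables, of
which we take T and v to be the independent ones. We then have
for constant volume

$$\delta q = du = \left(\frac{\partial u}{\partial T}\right)_v dT, \qquad (7)$$

and the specific heat at constant volume, per mol, is

$$c_v = \left(\frac{\partial u}{\partial T}\right)_v. \qquad (8)$$

In the second case v is not constant and

$$\delta q = \left(\frac{\partial u}{\partial T}\right)_v dT + \left[\left(\frac{\partial u}{\partial v}\right)_T + p\right] dv = c_v\, dT + \left[\left(\frac{\partial u}{\partial v}\right)_T + p\right] dv. \qquad (9)$$

We now express v in terms of T and p and obtain, for constant
pressure,

$$\delta q = c_v\, dT + \left[\left(\frac{\partial u}{\partial v}\right)_T + p\right] \left(\frac{\partial v}{\partial T}\right)_p dT. \qquad (10)$$

Division by dT yields

$$c_p - c_v = \left[\left(\frac{\partial u}{\partial v}\right)_T + p\right] \left(\frac{\partial v}{\partial T}\right)_p. \qquad (11)$$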


This equation takes a particularly simple form for ideal gases. In 
this case the energy function u{T, v) may be given explicitly. Since 
there are no forces acting between the molecules of an ideal gas, the 
energy must be independent of the distances between molecules, i.e. 
it must be independent of the volume of the gas. Hence, for an ideal
gas

$$\left(\frac{\partial u}{\partial v}\right)_T = 0. \qquad (12)$$

This, together with equation (8), gives the energy function of an ideal
gas as

$$u = \int_0^T c_v\, dT + u_0. \qquad (13)$$

Here u_0 is a constant of integration independent of T and v, the
zero-point energy, which is set equal to zero in the classical thermodynamics.
The fact that the internal energy of an ideal gas is independent
of the volume is experimentally verified by the expansion
experiment of Gay-Lussac. Two large glass bulbs connected by a tube 
fitted with a valve are immersed in a water bath which serves as 
a calorimeter. At the start one blub is evacuated, the other is filled 
with air. When the valve is opened, the pressures in the bulbs are 
equalized, but the water bath is found to experience no change 
in temperature, showing that the internal energy of the gas has 
not changed after the expansion. It is true that there occurre , 



XXX.] THE CONSERVATION OF ENERGY 505 

at first, a transformation of heat energy (disorganized energy) 
into kinetic energy of flow (organized energy), but the latter was 
converted back into heat when the gas impinged on the walls of 
the originally empty bulb. In all, the heat energy of the gas remained 
unaltered. 

The equation of state may be expressed as

$$\left(\frac{\partial v}{\partial T}\right)_p = \frac{R}{p}.$$

Using this and (12) in (11) we have

$$c_p - c_v = R \qquad (14)$$

for ideal gases. This relationship is known as Mayer's equation.

For a real gas the internal energy is, to a certain extent, dependent 
upon the volume, for if there are forces of attraction between mole- 
cules, work is done against these forces during the expansion. The 
greater the intermolecular distances, i.e. the greater the volume, the 
greater is the potential energy corresponding to these forces. Using 
van der Waals’ equation, it is easy to obtain an expression for the 
dependence of internal energy upon volume. If there is a small positive
change of volume $dv$, an amount of work

$$dW = \frac{a}{v^2}\,dv \qquad (15)$$

is done against the cohesive forces. Since the added term for real
gases is always small, we set the internal energy equal to the sum of
a term depending only on the temperature and one depending only
on the volume:

$$u = u_1(T) + u_2(v). \qquad (16)$$

If no energy is supplied from without, the change of the second term
is equal to the work done on the system:

$$du_2 = \left(\frac{\partial u}{\partial v}\right)_T dv = +dW = \frac{a}{v^2}\,dv;$$

hence

$$\left(\frac{\partial u}{\partial v}\right)_T = \frac{a}{v^2}, \qquad (17)$$

and since

$$\left(\frac{\partial u}{\partial T}\right)_v = c_v,$$

$$u = \int_0^T c_v\,dT - \frac{a}{v} + u_0. \qquad (18)$$
If the specific heat is independent of the temperature (this is true
up to about 1000° abs. for all gases except hydrogen), then

$$u = c_v T - \frac{a}{v} + u_0.$$

The constant u 0 is subject to the same remarks as in the case of an 
ideal gas (p. 483). 

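For a van der Waals gas we then have

$$c_p - c_v = \left(p + \frac{a}{v^2}\right) \left(\frac{\partial v}{\partial T}\right)_p. \qquad (19)$$

Differentiation of the equation of state gives

$$\left(\frac{\partial v}{\partial T}\right)_p = \frac{R'}{p + \dfrac{a}{v^2} - \dfrac{2a(v-b)}{v^3}}.$$

Putting this value in equation (19) and using the equation of state,
we obtain

$$c_p - c_v = \frac{R'}{1 - \dfrac{2a(v-b)^2}{v^3 R' T}},$$

or, replacing $R'$ by $R$, we have approximately, since $a$ and $b$ are small,

$$c_p - c_v = R\left(1 + \frac{2a}{RTv}\right) \qquad (20)$$

for a van der Waals gas.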
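
The difference of the specific heats derived above can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes SI units, takes $R' = R$, and uses rough van der Waals constants for nitrogen as illustrative values. The function names are hypothetical. It compares the closed form, the small-$a$, small-$b$ approximation, and a finite-difference evaluation of $(p + a/v^2)(\partial v/\partial T)_p$.

```python
R = 8.314  # gas constant in J/(mol K); R' is assumed equal to R here


def vdw_pressure(T, v, a, b):
    """Pressure of one mol of a van der Waals gas: (p + a/v^2)(v - b) = R T."""
    return R * T / (v - b) - a / v**2


def cp_minus_cv_exact(T, v, a, b):
    """Closed form: R / (1 - 2a(v-b)^2 / (v^3 R T))."""
    return R / (1.0 - 2.0 * a * (v - b) ** 2 / (v**3 * R * T))


def cp_minus_cv_approx(T, v, a):
    """Approximation for small a and b: R (1 + 2a / (R T v))."""
    return R * (1.0 + 2.0 * a / (R * T * v))


def cp_minus_cv_numeric(T, v, a, b):
    """Evaluate (p + a/v^2) (dv/dT)_p using central finite differences."""
    dT, dv = 1e-3, v * 1e-6
    dp_dT = (vdw_pressure(T + dT, v, a, b) - vdw_pressure(T - dT, v, a, b)) / (2 * dT)
    dp_dv = (vdw_pressure(T, v + dv, a, b) - vdw_pressure(T, v - dv, a, b)) / (2 * dv)
    dv_dT_const_p = -dp_dT / dp_dv  # implicit-function rule at constant pressure
    return (vdw_pressure(T, v, a, b) + a / v**2) * dv_dT_const_p


if __name__ == "__main__":
    # Illustrative (assumed) constants, roughly those of nitrogen in SI units.
    a, b = 0.137, 3.87e-5
    T, v = 300.0, 0.0246
    print("exact   :", cp_minus_cv_exact(T, v, a, b))
    print("approx  :", cp_minus_cv_approx(T, v, a))
    print("numeric :", cp_minus_cv_numeric(T, v, a, b))
    print("ideal   :", cp_minus_cv_exact(T, v, 0.0, 0.0))  # reduces to Mayer's R
```

With $a = b = 0$ the closed form reduces to Mayer's equation, which is a quick consistency check on the algebra.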


3. Adiabatic Change. 

So far we have considered processes in which one of the primary 
variables T, p or v was held constant. Instead, a process may be 
characterized by holding constant any function of the thermodynamic 
variables. Adiabatic processes are of this kind. The function which 
is held fixed in this process is the entropy; this function will be dis- 
cussed in the next chapter. Adiabatic processes are, as the name 
indicates, those in which no heat is communicated to or abstracted 
from the system. This condition may be realized in practice in two
ways: either the entire system is enclosed in a heat insulating envelope
or else the processes take place so rapidly that there can be no
appreciable heat transferred by conduction even when the heat insulation
is imperfect.

The First Law furnishes at once a differential equation for the
adiabatic process:

$$du + p\,dv = 0. \qquad (21)$$

If $u$ is expressed in terms of $p$ and $v$, this equation represents a family
of curves known as the adiabatic lines or simply adiabatics. The 
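equation of the adiabatics is especially simple for ideal gases: since
we have

$$du = c_v\,dT \qquad \text{and} \qquad dT = \frac{p\,dv + v\,dp}{R},$$

and since $R = c_p - c_v$, (21) becomes

$$\frac{c_p}{c_v}\,\frac{dv}{v} + \frac{dp}{p} = 0. \qquad (22)$$

Using the symbol $\gamma$ for the ratio $c_p/c_v$ and integrating, there
results

$$p v^{\gamma} = p_0 v_0^{\gamma} = \text{const.} \qquad (23)$$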



Since $\gamma > 1$, the adiabatics are always steeper than the isotherms
($pv$ = const.) in the $pv$-diagram. If we represent the relationship
between $T$, $p$ and $v$ in a rectangular system of co-ordinates, equation
(23) represents the projection on the $pv$-plane of a space curve lying
on the surface $pv = RT$. Making use of the equation of state, we find
that the other projections are given by

$$T v^{\gamma-1} = T_0 v_0^{\gamma-1} = \text{const.} \qquad (24)$$

and

$$T p^{(1-\gamma)/\gamma} = T_0 p_0^{(1-\gamma)/\gamma} = \text{const.} \qquad (25)$$
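
The three forms of the adiabatic equation can be verified against one another with a short calculation. This is a sketch under stated assumptions: one mol of ideal gas, $\gamma = 1.4$, and an arbitrary starting state. The function name is hypothetical.

```python
import math

R = 8.314    # gas constant in J/(mol K)
GAMMA = 1.4  # assumed ratio c_p / c_v (roughly that of air)


def adiabatic_state(p0, v0, v, gamma=GAMMA):
    """Return (p, T) at volume v on the adiabatic through (p0, v0), using (23)."""
    p = p0 * (v0 / v) ** gamma
    T = p * v / R  # ideal gas equation of state for one mol
    return p, T


if __name__ == "__main__":
    p0, v0 = 101325.0, 0.0224  # assumed initial state in Pa and m^3
    T0 = p0 * v0 / R
    p, T = adiabatic_state(p0, v0, 0.5 * v0)  # compress to half the volume
    print("T0 =", T0, " T =", T)
    # Both of the following pairs should agree, as (24) and (25) require.
    print(T * (0.5 * v0) ** (GAMMA - 1), T0 * v0 ** (GAMMA - 1))
    print(T * p ** ((1 - GAMMA) / GAMMA), T0 * p0 ** ((1 - GAMMA) / GAMMA))
```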


Thus $\gamma$ may be determined for an ideal gas from the adiabatic change
in volume. A method which is more general, since it is not restricted
to ideal gases, is that which employs the velocity of sound in the gas.
The relationship

$$v = \sqrt{\frac{\text{modulus of elasticity}}{\text{density}}} \qquad (26)$$

given on p. 183 for longitudinal waves in solid bars is valid also for
fluids if we use the volume modulus of elasticity (see p. 213). If we
insert the numerical values of the density of air and of its modulus of
elasticity, which for an ideal gas is equal to the pressure (Ex. 119, p. 500),
we obtain a value of $v$ which does not agree with observation. Newton
was the first to compute the velocity of sound in this way. The source
of the error in his result was pointed out by Laplace. It is not that air
was incorrectly assumed to be an ideal gas; the discrepancy is of a
much greater order of magnitude. The true reason is that the compression
takes place so rapidly in a sound wave that the resulting heat does
not have time to dissipate itself; hence the process is adiabatic, while
the modulus of elasticity in the example, p. 500, assumes that the
temperature is constant. Now it is readily shown that the adiabatic
modulus of elasticity is equal to $\gamma$ times the isothermal modulus. To
prove this, take $p$ and $v$ as independent variables. The differential
equation of the adiabatics is then, by (5) (p. 503),

$$\left(\frac{\partial u}{\partial p}\right)_v dp + \left[\left(\frac{\partial u}{\partial v}\right)_p + p\right] dv = 0. \qquad (27)$$


On account of equation (8) (p. 504), this gives 




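The existence of an equation of state connecting the variables, say
$f(p, v, T) = 0$, implies

$$\frac{\partial f}{\partial p}\,dp + \frac{\partial f}{\partial v}\,dv + \frac{\partial f}{\partial T}\,dT = 0,$$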

so that, as at (5) (p. 494) 





Thus we need only show that the bracketed expression in (28) is equal 
to c v in order to prove our statement. If we now take T and p as inde- 
pendent variables, we have, by the First Law, at constant pressure, 


whence 



Thus (28) becomes 


. . ( 31 ) 



and so, from the definition of $E$,

$$E_{ad} = \gamma E_{is}. \qquad (31')$$


The value of the adiabatic modulus is obtained from the speed of sound,
while static experiments give the isothermal modulus. Hence $\gamma$ may
be computed.
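
The size of Newton's discrepancy, and the way $\gamma$ follows from a measured speed of sound, can be illustrated numerically. The sketch below is not from the original text: the pressure, density and $\gamma$ of air at 0° C are assumed round values, and the function names are hypothetical.

```python
import math

P_AIR = 101325.0  # Pa; the isothermal modulus of an ideal gas equals its pressure
RHO_AIR = 1.293   # kg/m^3, assumed density of air at 0 deg C
GAMMA_AIR = 1.4   # assumed ratio of specific heats of air


def sound_speed(modulus, density):
    """Velocity of longitudinal waves: sqrt(modulus of elasticity / density)."""
    return math.sqrt(modulus / density)


def gamma_from_sound(speed, isothermal_modulus, density):
    """Ratio of adiabatic to isothermal modulus from a measured speed of sound."""
    return density * speed**2 / isothermal_modulus


newton = sound_speed(P_AIR, RHO_AIR)               # isothermal modulus (Newton)
laplace = sound_speed(GAMMA_AIR * P_AIR, RHO_AIR)  # adiabatic modulus (Laplace)

if __name__ == "__main__":
    print("Newton  (isothermal):", round(newton, 1), "m/s")
    print("Laplace (adiabatic) :", round(laplace, 1), "m/s")
    print("gamma recovered     :", gamma_from_sound(laplace, P_AIR, RHO_AIR))
```

The isothermal value falls short of the adiabatic one by the factor $\sqrt{\gamma}$, i.e. by roughly 15 per cent for air.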

An adiabatic process which has found an important technical 
application in the air liquefying machine (Linde) is the so-called 
throttled expansion of a real gas. In the Gay-Lussac expansion 
experiment (p. 504) the gas in the first bulb attained kinetic energy of 
flow and was at the same time cooled; the energy was again converted 
into heat, however, when the motion was arrested in the second vessel, 
so that within the somewhat large limits of error no net change in 
temperature was observed. The accuracy of observation is increased 
very much if the expansion is regulated so that no energy is converted 
into energy of flow. This is the so-called “ porous plug ” experiment 
of Joule and Thomson (Kelvin). With this arrangement it is not 
necessary to wait for the system to come to equilibrium; rather, the 
temperature of the expanded gas itself may be measured. Since this 
gas has received no appreciable kinetic energy, its temperature is a 
measure of the change of internal energy accompanying the increase 
in volume. The apparatus is represented diagrammatically in fig. 2. 

[Fig. 2: porous plug apparatus, with piston 1 and pressure $p_1$ on the left, throttle D in the middle, and piston 2 and pressure $p_2$ on the right.]

By continuous motion inward of the piston 1 and continuous
withdrawal of piston 2 the pressure $p_1$ is maintained on the left,
the pressure $p_2$ on the right. In
the original experiments the throttle D was a plug of cotton or silk.
If one mol of gas is forced through in this manner, the amount of work
done on the left is $p_1 v_1$, while the gas has done an amount of external
work $p_2 v_2$ on the other side. The increase in the internal energy of
this mol of gas is then the excess of the applied work over the work
obtained from the system:

$$u_2 - u_1 = p_1 v_1 - p_2 v_2$$

or

$$u_2 + p_2 v_2 = u_1 + p_1 v_1. \qquad (32)$$

In this process, then, the "heat function" or enthalpy, or total heat
$h = u + pv$ remains constant. Since each term is a function of the
thermodynamic variables, the enthalpy is also a function of these
variables. The change in temperature $\Delta T$ accompanying a change in
volume $\Delta v$ is readily given for a van der Waals gas:

$$h = u_0 + c_v T - \frac{a}{v} + \left(\frac{R'T}{v-b} - \frac{a}{v^2}\right) v, \qquad (33)$$

$$\Delta h = 0 = c_v\,\Delta T + \frac{2a}{v^2}\,\Delta v + \frac{R'T}{v-b}\,\Delta v - \frac{R'T\,v}{(v-b)^2}\,\Delta v + \frac{R'v}{v-b}\,\Delta T. \qquad (34)$$


Neglecting $b$ as compared with $v$ and making the further approximation
of replacing $R'$ by $R$ and $c_v + R$ by $c_p$, we obtain, after solving
for $\Delta T$:

$$\Delta T = \frac{\Delta v}{v^2 c_p}\,(RTb - 2a). \qquad (35)$$


In this experiment the temperature difference that arises represents
the difference between two opposing processes. Expansion
makes the attraction between molecules produce cooling, since work
must be done against these forces. On the other hand, the space occupied
by the molecules, which is characterized by $b$, is responsible for a
warming effect when the gas expands.

Of greater importance is a knowledge of the change in temperature
as a function of the pressure change $\Delta p$. To obtain an expression for
$v$ in terms of $p$ which is accurate to the same order as (35), we may
replace $v$ by the equivalent expression $RT/p$ from the ideal gas equation
and obtain

$$\Delta T = \frac{\Delta p}{c_p} \left(\frac{2a}{RT} - b\right).$$

Since $\Delta p < 0$, there will be a decrease in temperature as long as

$$T < \frac{2a}{Rb}.$$

The numerical values for air are such that a considerable cooling 
effect is obtained even at 0° C., and this effect is employed in the Linde 
machine to cool the following masses of air. For hydrogen, however, 
there is a temperature rise when the exit temperature is 0° C., hence 
this gas must be pre-cooled with liquid air. 
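
The contrast between air and hydrogen can be made concrete with the crude first-order result $\Delta T = (\Delta p / c_p)(2a/RT - b)$, whose sign reverses at $T = 2a/(Rb)$. The following is a rough sketch only: the van der Waals constants and the molar heat $c_p$ are assumed approximate values in SI units, nitrogen stands in for air, and the function names are hypothetical.

```python
R = 8.314  # gas constant in J/(mol K)

# Approximate van der Waals constants in SI units (assumed illustrative values).
GASES = {
    "nitrogen": {"a": 0.1370, "b": 3.87e-5},   # stand-in for air
    "hydrogen": {"a": 0.02453, "b": 2.65e-5},
}
C_P = 29.1      # J/(mol K), assumed molar heat of a diatomic gas
T_EXIT = 273.15  # 0 deg C in kelvin


def inversion_temperature(a, b):
    """Temperature at which the first-order throttling effect changes sign."""
    return 2.0 * a / (R * b)


def throttling_delta_T(delta_p, T, a, b, c_p=C_P):
    """First-order temperature change for a pressure change delta_p (Pa)."""
    return delta_p / c_p * (2.0 * a / (R * T) - b)


if __name__ == "__main__":
    for name, c in GASES.items():
        t_inv = inversion_temperature(c["a"], c["b"])
        d_t = throttling_delta_T(-1.0e5, T_EXIT, c["a"], c["b"])
        print(f"{name}: inversion at about {t_inv:.0f} K, "
              f"dT per bar of pressure drop at 0 C = {d_t:+.3f} K")
```

With these assumed constants the inversion temperature of nitrogen lies far above 0° C and that of hydrogen below it, in line with the need to pre-cool hydrogen.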

In reality, the approximations made above are quite crude in the neighbour- 
hood of the liquefaction point. If (34) is solved for AT without neglecting any 
terms, the factor of Av when set equal to zero no longer yields a fixed transition 
temperature but an equation connecting T and v — a transition curve. This shows 
that even at lower temperatures there are pressures at which the sign of the 
effect reverses (see following exercise). 

Ex. 120. Find the transition curve in terms of reduced variables. 


4. Application of the First Law to Thermochemistry. 

In all chemical reactions we observe a heat exchange, i.e. the
reaction is accompanied either by the evolution of heat (a so-called
exothermal reaction) or by the absorption of heat (an endothermal
reaction). Most chemical changes are of the former type.
If the reaction takes place in a calorimeter, the heat of reaction may
be measured. Contrary to our former practice of counting heat supplied
to a system as positive, the heat of reaction in thermochemistry
is taken to be positive for exothermal reactions. This will be expressed
by representing the reaction heat by $Q^- = -Q$; $Q^-$ is measured in
thermal units (calories), and is read "Q minus".

If the reaction takes place without doing external work (i.e. for
gases, at constant volume), the heat of reaction gives the change of
energy of the system resulting from the reaction. On the other hand,
if the process takes place at constant pressure, then (practically
only in the case of gases) the external work must also be taken into
account. If the volume of the system is $V_1$ before the occurrence of
the reaction and $V_2$ after, this work amounts to $p(V_2 - V_1)$. Designating
the two heats of reaction by the subscripts $v$ and $p$, as done
above for the specific heats, we have, at constant volume,

$$Q^-_v = U_1 - U_2. \qquad (37)$$

At constant pressure,

$$Q^-_p = U_1 - U_2 - p(V_2 - V_1)$$

or

$$Q^-_p = U_1 + pV_1 - (U_2 + pV_2) = H_1 - H_2. \qquad (38)$$

Thus the heat exchange at constant pressure gives the change in
enthalpy $H$ before and after the reaction. Since it is sufficiently
accurate to consider the gases ideal in thermochemistry, we have

$$pV_1 = n_1 RT, \qquad pV_2 = n_2 RT,$$

and so

$$Q^-_p = Q^-_v + (n_1 - n_2)RT = Q^-_v + \Delta n\,RT, \qquad (39)$$

where $\Delta n$ is the difference in the number of mols of gaseous components
of the system before and after the reaction.

For reactions at constant pressure, the enthalpy is the same as the 
energy content; moreover, in no case do these two quantities differ 
much. We adopt the following notation for thermochemical equations: 
The enthalpy H of one mol of a substance in the liquid state is de- 
noted by the chemical symbol written without brackets, for the solid 
state by the symbol in square brackets, and for the gaseous state in 
parentheses. Thus the thermochemical equation 

    [Pb] + [S] = [PbS] + 18,400 cal

or  [Pb] + [S] - [PbS] = 18,400 cal

means that one mol of solid lead and one mol of solid sulphur together
have an energy content 18,400 cal larger than one mol of lead sulphide.
The main significance of the First Law is that energy content
and enthalpy are functions only of the thermodynamic variables;
hence the energy differences must always be the same, whether the 
reaction consists of a single process or involves intermediate steps. 
This fact was expressed by Hess before the First Law was known. 
Hess's Law enables us to combine thermochemical equations; this
often makes possible the calculation of the heat of reaction in cases 
where it would be extremely difficult to measure, or for reactions which 
are purely hypothetical. In organic chemistry, for example, the heat 
of formation of a complicated compound is determined by combustion, 
first of the compound and then of the constituents. The combustion 
usually takes place in a bomb calorimeter at constant volume. In 
order to show that this method yields a sufficient number of equations 
for calculating the heat of formation we consider the formation of 
liquid benzene from solid carbon and gaseous hydrogen, the process 
occurring at constant pressure. From the reaction in a bomb calori- 
meter (constant volume) we find 


$$\mathrm{C_6H_6} + \tfrac{15}{2}(\mathrm{O_2}) - 6(\mathrm{CO_2}) - 3\,\mathrm{H_2O} = 781{,}000 \text{ cal}.$$


Since the data on the remaining reactions refer to constant pressure 
conditions, we convert the above reaction to constant pressure also. 
In the process under consideration, 3/2 mol of gas vanishes, and since
$R$ is approximately equal to 2 cal per mol per degree, we must add
900 cal to the right member of the preceding equation, assuming room
temperature to be about 300° abs. This quantity of heat is relatively
small compared with the total amount, showing (as indicated above) that
no great error is involved in replacing $Q^-_p$ by $Q^-_v$, especially since
the heat of reaction cannot be measured with any great accuracy. In
addition, we have the following data for the combustions at constant
pressure:

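$$3(\mathrm{H_2}) + \tfrac{3}{2}(\mathrm{O_2}) - 3\,\mathrm{H_2O} = 205{,}200 \text{ cal}$$

and

$$6[\mathrm{C}] + 6(\mathrm{O_2}) - 6(\mathrm{CO_2}) = 566{,}400 \text{ cal}.$$

Increasing the right member of the first of these three equations by
the required 900 cal and subtracting it from the third we obtain

$$6[\mathrm{C}] - \tfrac{3}{2}(\mathrm{O_2}) + 3\,\mathrm{H_2O} - \mathrm{C_6H_6} = -215{,}500 \text{ cal}.$$

Adding the second equation to this,

$$6[\mathrm{C}] + 3(\mathrm{H_2}) - \mathrm{C_6H_6} = -10{,}300 \text{ cal}.$$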

Thus the heat of formation of benzene from carbon and hydrogen is 
negative — energy must be supplied to make the reaction occur. It 
must be mentioned, however, that the heat of formation is not the 
correct criterion for the occurrence of the process, although in general 
(as in the present instance) exothermal reactions are easier to produce
than are endothermal. This is known as Berthelot’s Principle.
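
The bookkeeping in the benzene example can be reproduced mechanically. The sketch below is an illustration rather than anything given in the text: each thermochemical equation is stored as a dictionary of coefficients (all substances moved to the left side) together with its heat in calories. The helper `add_equations` and the species labels are hypothetical names chosen here, and the values $R \approx 2$ cal per mol per degree and $T \approx 300°$ abs. are taken from the passage.

```python
def add_equations(eq1, eq2, factor=1.0):
    """Return eq1 + factor * eq2, where an equation is (coefficients, heat)."""
    coeffs = dict(eq1[0])
    for species, nu in eq2[0].items():
        coeffs[species] = coeffs.get(species, 0.0) + factor * nu
    # Drop substances whose coefficients cancel.
    coeffs = {s: nu for s, nu in coeffs.items() if abs(nu) > 1e-12}
    return coeffs, eq1[1] + factor * eq2[1]


R_CAL = 2.0     # cal/(mol K), rounded as in the text
T_ROOM = 300.0  # degrees absolute, as in the text

# Combustion of liquid benzene in the bomb calorimeter (constant volume).
benzene_v = ({"C6H6": 1, "(O2)": 7.5, "(CO2)": -6, "H2O": -3}, 781_000.0)

# Convert to constant pressure: gaseous mols before minus gaseous mols after.
delta_n_gas = 7.5 - 6
benzene_p = (benzene_v[0], benzene_v[1] + delta_n_gas * R_CAL * T_ROOM)

hydrogen_p = ({"(H2)": 3, "(O2)": 1.5, "H2O": -3}, 205_200.0)
carbon_p = ({"[C]": 6, "(O2)": 6, "(CO2)": -6}, 566_400.0)

step = add_equations(carbon_p, benzene_p, factor=-1.0)  # third minus first
formation = add_equations(step, hydrogen_p)             # plus second

if __name__ == "__main__":
    print("intermediate:", step)
    print("formation   :", formation)
```

Storing every equation with all terms on one side makes Hess's Law a plain linear combination, so cancelling species drop out automatically.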

The First Law of Thermodynamics enables us to find the law of
dependence of heat of reaction on temperature. Consider a reaction taking
place at constant volume,

$$\nu_a a + \nu_b b + \dots = \nu_c c + \nu_d d + \dots, \qquad (40)$$

where $\nu_a, \nu_b, \dots$ are the numbers of mols, and $a, b, \dots$ are the formulae
of the substances entering into the reaction. At a temperature $T$,

$$(\nu_a a)_T + (\nu_b b)_T + \dots - (\nu_c c)_T - (\nu_d d)_T - \dots = q^-_v(T), \qquad (41)$$

and at a temperature $T + dT$,

$$(\nu_a a)_{T+dT} + (\nu_b b)_{T+dT} + \dots - (\nu_c c)_{T+dT} - (\nu_d d)_{T+dT} - \dots = q^-_v(T) + \frac{dq^-_v}{dT}\,dT. \qquad (42)$$

Now at constant volume

$$du = c_v\,dT, \qquad (43)$$

so that

$$(\nu_a a)_T + \nu_a c_{va}\,dT + (\nu_b b)_T + \nu_b c_{vb}\,dT + \dots - (\nu_c c)_T - \nu_c c_{vc}\,dT - (\nu_d d)_T - \nu_d c_{vd}\,dT - \dots = q^-_v(T) + \frac{dq^-_v}{dT}\,dT, \qquad (44)$$

or

$$\frac{dq^-_v}{dT} = \sum_i \nu_i c_{vi}. \qquad (44')$$

In this summation, the number of mols is to be counted positive before
the reaction and negative afterward. Exactly the same equations
result for reactions at constant pressure, only it is necessary to
remember that the heat is supplied at constant pressure, so that $c_v$ is
to be replaced by $c_p$:

$$\frac{dq^-_p}{dT} = \sum_i \nu_i c_{pi}. \qquad (45)$$
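
The sign convention in the summation is easy to get wrong, so a small numerical illustration may help. This is a sketch under stated assumptions: the reaction $\mathrm{H_2} + \tfrac12 \mathrm{O_2} = \mathrm{H_2O}$ (gas), rounded molar heats at constant pressure, and a rounded heat of reaction at 300° abs. are all assumed illustrative values, not data from the text. The function names are hypothetical.

```python
import math


def heat_slope(reactants, products):
    """Sum of nu_i * c_i, with mols counted positive before and negative after."""
    before = sum(nu * c for nu, c in reactants)
    after = sum(nu * c for nu, c in products)
    return before - after


def heat_of_reaction(q_ref, T_ref, T, slope):
    """Linear extrapolation, valid only while the specific heats stay constant."""
    return q_ref + slope * (T - T_ref)


# Assumed rounded molar heats at constant pressure, in cal/(mol K).
C_P = {"H2": 6.9, "O2": 7.0, "H2O_gas": 8.0}

slope = heat_slope(
    reactants=[(1.0, C_P["H2"]), (0.5, C_P["O2"])],
    products=[(1.0, C_P["H2O_gas"])],
)

# Assumed heat evolved at 300 deg abs., in cal per mol of water vapour formed.
q_400 = heat_of_reaction(57_800.0, 300.0, 400.0, slope)

if __name__ == "__main__":
    print("dq/dT =", slope, "cal/K")
    print("q(400) =", q_400, "cal")
```

Over a 100 degree interval the correction is a fraction of one per cent, which is why the specific heats may be treated as constant for small intervals.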


Over a small temperature interval, $c_v$ or $c_p$ may be considered constant;
this is no longer allowable when large intervals are considered. For
example, if we wish to introduce the heats of reaction at absolute
zero instead of those observed at room temperature, it is necessary
to integrate:

$$q^-_v(T) = q^-_v(0) + \sum_i \nu_i \int_0^T c_{vi}\,dT \quad \text{and} \quad q^-_p(T) = q^-_p(0) + \sum_i \nu_i \int_0^T c_{pi}\,dT. \qquad (46)$$

These integrals may be evaluated if the variation of the specific heats
CHAPTER XXXI


The Second Law of Thermodynamics: The Law of Entropy

1. The Carnot Cycle and the Ideal Heat Engine. 

The First Law of Thermodynamics makes a statement concerning 
the energy balance which must be maintained in any transformation 
between mechanical or electromagnetic energy and heat. This law, 
however, does not tell whether or not this transformation will take 
place under given conditions, nor does it say what portion of the 
energy will be transformed. Even before the First Law was discovered 
it was known that any amount of mechanical energy could be com- 
pletely converted into heat at will, e.g. by friction, but only a partial 
conversion of heat into mechanical energy could be realized with the 
aid of heat engines. By studying these relationships, Carnot * was 
able to represent the processes taking place in a heat engine by means 
of a cycle which, as we shall find, represents the best possible heat 
engine. This ideal machine makes use of two large reservoirs of heat 
which are at temperatures $T_1$ and $T_3$ respectively, with $T_1 > T_3$.†
The working substance may be taken to be one mol of an ideal gas 
enclosed in a cylinder equipped with a frictionless piston. By means 
of a frictionless guiding mechanism of some kind, the cylinder may 
be placed in thermal contact with each of the reservoirs in turn, or 
may be completely insulated from them. We provide, further, that 
the processes which the working substance undergoes be reversible. 
By this we mean the following. If the gas is allowed to expand by 
decreasing the external pressure, this must not be done by a sudden, 
finite change of pressure, but in such a way that the external pressure 
differs only infinitesimally from the internal; thus the piston receives 
no appreciable kinetic energy which would require to be removed by 
braking. Under such conditions the process may be carried out equally 
well in the reverse order, provided that the external pressure is a little 
greater than the internal. In this case the work done differs only in 
algebraic sign from that done in the expansion process. It is evident 

* S. Carnot, Réflexions sur la puissance motrice du feu, et sur les machines propres
à développer cette puissance, Paris (1824); Eng. trans. by R. H. Thurston (1890).
† These heat containers are sometimes referred to as source and sink or
as "boiler" and "condenser" respectively.

that the rapidly running engines used in practice are very different
from the ideal Carnot mechanism. 

A cyclic process or cycle is a process in which the machine is again
brought back to its initial state; this is a necessary property of a
machine that is to run arbitrarily long. Let us cause the working
substance to pass through the following cycle (fig. 1):

[Fig. 1: indicator diagram of the Carnot cycle, pressure $p$ against volume $v$.]

1. Connect the cylinder with the source of heat at temperature $T_1$
(the boiler) and reduce the external pressure (e.g. by removing weights
from the piston), thus allowing the gas to expand. In doing so, the
gas does work against the external pressure and the equivalent amount
of energy is taken from the boiler. Since we assumed the boiler to be
very large, we need not consider its temperature drop during the process
and so may take the change in volume of the working substance
to be isothermal. If the gas expands from the volume $v_1$ to $v_2$, the
external work done is*

$$W^-_{12} = \int_{v_1}^{v_2} p\,dv = RT_1 \int_{v_1}^{v_2} \frac{dv}{v} = RT_1 \log\frac{v_2}{v_1}. \qquad (1)$$

2. The cylinder is now removed from the boiler and insulated.
The gas is then allowed to expand adiabatically until its temperature
has dropped to that of the condenser. By equation (24) (p. 507)
the volume $v_3$ at that temperature is given by

$$v_3 = v_2 \left(\frac{T_3}{T_2}\right)^{1/(1-\gamma)} = v_2 \left(\frac{T_3}{T_1}\right)^{1/(1-\gamma)}, \qquad (2)$$

since $T_2 = T_1$. Thus the work done in this adiabatic expansion is

$$W^-_{23} = \int_{v_2}^{v_3} p\,dv = p_2 v_2^{\gamma} \int_{v_2}^{v_3} \frac{dv}{v^{\gamma}} = \frac{R}{\gamma-1}(T_2 - T_3) = \frac{R}{\gamma-1}(T_1 - T_3). \qquad (3)$$

* When, as is frequently convenient, the work done by a system is reckoned
positive, we denote it by $W^-$ ("W minus"), i.e. $W^- = -W$.
3. The cylinder is now placed in thermal contact with the condenser
(at temperature $T_3$) and the gas is isothermally compressed to
a volume $v_4$ such that if it be adiabatically compressed from this point
to the initial volume it attains the initial temperature $T_1$. This
volume is given by

$$v_4 = v_1 \left(\frac{T_4}{T_1}\right)^{1/(1-\gamma)} = v_1 \left(\frac{T_3}{T_1}\right)^{1/(1-\gamma)}, \qquad (4)$$

since $T_4 = T_3$. The work done in this compression is negative, since
it is really supplied to the system. Its magnitude is

$$W^-_{34} = \int_{v_3}^{v_4} p\,dv = RT_3 \log\frac{v_4}{v_3}. \qquad (5)$$

The heat generated is delivered to the condenser. 

4. The cylinder is removed from the condenser, is thermally
insulated and compressed adiabatically to $v_1$. An amount of work

$$W^-_{41} = \int_{v_4}^{v_1} p\,dv = \frac{R}{\gamma-1}(T_4 - T_1) = \frac{R}{\gamma-1}(T_3 - T_1) \qquad (6)$$

is thus done. The same remarks apply to the algebraic sign as in the
third phase of the process.

The machine itself is now in its initial configuration. A certain
quantity of thermal energy has been abstracted from the boiler
(temperature $T_1$) and delivered to the condenser (temperature $T_3$) and, in
addition, a certain amount of mechanical work has been done. This
is given by the sum

$$W^- = W^-_{12} + W^-_{23} + W^-_{34} + W^-_{41},$$

and is represented in the so-called indicator diagram (fig. 1) by the area
enclosed by the path covered in the process. Hence external work is
done when the path is traversed in such a way that the enclosed area is
always on the right, as is evident at once by considering the expansion
from $v_1$ to $v_2$, in which the work done is represented by the area
between the curve $p = f(v)$, the axis of abscissae, and the lines through
points 1 and 2 drawn parallel to the $p$-axis. Adding the amounts of
work done in the individual steps, one sees at once that the amounts
along the adiabatics cancel out and there remains

$$W^- = RT_1 \log\frac{v_2}{v_1} + RT_3 \log\frac{v_4}{v_3}. \qquad (7)$$
Dividing (4) by (2) gives

$$\frac{v_4}{v_3} = \frac{v_1}{v_2}. \qquad (8)$$

Hence

$$W^- = R\,(T_1 - T_3) \log\frac{v_2}{v_1}. \qquad (9)$$

A quantity of heat

$$Q_1 = RT_1 \log\frac{v_2}{v_1}, \qquad (10)$$

equivalent to the work done in the first step of the process, was
removed from the boiler and delivered to the engine. Of this the
fraction

$$\eta = \frac{W^-}{Q_1} = \frac{T_1 - T_3}{T_1} = 1 - \frac{T_3}{T_1} \qquad (11)$$

was converted into useful mechanical work, while the quantity

$$Q_3 = RT_3 \log\frac{v_2}{v_1} = Q_1\,\frac{T_3}{T_1} = Q_1 (1 - \eta) \qquad (12)$$

was delivered to the condenser. The fraction $\eta$ is called the efficiency
of the heat engine. Since all the steps of the process are reversible,
the engine may be used also to extract a quantity of heat $Q_3$ from the
condenser and to deliver the quantity $Q_1$ to the boiler. This requires
an amount of work

$$W = \eta Q_1 = \frac{\eta\,Q_3}{1 - \eta}. \qquad (13)$$


The efficiency of the Carnot engine is the highest attainable by any
means. To show this, suppose that there were an engine of higher
efficiency $\eta' > \eta$. Such an engine would not be
a perpetual motion machine, since it does not violate the First Law.
Let it take the quantity of heat $Q_1$ from the boiler,
converting $\eta' Q_1$ of this into useful work and returning $(1 - \eta')Q_1$ to
the condenser. This quantity, however, may be recovered
immediately by means of a Carnot engine, which, by equation (13),
requires an amount of work

$$\frac{\eta\,(1 - \eta')\,Q_1}{1 - \eta}.$$

In all, the combination has done an amount of work

$$\eta' Q_1 - \frac{\eta\,(1 - \eta')\,Q_1}{1 - \eta} = \frac{(\eta' - \eta)\,Q_1}{1 - \eta},$$

which is positive. The thermal equivalent of this work has been
abstracted from the boiler without the delivery of any of it to the
condenser, which therefore is not required at all. This means that the
mechanism would do mechanical work and would cool the boiler 
without causing any alteration whatsoever in the condition of any 
other body. Since enormous sources of heat are available (e.g. in the 
oceans), a machine of this kind would be fully as important as a per- 
petual motion machine. Experience shows, however, that it is im- 
possible to construct a machine of this kind, and this is precisely what 
the Second Law of Thermodynamics says. 

Before we formulate this law, it must be mentioned that a satis- 
factory definition of the temperature scale is given by equation (11) 
(p. 517). Thus, in order to compare the temperatures of two sources 
of heat, we use them as boiler and condenser of a Carnot engine and 
determine its efficiency. The condenser is at the absolute zero of tem- 
perature if the efficiency is unity. This defines the thermodynamic or 
Kelvin temperature scale, which is identical with that based on the
equation of state of an ideal gas, as is evident from the derivation. By 
means of suitable thermodynamic reasoning and measurements, any 
thermometer may be reduced to the thermodynamic scale. However, 
the details cannot be given here. At not too low temperatures the 
readings of a helium thermometer agree with the thermodynamic scale. 

Ex. 121. Is the heat developed in electric resistors an economical way of 
heating a building? 


2. Formulation and Interpretation of the Second Law. 

In conformity with the result of the previous section we give the 
Second Law the formulation due to Thomson and Planck: 

It is impossible to construct a continuously operating machine which 
does mechanical work and which cools a source of heat without producing 
any other effects. 

This is a fact of experience, just as is the First Law, and can be 
confirmed or disproved only by experiment. We shall find that the 
consequences of the Second Law are exceedingly numerous and that 
the experimental confirmation of these consequences is at the same 
time a verification of the validity of the Second Law, which, moreover, 
has a logical interpretation from the atomistic point of view. 

The first consequence is that certain energy transformations
involving heat are irreversible. A case in point is the conversion of work
into heat by means of friction. If, for example, a vessel containing
a rotating liquid is left to itself the liquid will soon come to rest, since
its kinetic energy is soon converted into heat; this causes a slight
temperature rise which may be measured. It would be an event
absolutely unheard of, however, if a liquid at rest were to start rotating,
its temperature dropping at the same time, although this would not
be a violation of the First Law. If the process opposite to the generation
of heat by friction existed, it would have exactly the property of
a perpetuum mobile of the second kind. If, then, we assume the Second
Law to be valid, such a process does not exist.

Another irreversible process is the isothermal expansion of an ideal 
gas without the performance of external work. This is the process we 
met in the Gay-Lussac experiment (p. 504). If it were possible to com- 
press a gas isothermally without using external work, we could construct 
a perpetuum mobile of the second kind by allowing a gas to expand
isothermally and do external work, the required energy being taken 
from a reservoir in the form of heat; then, using the previously men- 
tioned process, the gas could be compressed again to its original volume 
without requiring any work. 

Expansion without external work is represented also by the dif- 
fusion of two gases. If we remove a partition separating two gases 
in a vessel, each gas ultimately fills the entire vessel, since the entire 
space is available to the molecules of either gas. According to the 
Second Law, this process cannot be reversed without doing work. In 
this process and in the one discussed in the preceding paragraph heat 
enters only indirectly, since no changes in temperature occur. Never- 
theless, the phenomenon of diffusion throws light on the nature of the 
Second Law. We may represent the circumstances by a model as 
follows. Place a large number of white balls, representing the molecules
of the first gas, in the bottom of an urn. Above these place a
layer of coloured balls, representing the molecules of the second gas.
If the urn is now shaken, the two kinds of spheres will mix. It is
improbable to the highest degree (and the improbability increases with
the number of balls) that the shaking will again produce a configuration
in which all the coloured balls are again on top. To be sure,
there is no physical law forbidding such a separation, just as there
is no reason why a number of pieces of type thrown at random
into a box should not arrange themselves so as to spell out a given
sentence, but the Theory of Probability shows that such a
"self-arrangement" is practically out of the question. The remarkable one-
upon the thermodynamic variables. Is it also possible to find a
characteristic function for the Second Law which has this property? The
Carnot cycle gives us a suggestion. After completion of the cycle,
i.e. when the system is again in its initial configuration, we have by
equations (10) and (12) (p. 517),

$$\frac{Q_1}{T_1} - \frac{Q_3}{T_3} = 0.$$

We wish to generalize this relationship so that it applies to any cyclic 
process. Imagine such a process carried out with an arbitrary system. 
The cycle is represented by a closed curve in the $pV$-diagram; the
direction in which this curve is to be described is that corresponding
to external work done by the system. Imagine, now, the $pV$-plane
to be covered by a fine network of adiabatics and isothermals (fig. 2).
If the network is taken arbitrarily fine, the work done may be
represented by the area enclosed by the zigzag line instead of that bounded
by the curve itself. The former path is traced by moving from a point
$P_0$ of the curve to a neighbouring point $P_1$ along an adiabatic as far
as the intersection with the isothermal through $P_1$, and then along this
isothermal to $P_1$, instead of moving directly from one point to the
other along the original curve.

[Fig. 2: closed curve in the $pV$-plane covered by a network of adiabatics and isothermals.]

We specify, in addition, that all changes are to be reversible in 
the sense used in connexion with the Carnot cycle. Further, assume 
that the heat supplied or obtained along the isothermals is not ex- 
changed with the boiler or condenser directly, but that the work 
which can be got from it is really obtained with the help of a Carnot 
engine operating between a body at temperature T 0 and the system 
itself. The Carnot engine delivers to the boiler the heat generated 
along the isothermals, external work being used up in the process. 
Since only a small segment of each isothermal is traversed, the
individual quantities of heat supplied to the system are small and so
are denoted by $\Delta Q_a$. The quantities of heat which the Carnot
engine abstracts from the boiler at temperature $T_0$ are denoted by
$\Delta Q_0$. Part of this heat is converted into mechanical work,
while $\Delta Q_a$ is delivered to the system in question. By equation
(12) (p. 517),

$$\frac{\Delta Q_a}{T} = \frac{\Delta Q_0}{T_0} \quad\text{or}\quad \Delta Q_0 = T_0\,\frac{\Delta Q_a}{T}. \qquad (16)$$



It remains to show that the quantity of heat which must be sup- 
plied along the zigzag path is the same as that supplied along the 
actual curve. To this end we imagine a cycle performed along one 
of the infinitesimal triangles bordering the curve. The work done is 
given by the area enclosed by the triangle, and this vanishes to a 
higher order than the lengths of arc of the curve as the network is 
made finer. Since no heat is supplied along the adiabatic, the sum of 
the heat quantities supplied along the isothermal segment and the 
actual curve must be zero if all sides of the triangle are traversed in 
a given direction; if both segments are traversed in the same sense 
the two quantities of heat must be equal. 

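After completing the cycle, the system under consideration and
the Carnot engine are back in their original configuration, and
summation over all heat exchanges gives

$$\sum \frac{\Delta Q_0}{T_0} = \sum \frac{\Delta Q_a}{T} \qquad (17)$$

or

$$Q_0 = T_0 \sum \frac{\Delta Q_a}{T}.$$

If the segments are very small, the sum may be replaced by an integral:

$$Q_0 = T_0 \oint \frac{dQ_a}{T}, \qquad (18)$$

so that for a closed path the relationship

$$\oint \frac{dQ_a}{T} \le 0$$

holds, since by the Second Law the heat $Q_0$ taken from the single
body at temperature $T_0$ cannot be positive. The sign of equality
applies to a reversible cycle, for which $Q_0$ is zero. In this
instance, the Carnot machine uses just as much work as was done by
the system in its cycle. If the change is reversible between two
states, the integral

$$\int_1^2 \frac{dQ_a}{T}$$
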
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must be independent of the path, for if two curves connecting the 
states 1 and 2 are joined to form a closed cycle, the integral must 
vanish. It follows that, when the initial state 1 is assigned, the 

integral defines a quantity which is a function only of the final state
2. The integral

$$\int \frac{dQ_a\,(\mathrm{rev})}{T},$$

reckoned from an undetermined initial point, is called the entropy
S of the system:

$$S = \int \frac{dQ_a\,(\mathrm{rev})}{T}. \qquad (19)$$

Thus, for a reversible passage from state 1 to state 2,

$$S_2 - S_1 = \int_1^2 \frac{dQ_a}{T}. \qquad (20)$$


For an irreversible change the sign of equality (=) is no longer
appropriate. Whether it is to be replaced by > or by < is determined by
supplementing the process by a reversible one in the opposite
direction, the whole process being cyclic. Since the cycle contains
irreversible steps we must have, according to the statements
immediately following equation (18),

$$\int_1^2 \frac{dQ_a}{T} + S_1 - S_2 < 0 \qquad (21)$$

or

$$S_2 - S_1 - \int_1^2 \frac{dQ_a}{T} > 0. \qquad (21')$$

The positive difference

$$S_2 - S_1 - \int_1^2 \frac{dQ_a}{T}$$

is then a measure of the irreversibility of a given process. It is easy
to see, in special cases, why the value of the integral

$$\oint \frac{dQ_a}{T}$$

decreases for irreversible cyclic processes: Suppose we supply heat 
to the system by means of a Carnot engine, but, on account of 
friction, only a small part is converted into mechanical energy, or 
heat due to friction is generated in the system itself. Then, in order 
to bring the system back to the initial configuration, more heat
must be withdrawn from the system than in the case of a reversible 
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For isolated systems, in which there is no exchange of heat with 
the environment, the integral 

$$\int \frac{dQ}{T}$$

is zero; thus, in any change,

$$S_2 - S_1 > 0.$$

All changes within an isolated system proceed in such a way that the
entropy increases; hence changes will occur in such a system until the
entropy has reached its maximum value.
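
The statement can be checked numerically in a simple case. The following is a minimal sketch, not part of the original treatment: it assumes two masses of the same liquid with a constant specific heat, brought together in a thermally isolated vessel, and evaluates the entropy change of each mass along a reversible path, dS = m c dT/T. The function name and the numerical values are illustrative assumptions.

```python
import math

def mixing_entropy(m1, T1, m2, T2, c=1.0):
    """Entropy change (cal/deg) when two masses of the same liquid reach a
    common temperature in an isolated vessel.

    Assumptions (illustrative): constant specific heat c in cal/(gm deg),
    absolute temperatures, no heat exchanged with the surroundings.
    """
    # Energy balance with equal specific heats gives the final temperature.
    T_final = (m1 * T1 + m2 * T2) / (m1 + m2)
    # Each mass is taken reversibly to T_final: integrate dS = m c dT / T.
    dS1 = m1 * c * math.log(T_final / T1)
    dS2 = m2 * c * math.log(T_final / T2)
    return T_final, dS1 + dS2

# Hypothetical example: 50 gm at 350 deg abs. and 50 gm at 300 deg abs.
T_final, dS = mixing_entropy(50.0, 350.0, 50.0, 300.0)
print(T_final, dS)
```

The warm mass loses entropy and the cold mass gains more than that, so the sum is positive although no heat crosses the boundary of the combined system.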

[Fig. 3: the two telescoping vessels with semi-permeable walls]

Thoughtless application of dS = dQ/T to situations involving no
heat exchange as such may erroneously lead to the conclusion that
dS = 0. Let us illustrate what is meant, using the example of the
diffusion of two gases mentioned earlier. Imagine equal volumes of
the two gases, one above the other. When the separating partition is
removed they will diffuse through each other and nothing else will
happen. This process, occurring spontaneously, is irreversible. In a
less simple way, but reversibly, the same final state can be reached by
making use of the fact that certain membranes are semi-permeable.
Rubber, for example, is readily permeable by carbon dioxide but not
by hydrogen. For a gas that is readily transmitted by the membrane,
the latter simply does not exist. Let the two gases now be placed in
vessels which telescope one into another. The lower vessel has a lid
that is permeable by the gas in the upper vessel, the upper one has a
bottom that freely transmits the gas present in the lower vessel (fig. 3).
The entire apparatus is in a vacuum. Thus no work is required when the
vessels are moved together. When the two containers have been fully
telescoped the gases are mixed, but the volume is half that occupied
when they were mixed by diffusion. If the mixture is now allowed to
expand to the full volume, external work is obtained, but we
must supply the equivalent amount of heat to the mixture in order to
keep the temperature constant. This quantity of heat, divided by T,
gives the change in entropy in the process of irreversible mixing. From
the condition attained by the reversible process, we return to initial
conditions in this way: Using the work done in the expansion we compress
the mixture, return the resultant heat to the heat reservoir and separate
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the gases by means of the selectively permeable membranes with- 
out any expenditure of work. 
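
The entropy of mixing obtained by this reversible route can be put into numbers. The sketch below assumes ideal gases and one mol of each gas (both are assumptions made for illustration): the isothermal expansion of the mixture from half the volume to the full volume requires the heat Q = W = nRT log 2, and Q/T is the entropy change.

```python
import math

R = 1.987  # gas constant in cal/(mol deg)

def isothermal_expansion_entropy(n_mols, volume_ratio):
    """Q/T for the reversible isothermal expansion of an ideal gas.

    The work done, and hence the heat to be supplied, is
    n R T log(V_final / V_initial); dividing by T gives the entropy change.
    """
    return n_mols * R * math.log(volume_ratio)

# Assumed example: one mol of each gas, so two mols expand to twice the volume.
dS_mix = isothermal_expansion_entropy(2.0, 2.0)
print(dS_mix)  # entropy of mixing in cal/deg
```

The result does not depend on the temperature at which the mixing is carried out, as long as the gases may be treated as ideal.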

Entropy is a quantity which, like temperature, may be specified numerically,
in calories per degree. The practice of giving numerical values for this
quantity is less common because earlier there was no fixed zero point for its
measurement. One thus avoided such statements as "The body has a temperature of
10° and an entropy of 300 cal./deg." However, since the formulation of the
Nernst heat theorem (p. 556), a definite reference level for entropy is indicated
and such specifications are quite in order.

Ex. 122. Calculate the change in entropy occurring when 10 gm. of water
at 100° C. are mixed with 20 gm. of water at 15° C.

Ex. 123. In how many points can an adiabatic intersect an isothermal? 

Ex. 124. What determines the position of the line BC in fig. 1 (p. 498)? 


3. Conditions of Equilibrium for Systems under various Conditions. 
Thermodynamic Potentials. 

Let us write (21') in differential form and express dQ in terms of
the First Law, which, of course, holds whether we are dealing with
reversible or irreversible processes:

$$dS - \frac{dU + \delta W}{T} \ge 0, \qquad (21'')$$

where W represents the external work, and the sign of equality applies
when there is reversible conduction of heat. The notation $\delta W$
indicates that the external work need not be a perfect differential of
the thermodynamic variables. In the reversible case, for mechanical
work, $\delta W$ amounts to $+p\,dV$, where p is the equilibrium
pressure, that is, the pressure on the system at any instant. This
furnishes us with a definition of entropy in terms of the thermodynamic
variables in the form of a differential equation:

$$dS = \frac{dU + p\,dV}{T}. \qquad (22)$$

In order to compute the state of equilibrium that a system will 
assume under given external conditions, we proceed as in mechanics 
by using the principle of virtual displacements. For a closed system
characterized by the auxiliary conditions U = const. and V = const.,
$\delta W$ is also zero, and we have

$$\delta S \ge 0$$

with the auxiliary conditions

$$\delta U = 0 \quad\text{and}\quad \delta V = 0.$$

This merely repeats the proposition, given above, that an isolated 
system in equilibrium assumes a condition of maximum entropy. 

In general, however, the systems we deal with are not closed. Most
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laboratory processes, for example, proceed at constant temperature 
because of heat exchange with the surroundings. For T = const. and
a reversible process, (21'') yields

$$\delta W = -d(U - TS) = -dF. \qquad (23)$$

The function F is called the free energy. At constant temperature its 
decrease gives the maximum obtainable external work. Thus the free 
energy has a very vivid interpretation: Of the entire energy content, 
a part is available for any transformation, while the remainder is fixed 
and is converted into heat. In general, according to (21''),

$$\delta W \le -dF. \qquad (24)$$

Now if the volume does not change (isothermal-isometric system) no
work is done, and on account of $\delta W = 0$,

$$0 \le -\delta F \quad\text{or}\quad \delta F \le 0, \qquad (25)$$

with the auxiliary conditions

$$\delta T = 0, \quad \delta V = 0.$$

At equilibrium, an isothermal-isometric system has its minimum value
of the free energy. Equilibrium is characterized by the sign of equality,
because changes will take place spontaneously until the minimum
value is reached and, like any extremum, this is characterized by
the vanishing of the variation.

Equation (23) is valid for a reversible isothermal process; hence, for
a reversible isothermal-isobaric process,

$$\delta W = p\,dV = -d(U - TS),$$

or, since p = const.,

$$d(U - TS + pV) = dG = 0. \qquad (26)$$

The function G is called the Gibbs function or free enthalpy; the latter
name arises from the circumstance that the enthalpy appears in it in
place of the energy. According to (24), in any isothermal system

$$\delta W \le \delta(TS - U),$$

but since $\delta W = \delta(pV)$ and p = const., we have

$$\delta(pV) \le \delta(TS - U),$$

or

$$\delta(U - TS + pV) = \delta G \le 0, \qquad (27)$$

with the auxiliary conditions

$$\delta T = 0, \quad \delta p = 0.$$

An isothermal-isobaric system tends towards a minimum value of the free
enthalpy, and this value characterizes the equilibrium.

The functions introduced here are called thermodynamical potentials. 
The name comes from their formal analogy with the potential in
mechanics. In the same way as the components of force are obtained 
by differentiation of the mechanical potential with respect to the 
co-ordinates, so all other thermodynamic variables are obtained by 
differentiation of the thermodynamic potentials. This makes un- 
necessary the introduction of new constants of integration which 
might give rise to difficulties. The potentials must be given as func- 
tions of two properly selected independent variables. The appropriate 
variables are those which, together with the variation of the potential 
in question, occur in the auxiliary conditions. The variables appropriate 
to each potential are as follows: 


Potential*              Independent Variables

Entropy S               U, V
Free Energy F           T, V
Gibbs Function G        T, p

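We then find the following relationships: From S = S(U, V) it
follows that

$$dS = \left(\frac{\partial S}{\partial U}\right)_V dU + \left(\frac{\partial S}{\partial V}\right)_U dV = \frac{dU + p\,dV}{T}.$$

Comparison of coefficients gives

$$\frac{1}{T} = \left(\frac{\partial S}{\partial U}\right)_V; \qquad \frac{p}{T} = \left(\frac{\partial S}{\partial V}\right)_U.$$

In the same way, it follows from F = F(T, V) that

$$dF = \left(\frac{\partial F}{\partial T}\right)_V dT + \left(\frac{\partial F}{\partial V}\right)_T dV = dU - T\,dS - S\,dT,$$

which, by equation (22) (p. 524) is equal to

$$dU - dU - p\,dV - S\,dT.$$

Comparing coefficients,

$$S = -\left(\frac{\partial F}{\partial T}\right)_V; \qquad p = -\left(\frac{\partial F}{\partial V}\right)_T. \qquad (31)$$
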
* The notation used here for the potentials is largely that of Clausius. Gibbs called
these functions η, ψ, and ζ in the above order. He also denoted the energy
by ε and the absolute thermodynamic temperature by t. See J. W. Gibbs, Collected
Works, 1, New York, Longmans, Green & Co. (1928).

Also, from G(T, p) we find

$$dG = \left(\frac{\partial G}{\partial T}\right)_p dT + \left(\frac{\partial G}{\partial p}\right)_T dp = dU - T\,dS - S\,dT + p\,dV + V\,dp,$$

which is equal to

$$-S\,dT + V\,dp$$

by equation (22). Hence

$$S = -\left(\frac{\partial G}{\partial T}\right)_p; \qquad V = \left(\frac{\partial G}{\partial p}\right)_T. \qquad (32)$$

The relationships derived from H = H(S, p) are less important. It
follows from equations (31) and (32) that the free energy and the
Gibbs Potential satisfy the differential equations

$$F = U + T\left(\frac{\partial F}{\partial T}\right)_V; \qquad G = U + pV + T\left(\frac{\partial G}{\partial T}\right)_p. \qquad (33)$$

These are obtained by substituting the expressions for S in the
equations defining the functions F and G.

such iM °* p ™ sioas earing into these thermodynamic potentials are 

SHI irsrSHf -4 

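4. Relation between the Internal Energy and the Equation of State.

Equation (22) furnishes a relation between the equation of state
p = p(T, v) and the internal energy. With T and v as independent
variables, we have for one mol:

$$ds = \left(\frac{\partial s}{\partial T}\right)_v dT + \left(\frac{\partial s}{\partial v}\right)_T dv = \frac{1}{T}\left[\left(\frac{\partial u}{\partial T}\right)_v dT + \left(\frac{\partial u}{\partial v}\right)_T dv + p\,dv\right]. \qquad (34)$$

Comparing coefficients,

$$\left(\frac{\partial s}{\partial T}\right)_v = \frac{1}{T}\left(\frac{\partial u}{\partial T}\right)_v, \qquad (35)$$

$$\left(\frac{\partial s}{\partial v}\right)_T = \frac{1}{T}\left[\left(\frac{\partial u}{\partial v}\right)_T + p\right]. \qquad (36)$$

If we differentiate (35) with respect to v and differentiate (36) with
respect to T, comparison gives

$$\left(\frac{\partial u}{\partial v}\right)_T = T\left(\frac{\partial p}{\partial T}\right)_v - p. \qquad (37)$$

Using the equation of state of an ideal gas, this relationship leads to
the independence of the internal energy and the volume; if the van der
Waals equation is used we obtain equation (18) (p. 505). From equation
(11) (p. 504), we obtain the general relationship for the difference of
the specific heats:

$$c_p - c_v = T\left(\frac{\partial p}{\partial T}\right)_v \left(\frac{\partial v}{\partial T}\right)_p. \qquad (38)$$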
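
The relation between the internal energy and the equation of state, that the volume derivative of u at constant temperature equals T times the temperature derivative of p at constant volume, less p, can be tried out numerically. The following sketch is only an illustration: the function names, the finite-difference step and the van der Waals constants (roughly those of carbon dioxide in SI units) are assumptions, not values taken from the text.

```python
R = 8.314  # gas constant in J/(mol K)

def p_ideal(T, v):
    """Ideal gas equation of state, p = RT/v (v in m^3 per mol)."""
    return R * T / v

def p_van_der_waals(T, v, a=0.364, b=4.27e-5):
    """Van der Waals equation; a and b are assumed illustrative constants."""
    return R * T / (v - b) - a / v**2

def du_dv(p, T, v, h=1.0):
    """Evaluate T (dp/dT)_v - p with a central difference in T."""
    dp_dT = (p(T + h, v) - p(T - h, v)) / (2.0 * h)
    return T * dp_dT - p(T, v)

ideal = du_dv(p_ideal, 300.0, 1.0e-3)        # vanishes up to rounding
vdw = du_dv(p_van_der_waals, 300.0, 1.0e-3)  # equals a / v**2
print(ideal, vdw)
```

For the ideal gas the result vanishes, which expresses the independence of internal energy and volume; for the van der Waals gas only the term a/v², arising from the attractive forces, survives.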



Ex. 125. In a voltaic cell whose electromotive force is Φ an amount of work
Φ de is done when a quantity of electricity de passes. The expression Φ de thus
appears in place of the mechanical work pdv. The energy change is determined 
by the fact that an amount of heat equal to the heat of reaction per gramme equi- 
valent is generated when 96,496 coulombs of charge have been transported. 
What relationship exists between the heat of reaction, the electromotive force, 
and its temperature coefficient? Hint: Introduce the charge e and the tempera- 
ture T as independent variables. 

Ex. 126. Show that for T and p as independent variables the enthalpy h takes
the place of u and that for $(\partial h/\partial p)_T$ we get a formula analogous to (37).


5. Electrocaloric and Magnetocaloric Phenomena. 

In our considerations of energy relating to electrical phenomena 
we did not take heat energy into account. Strictly speaking, we would 
be concerned with the quantity designated there by u — the free energy; 
but since free energy and true energy differ but little at room temper- 
ature (they are identical at absolute zero), little error is involved. It is 
only when we study the thermal effects connected with polarization 
that this difference must be kept in mind. 

As a starting point we use equation (36'), p. 290, according to
which an amount of energy E dP is supplied to each cubic centimetre of
a body when its polarization is changed. If, in addition, heat is
supplied (but not mechanical energy), then the First Law prescribes that

$$du = \delta q + E\,dP. \qquad (39)$$

We are able to work with merely the magnitudes of the vectors, since 
P and E have the same direction. Comparing this formula with the 
one that holds for mechanical work, we see that — E appears in place 
of p and P replaces v. Inasmuch as the polarization P usually refers 
to unit volume, the formula as it stands is valid for a single cubic centi- 
metre of material. However, it may be thought of as applying to one 
mol if we interpret P to be the polarization per mol. The same equation 
is valid for the magnetic case also. In order to avoid confusion with 
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other quantities, let the magnetization per mol be denoted by l. Since 
the magnetic case is of much greater practical importance, the further 
development will be given in terms of these phenomena. If the me- 
chanical quantities in the equations of the last section are replaced 
by the magnetic ones, we obtain at once the difference of the specific 
heats at constant magnetization and at constant field. To interpret 
the result, consider a paramagnetic substance placed between the 
poles of a magnet and warmed. This lowers its paramagnetic sus- 
ceptibility and so, at constant field strength, its magnetization. In- 
stead, however, we could hold the magnetization constant by increas- 
ing the field. The detailed computation is not difficult (cf. Ex. 127, 
p. 530). 

Of considerable experimental importance in recent years is the 
temperature change accompanying adiabatic processes, because by 
their use we are able to approach closer to the absolute zero than by 
any other means. Since adiabatic processes are characterized by 
constant entropy, we must write down this function for a magnetized 
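body. Transcription of equation (37) yields

$$\left(\frac{\partial u}{\partial l}\right)_T = H - T\left(\frac{\partial H}{\partial T}\right)_l = -T^2\left(\frac{\partial (H/T)}{\partial T}\right)_l. \qquad (40)$$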

In studying paramagnetic susceptibility we found that as long as the
elementary magnets do not influence each other, and hence the Curie
law rather than the Weiss law is valid, l is a function only of H/T.
Reciprocally, H/T is then a function of l alone and so the derivative
on the right side of the above equation vanishes. Thus the internal
energy of a magnetized body is independent of the magnetization just
as the internal energy of an ideal gas is independent of the volume.
We conclude that a material obeying the Curie law corresponds to an
ideal gas. As in the case of gases, the departures from the law, which
manifest themselves in an internal field, become increasingly important
at low temperatures. For an ideal magnetic material, the transcription
of (34) becomes

$$ds = \frac{1}{T}\left[\left(\frac{\partial u}{\partial T}\right)_l dT - H\,dl\right]. \qquad (41)$$

The second term represents the magnetic contribution to the entropy.
The specific heat is to be substituted for the entire first term; and
since we deal with solids or liquids no distinction need be made
between $c_p$ and $c_v$. Note that, according to the above, there is
no magnetic contribution to the internal energy. To evaluate the
magnetic share of the entropy we start with the fact that the
magnetization l of a paramagnetic body is a function of
$x = \mu H/kT$, where μ is the magnetic moment of the atom. Writing
this function as f(x), so that l is proportional to f(x), we have for
one mol

$$-\int \frac{H}{T}\,dl = -R\int x f'(x)\,dx = -R\left[x f(x) - \int f(x)\,dx\right]. \qquad (42)$$

With f(x) = tanh x this becomes

$$-R\,(x \tanh x - \log \cosh x) \equiv \Phi\!\left(\frac{\mu H}{kT}\right). \qquad (43)$$


In an adiabatic process the total entropy is constant, so that

$$s = \int \frac{c}{T}\,dT + \Phi\!\left(\frac{\mu H}{kT}\right) = \text{const.} \qquad (44)$$


On account of the dominance of the first term the function Φ is
negative. If the field is switched on at a given temperature T the
negative term appears and so the first term must increase in order
that the entropy remain constant. This means that the temperature must
increase to a value T + ΔT given by

$$\int_T^{T+\Delta T} \frac{c}{T}\,dT = -\Phi\!\left(\frac{\mu H}{kT}\right). \qquad (45)$$

For the small term Φ(μH/kT) we may use merely the initial temperature
T. Assuming an average value $\bar{c}$ of the specific heat over the
interval ΔT, (45) yields

$$\bar{c}\,\log\frac{T + \Delta T}{T} = -\Phi\!\left(\frac{\mu H}{kT}\right). \qquad (46)$$

Conversely, on suddenly removing the field there must occur a drop 
in temperature whose magnitude is given by this same equation; and 
in the neighbourhood of absolute zero, where other adiabatic processes 
fail, this temperature drop may be of considerable importance, granted 
only that l depends solely on H/T (see above). By this means, temperatures
of only a few hundredths of a degree absolute have been attained.
Since other methods will not work, the measurement of these temper- 
atures is accomplished by means of susceptibility measurements. Here 
the problem arises of computing the true temperatures from readings 
on the Curie scale, the two differing because of the appearance of ferro- 
magnetism. The difficulty has been overcome for a substance such as 
ferric ammonium alum. 
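
The order of magnitude of the effect can be estimated from the magnetic entropy function −R(x tanh x − log cosh x), with x = μH/kT, together with the relation between the mean specific heat and the temperature change. The sketch below is illustrative only: it assumes SI units (field given as a flux density in tesla), a moment of one Bohr magneton per atom, and an arbitrary mean specific heat; none of these numbers comes from the text.

```python
import math

R = 8.314            # gas constant in J/(mol K)
K_B = 1.380649e-23   # Boltzmann constant in J/K
MU_B = 9.274e-24     # Bohr magneton in J/T (assumed atomic moment)

def phi(x):
    """Magnetic entropy per mol, -R (x tanh x - log cosh x), x = mu H / k T."""
    return -R * (x * math.tanh(x) - math.log(math.cosh(x)))

def temperature_change(T, field, c_mean, mu=MU_B):
    """Temperature rise on switching the field on adiabatically.

    Solves c_mean * log((T + dT) / T) = -phi(x) for dT, evaluating phi at
    the initial temperature; removing the field gives a drop of the same
    magnitude under the same approximation.
    """
    x = mu * field / (K_B * T)
    return T * (math.exp(-phi(x) / c_mean) - 1.0)

# Hypothetical numbers: 1 K, 0.1 T, mean specific heat 0.5 J/(mol K).
delta_T = temperature_change(1.0, 0.1, 0.5)
print(delta_T)
```

Because the specific heat of a solid becomes very small near absolute zero while the magnetic term does not, the relative temperature change grows as T falls, which is why the method is effective precisely where other adiabatic processes fail.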

Ex. 127. Compute the difference of the specific heats at constant field and
constant magnetization in a region where l = CH/T.



CHAPTER XXXII 


Application of the Second Law to the Calculation of
the Equilibrium of Thermodynamic Systems


1. Gibbs' Phase Rule: a General Theorem on the Maximum Number of
possible Phases.

Consider a perfectly general system containing α independent
components (substances) in β phases. We shall find presently that β
has a maximum value, determined by the number of independent
components. We indicate the components by subscripts and the phases
by superscripts attached to the letters. Thus the mass of the ith
component in the kth phase is represented by $M_i^k$.

The equilibrium is considered when the system is at a given
temperature and a given pressure; thus we consider it an
isothermal-isobaric system rather than an isolated one. In this case
the Gibbs Potential

$$G = U - TS + pV \qquad (1)$$

is a minimum, according to p. 525. Of the quantities entering here,
T and p are entirely independent of the mass of the system. On the
other hand, U, V and S are proportional to the mass, for given
temperature and pressure. From this, one sees that G is also
proportional to the mass; thus if all masses are doubled, G becomes
twice as great. In general

$$G(qM_1, qM_2, \ldots) = q\,G(M_1, M_2, \ldots). \qquad (2)$$

But this is the definition of a homogeneous function of first degree.
For homogeneous functions of nth degree the Euler theorem

$$M_1 \frac{\partial G}{\partial M_1} + M_2 \frac{\partial G}{\partial M_2} + \ldots = n\,G(M_1, M_2, \ldots) \qquad (3)$$

is valid. In our case this is

$$M_1 \frac{\partial G}{\partial M_1} + M_2 \frac{\partial G}{\partial M_2} + \ldots = G(M_1, M_2, \ldots). \qquad (4)$$

To show this, it is necessary only to differentiate equation (2) with
respect to q and then set q = 1. The right member is a homogeneous
function of first degree in the masses. In order that the left member be
so as well, the derivatives must be homogeneous functions of degree
zero, i.e. the derivatives must be functions only of the ratios of the
masses.
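
Both properties can be tried out numerically on any function that is homogeneous of first degree in the masses. The function used below is purely hypothetical (a linear part plus an ideal-mixing term chosen for illustration) and is not a Gibbs Potential taken from the text; the derivatives are approximated by central differences.

```python
import math

def G(M1, M2):
    """Hypothetical first-degree homogeneous function of two masses."""
    total = M1 + M2
    return (3.0 * M1 + 5.0 * M2
            + M1 * math.log(M1 / total) + M2 * math.log(M2 / total))

def partial(f, args, i, h=1e-6):
    """Central-difference estimate of the partial derivative of f
    with respect to its i-th argument."""
    up, down = list(args), list(args)
    up[i] += h
    down[i] -= h
    return (f(*up) - f(*down)) / (2.0 * h)

M = (2.0, 3.0)
# Euler's theorem for degree one: the sum of M_i * dG/dM_i equals G.
euler_sum = sum(M[i] * partial(G, M, i) for i in range(2))
# The derivative depends only on the mass ratio, so doubling both masses
# leaves it unchanged.
d1 = partial(G, (2.0, 3.0), 0)
d1_doubled = partial(G, (4.0, 6.0), 0)
print(euler_sum, G(*M), d1, d1_doubled)
```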

After these preliminary remarks concerning the properties of the
function G, we return to the question under consideration. The
quantities U, V and S are additive, e.g. the internal energy of a
system is equal to the sum of the energies of its parts. Hence if we
distinguish the Gibbs Potentials of the several phases by accents, we
have for the entire system

$$G = G' + G'' + \ldots + G^{\beta}. \qquad (5)$$

The $G^k$ have, of course, the properties deduced above. Besides, they
depend also on T and p. The condition of equilibrium is

$$\delta G = \delta G' + \delta G'' + \ldots + \delta G^{\beta} = 0, \qquad (6)$$


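with the supplementary conditions that $\delta T = 0$ and $\delta p = 0$. In
addition to this, the mass of each component must remain unaltered;
the distribution of a given substance among the various phases may
change, but the total mass of the substance remains constant. We
then have the following α auxiliary conditions also:

$$\begin{aligned}
\delta M_1' + \delta M_1'' + \ldots + \delta M_1^{\beta} &= 0,\\
\delta M_2' + \delta M_2'' + \ldots + \delta M_2^{\beta} &= 0,\\
&\;\;\vdots\\
\delta M_\alpha' + \delta M_\alpha'' + \ldots + \delta M_\alpha^{\beta} &= 0.
\end{aligned} \qquad (7)$$

Taking the variation of G while T and p are held constant, we obtain
the equation

$$\begin{aligned}
\delta G = 0 ={}& \frac{\partial G'}{\partial M_1'}\,\delta M_1' + \frac{\partial G'}{\partial M_2'}\,\delta M_2' + \ldots + \frac{\partial G'}{\partial M_\alpha'}\,\delta M_\alpha'\\
&+ \frac{\partial G''}{\partial M_1''}\,\delta M_1'' + \frac{\partial G''}{\partial M_2''}\,\delta M_2'' + \ldots + \frac{\partial G''}{\partial M_\alpha''}\,\delta M_\alpha''\\
&+ \ldots + \frac{\partial G^{\beta}}{\partial M_1^{\beta}}\,\delta M_1^{\beta} + \ldots + \frac{\partial G^{\beta}}{\partial M_\alpha^{\beta}}\,\delta M_\alpha^{\beta}.
\end{aligned} \qquad (8)$$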


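Since the variations are not arbitrary, but are subject to the
conditions (7), we apply the method of Lagrangian Multipliers
(cf. p. 115), multiply (7) by $\lambda_1, \lambda_2, \ldots, \lambda_\alpha$
and add them to the principal equation (8). Since the coefficients of
the $\delta M_i^k$ must vanish separately, we obtain the system of
equations

$$\begin{aligned}
\frac{\partial G'}{\partial M_1'} + \lambda_1 &= 0, & \frac{\partial G'}{\partial M_2'} + \lambda_2 &= 0, & \ldots && \frac{\partial G'}{\partial M_\alpha'} + \lambda_\alpha &= 0,\\
\frac{\partial G''}{\partial M_1''} + \lambda_1 &= 0, & \frac{\partial G''}{\partial M_2''} + \lambda_2 &= 0, & \ldots && \frac{\partial G''}{\partial M_\alpha''} + \lambda_\alpha &= 0,\\
&\;\;\vdots\\
\frac{\partial G^{\beta}}{\partial M_1^{\beta}} + \lambda_1 &= 0, & \frac{\partial G^{\beta}}{\partial M_2^{\beta}} + \lambda_2 &= 0, & \ldots && \frac{\partial G^{\beta}}{\partial M_\alpha^{\beta}} + \lambda_\alpha &= 0.
\end{aligned} \qquad (9)$$

* The partial derivatives occurring here are sometimes referred to collectively as
the Gibbs Potentials. Gibbs pointed out that they furnish a quantitative measure of
the concept of chemical affinity. See p. 554.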

These equations, αβ in number, must be satisfied. How many variables
are at our disposal? First there are the α multipliers; then the
derivatives depend on T, p and the (α − 1) mass ratios, i.e. on (α + 1)
variables. Inasmuch as T and p are the same for every phase, we have
in all α + β(α − 1) + 2 free variables. Since the number of equations
can at most equal the number of variables,

$$\alpha\beta \le \alpha + \beta(\alpha - 1) + 2$$

or

$$\beta \le \alpha + 2. \qquad (10)$$


The number of phases may be, at most, two greater than the number of
independent components. When β is equal to α + 2, T and p are
completely determined, since solution of the system of equations yields
definite numerical values for these quantities. Thus a system
consisting of a single substance can have three phases at most. For
example, at a definite temperature and pressure water, water vapour,
and ice can co-exist in thermodynamic equilibrium, but a second form of
ice cannot be included. In the latter instance there could be no
equilibrium; changes would take place until one phase would be entirely
consumed.

If there are but two phases we still have one variable at our
disposal. In a system consisting of water and water vapour the
temperature is at our disposal, but the pressure is uniquely
determined by the temperature and is not a second independent
variable as in the case of a single phase system, say a gas. In general
the difference between the highest possible number of phases and the
number of phases actually present is the number of degrees of freedom,

$$f = \alpha + 2 - \beta. \qquad (11)$$
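
The counting argument is easily put into a form that can be tried on the examples of the text. The following is a minimal sketch; the function names are our own and the check on the number of phases simply enforces the limit of at most two phases more than components.

```python
def max_phases(components):
    """Largest number of coexisting phases: two more than the number of
    independent components."""
    return components + 2

def degrees_of_freedom(components, phases):
    """Degrees of freedom: components + 2 - phases."""
    if phases < 1 or phases > max_phases(components):
        raise ValueError("number of phases incompatible with the phase rule")
    return components + 2 - phases

# Water, water vapour and ice: one substance, three phases (triple point).
print(degrees_of_freedom(1, 3))
# Water and its vapour: the temperature alone may be chosen.
print(degrees_of_freedom(1, 2))
# Salt solution, vapour and ice: two substances, three phases.
print(degrees_of_freedom(2, 3))
```

For a two-component system the count reaches zero with four phases, which corresponds to a quadruple point such as the eutectic point of a salt solution.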

As an example of a system consisting of two substances, consider a
solution of sal ammoniac in water. Assume a very dilute solution, and
take account also of the water vapour above the solution.
If the solution is cooled, ice begins to form at a temperature a little
below 0° C. We now have a third phase: ice. According to the phase
rule there is still one variable free; take this to be the temperature.
The concentration is determined by this variable, for if we continue 
to cool the system, more ice is formed, the solution becomes more 
concentrated and so its freezing-point sinks. The last fact is one of 
experience; a thermodynamic derivation of the result will be given 
on p. 545. Finally, the solution becomes saturated and the solid salt 
also precipitates from solution. The temperature at which this occurs 
is called the eutectic point ; this is the lowest temperature the system 
can attain. At this quadruple point the vapour, solution, ice and salt 
are all in equilibrium. If, on the other hand, we start with a highly 
concentrated solution, the solid salt comes out of solution as the third 
phase when the temperature is lowered. Additional cooling causes 
the concentration to decrease further. Thus the curve of concen- 
tration versus temperature is also the solubility curve. If the eutectic 
point is reached from this side, there is again a simultaneous pre- 
cipitation of ice and salt. A solution having the concentration corre- 
sponding to the eutectic point has the lowest possible freezing-point. 


2. The Vapour Pressure Curve and the Melting-point Curve. 

The method of solving thermodynamic problems which was used in
the preceding section consists in seeking the extremal value of the
function S or F or G, whichever is pertinent to the situation at
hand. The procedure demands little thought but does require some
computational work. It is, however, possible to proceed otherwise by
considering a cyclic process for each special case, and applying to it
the two thermodynamic laws. We shall compare the two methods in
deducing the vapour pressure curve. Let us formulate the problem in
this way: Consider a quantity of liquid in a closed vessel together
with its vapour. The vapour pressure is measured by means of a
manometer. Since we are dealing with a two-phase system involving only
one substance, there will be but a single degree of freedom, which we
take to be the temperature. The volume plays no part, for if we
decrease the volume the pressure does not increase, as it would if a gas
alone were present. Instead, some of the vapour condenses. Using the
device of a cyclic process for determining the relation of p and T, let
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us consider the cycle represented by fig. 1. At a temperature T + dT 
we vaporise one mol of liquid, thus moving a piston forward against 
the pressure p + dp. Next, we relieve the pressure somewhat by de- 
creasing the application of heat and arrive at the temperature T. At 
this temperature we reduce the volume until the vapour has again just 
condensed, after which we apply a small amount of heat to bring the 
substance back to the temperature T + dT. Neglecting all second-order
quantities, as well as the minute quantities of heat involved in
passing between the two neighbouring temperatures, the work obtained
turns out to be $dp\,(v_1 - v_2)$, where the subscript 1 refers to the gas and
2 to the liquid. In the evaporation process an amount of heat equal
to the heat of vaporization $q_{T+dT} \approx q_T$ was communicated. By the
Second Law, a fraction dT/T of this was converted into work, so that

$$\frac{dp}{dT} = \frac{q_T}{(v_1 - v_2)\,T}. \qquad (12)$$

This Clausius-Clapeyron differential equation of the vapour pressure
curve is rigorously true, but before performing the integration we make
some simplifying assumptions. For one thing, we neglect the volume
$v_2$ of the liquid in comparison with that of the vapour; also, we assume
that the equation of state of an ideal gas applies to the vapour; finally,
we work with constant specific heats. First consider the liquid to
evaporate at T = 0 and p = p, and the resulting vapour to be warmed
to T°. Alternatively, let the liquid first be warmed from 0° to T° and
then vaporized. We have

$$q_0 + c_p T = q_T + c_2 T \quad\text{or}\quad q_T = q_0 + (c_p - c_2)\,T, \qquad (13)$$

where $q_0$ is the heat of vaporization at T = 0. Then

$$\frac{dp}{dT} = \frac{[q_0 + (c_p - c_2)\,T]\,p}{RT^2}. \qquad (14)$$

Integrating,

$$\log p = -\frac{q_0}{RT} + \frac{c_p - c_2}{R}\,\log T + \log A, \qquad (15)$$

or

$$p = A\,e^{-q_0/RT}\,T^{(c_p - c_2)/R}. \qquad (16)$$

Since the influence of the last factor is small, the equation shows that
the vapour pressure rises rapidly, almost exponentially, with the
temperature.
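
As a check on the integration, one may compare the slope of the integrated formula, obtained by numerical differentiation, with the slope prescribed by the differential equation for an ideal vapour. The sketch below uses assumed values throughout: the heat of vaporization at absolute zero, the difference of the specific heats (written `dc` for c_p − c_2) and the constant A are illustrative numbers, not data for any particular substance.

```python
import math

R = 8.314  # gas constant in J/(mol K)

def vapour_pressure(T, q0, dc, A=1.0):
    """Integrated formula p = A exp(-q0 / RT) T**(dc / R), dc = c_p - c_2."""
    return A * math.exp(-q0 / (R * T)) * T ** (dc / R)

def slope_from_equation(T, q0, dc, A=1.0):
    """dp/dT = [q0 + dc T] p / (R T^2), the differential equation."""
    return vapour_pressure(T, q0, dc, A) * (q0 + dc * T) / (R * T**2)

def slope_numerical(T, q0, dc, A=1.0, h=1e-3):
    """Central-difference derivative of the integrated formula."""
    return (vapour_pressure(T + h, q0, dc, A)
            - vapour_pressure(T - h, q0, dc, A)) / (2.0 * h)

# Assumed illustrative values: q0 = 5.0e4 J/mol, c_p - c_2 = -40 J/(mol K).
T, q0, dc = 350.0, 5.0e4, -40.0
exact = slope_from_equation(T, q0, dc)
approx = slope_numerical(T, q0, dc)
print(exact, approx)
```

The two slopes agree to within the accuracy of the difference quotient, and the dominant temperature dependence comes from the exponential factor, in keeping with the remark about the small influence of the power of T.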

Let us now deduce the vapour pressure curve by the other method
as well. In order not to repeat exactly what was done, let us take
account here of the temperature dependence of the specific heats. At
equilibrium the temperature is, of course, the same throughout the
liquid-vapour system. Inasmuch as we seek the equilibrium position
for a given value of T and of p, we must find the minimum value of
G. If there are $n_1$ mols of vapour and $n_2$ mols of liquid, the total free
enthalpy is

$$G = n_1 g_1 + n_2 g_2, \qquad (17)$$

where the $g_i$ refer to one mol. The variation is to be taken under the
conditions

$$T = \text{const.}, \quad p = \text{const.}, \quad n_1 + n_2 = \text{const.} \qquad (18)$$

Since the $g_i$ depend only upon T and p, they must remain constant,
on account of the auxiliary conditions. We have thus

$$0 = \delta n_1\, g_1 + \delta n_2\, g_2. \qquad (19)$$

We also have

$$0 = \delta n_1 + \delta n_2. \qquad (20)$$

The method of undetermined multipliers yields

$$g_1 + \lambda = g_2 + \lambda = 0,$$

or

$$g_1 - g_2 = u_1 - u_2 + p\,(v_1 - v_2) + T\,(s_2 - s_1) = 0. \qquad (21)$$


We must now try to compute the separate terms of this equation.
For this purpose we treat the vapour as an ideal gas. In the
neighbourhood of the point of condensation this may involve considerable
error. We also neglect the volume of the liquid in comparison with
that of the vapour in calculating the factor $(v_1 - v_2)$. Under these
conditions, by (13), p. 504,

$$p\,(v_1 - v_2) = p\,v_1 = RT. \qquad (22)$$

We have, in addition,

$$u_1 - u_2 = (u_{1,0} - u_{2,0}) + \int_0^T c_v\,dT - \int_0^T c_2\,dT.^{*} \qquad (23)$$

The expression $(u_{1,0} - u_{2,0})$ represents the value at absolute zero
of the difference in energy between vapour and liquid, i.e. the energy
which must be supplied to the liquid to overcome the internal forces
of attraction. This is then $q_0$, the value at absolute zero of the work
of vaporization (or since it is usually given in calories, the heat of
vaporization). Using equation (23) and Mayer's equation [(14) p. 505], there
results

$$u_1 - u_2 + p\,(v_1 - v_2) = q_0 + \int_0^T c_p\,dT - \int_0^T c_2\,dT. \qquad (24)$$

* For the liquid we start also with the relationship $(\partial u/\partial T)_v = c_v$; in the case of
a liquid or a solid, however, the external work may be neglected, so that the two
specific heats may be considered equal.
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To calculate the expression Ts we turn to the differential equation 
for the entropy: 

$$ds = \frac{1}{T}\,(du + p\,dv).$$

For the liquid we can again neglect the work done in expansion, just
as we need not distinguish between the two specific heats for liquids
and solids. Then we have

$$ds_2 = \frac{c_2\,dT}{T}.$$

Integrating between the limits 0 and T,

$$s_2 = \int_0^T \frac{c_2\,dT}{T} + s_{2,0}. \qquad (25)$$


The quantity $s_{2,0}$ (the entropy of the liquid at the absolute zero) is
undetermined in the classical thermodynamics. We shall find later
that for a homogeneous liquid this quantity must be zero. Expressing
$p$ in terms of $v$ in the term $p\,dv$, we have for the vapour

$$ds_1 = \frac{c_v\,dT}{T} + \frac{R\,dv}{v}.$$

Indefinite integration gives

$$s_1 = \int \frac{c_v\,dT}{T} + R\log v = \int \frac{c_v\,dT}{T} + R\log T + R\log R - R\log p + \text{const.} \qquad (26)$$


Now take the absolute zero as lower limit, and combine all terms not
dependent upon $T$ and $p$ into a single constant $s_{1,0}$; this represents the
entropy of the gas at $T = 0$ and $p = 1$. Since

$$\log T = \int \frac{dT}{T} \quad \text{and} \quad c_p - c_v = R,$$

the entropy at temperature $T$ is given by

$$s_1 = \int_0^T \frac{c_p\,dT}{T} - R\log p + s_{1,0}. \qquad (26')$$


Now the specific heat of an ideal gas does not vanish at $T = 0$ and
so questions of convergence arise in connexion with the integral. In
order to clarify the situation, consider the specific heat $c_p$ to be made
up of a part $c_{pc}$ which is independent of the temperature and another
part $c_{pT}$ which, according to experience, approaches zero rapidly as $T$
decreases:

$$c_p = c_{pc} + c_{pT}. \qquad (27)$$
For $p = 1$, we have for a very small lower limit $T_0$,

$$s_1(T, 1) - s_1(T_0, 1) = \int_0^T \frac{c_{pT}\,dT}{T} + c_{pc}\log T - c_{pc}\log T_0. \qquad (28)$$

Because of the rapid decline in the value of $c_{pT}$, we have written the
lower limit of the integral as zero. Since the contribution of $c_{pT}$ to the
integral is negligible even at $T_0 = 1°$ absolute, we may interpret $s_{1,0}$
approximately as the entropy at $T = 1$ and $p = 1$. Strictly, it is the
limiting value of the difference $s_1(T_0, 1) - c_{pc}\log T_0$ as $T_0$ approaches
zero.*

Replace also $c_p$ by $c_{pc} + c_{pT}$ and substitute (24), (25) and (26') in
equation (21). Solving for $\log p$, we obtain the vapour pressure formula:

$$\log p = -\frac{q_0}{RT} + \frac{1}{RT}\int_0^T c_2\,dT - \frac{1}{RT}\int_0^T c_{pT}\,dT + \frac{1}{R}\int_0^T \frac{c_{pT}\,dT}{T} + \frac{c_{pc}}{R}\log T - \frac{1}{R}\int_0^T \frac{c_2\,dT}{T} + \frac{s_{1,0} - s_{2,0} - c_{pc}}{R}. \qquad (29)$$

If we were to assume constant specific heats we would again obtain
equation (15), p. 535, except that the constant of integration would be
written differently. Of course, differentiation would again yield the
Clausius-Clapeyron equation.

The constant $(s_{1,0} - s_{2,0} - c_{pc})/R$ appearing in (29) is called the
“ chemical constant ” of the gas, since it occurs also when considering 
chemical equilibrium. It is not determinable by the two thermo- 
dynamic laws, but may be evaluated theoretically by calculations 
involving quantum statistics. This constant may be determined 
experimentally by means of the observed vapour pressures, however, 
since all remaining quantities in equation (29) are sufficiently known. 

It is customary here to measure the pressure in atmospheres, to
replace logarithms to base $e$ by ordinary logarithms and to measure
all quantities in thermal units ($R = 1.985$ cal). The result is

$$\log p_{(\text{atm})} = -\frac{q_0}{4.571\,T} + \frac{1}{4.571\,T}\int_0^T (c_2 - c_{pT})\,dT + \frac{1}{4.571}\int_0^T (c_{pT} - c_2)\frac{dT}{T} + \frac{c_{pc}}{1.985}\log T + i. \qquad (30)$$

As we shall find later, $s_{2,0}$ vanishes, so that

$$i = \frac{1}{4.571}(s_{1,0} - c_{pc}) - \log(1.013 \times 10^6).$$
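Formula (30) lends itself to direct numerical evaluation. The following is a minimal sketch, assuming that the temperature-dependent specific heats are supplied as ordinary functions and that a simple midpoint rule is adequate for the integrals; the function names and the default of vanishing $c_2$ and $c_{pT}$ are illustrative assumptions, not part of the text.

```python
import math

R_CAL = 1.985     # gas constant in cal/(mol K), as used in the text
R_LOG10 = 4.571   # R * ln(10) in cal/(mol K), the factor appearing in (30)

def _midpoint_integral(f, upper, steps=4000):
    """Midpoint rule on [0, upper]; it never evaluates f at T = 0."""
    h = upper / steps
    return h * sum(f((k + 0.5) * h) for k in range(steps))

def log10_vapour_pressure_atm(T, q0, c_pc, i,
                              c_2=lambda t: 0.0, c_pT=lambda t: 0.0):
    """Evaluate formula (30); c_2 and c_pT are hypothetical callables in cal/(mol K)."""
    diff = lambda t: c_2(t) - c_pT(t)
    energy_term = _midpoint_integral(diff, T) / (R_LOG10 * T)
    entropy_term = -_midpoint_integral(lambda t: diff(t) / t, T) / R_LOG10
    return (-q0 / (R_LOG10 * T) + energy_term + entropy_term
            + (c_pc / R_CAL) * math.log10(T) + i)
```

With both temperature-dependent specific heats set to zero the function reduces to the constant-specific-heat form, which is a convenient check on any tabulated data fed into it.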


For a monatomic gas of atomic weight $A$ the quantum statistics
(p. 638) gives the value

$$i = -1.59 + \tfrac{3}{2}\log A. \qquad (31)$$


* That this difference has a definite limit is a consequence of the empirically verified
Nernst heat theorem (p. 650).
All the equations derived above are valid also for the transition
from the solid state (subscript 3) to the gaseous state. We need only 
replace the subscript 2 by 3 throughout; q is then the heat of sublima- 
tion. On the other hand, the integrated formulae contain an approxi- 
mation which prevents their application to the transition between the 
solid and liquid states, for here the two volumes v 2 and v 3 are approxi- 
mately equal, so that it is no longer allowable, as it was in the case 
of a gas, to neglect one as compared with the other. The Clausius- 
Clapeyron equation, however, is exact and may be applied also to 
the phenomenon of melting. Inverting the formula, it gives the 
variation of the melting-point with pressure: 


$$\frac{dT}{dp} = \frac{T(v_2 - v_3)}{q_T}.$$



The quantity q T represents the energy which must be supplied to the 
system to bring it from the state of higher subscript to the lower — 
in our case the heat of fusion. Since this is always positive, the sign 
of $dT/dp$ depends only on that of $v_2 - v_3$. If, as is usually the case,
the solid is denser than the liquid, $v_2 > v_3$, and the melting-point rises
with increasing pressure. In the opposite case, e.g. for water, the 
melting-point falls as the pressure increases. 
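The sign rule may be checked numerically. The sketch below evaluates the inverted Clausius-Clapeyron formula in SI units; the molar volumes and heat of fusion used for water are rounded values assumed for illustration only.

```python
PA_PER_ATM = 101325.0

def melting_point_slope(T, v_liquid, v_solid, q_fusion):
    """dT/dp = T (v2 - v3) / q_T in K/Pa, with volumes in m^3/mol and q in J/mol."""
    return T * (v_liquid - v_solid) / q_fusion

# Assumed rounded data for water at its melting-point:
# liquid 18.0 cm^3/mol, ice 19.6 cm^3/mol, heat of fusion 6010 J/mol.
water_slope = melting_point_slope(273.15, 18.0e-6, 19.6e-6, 6010.0) * PA_PER_ATM
print(f"water: dT/dp = {water_slope:.4f} K per atmosphere")
```

With these figures the slope is negative and of the order of a hundredth of a degree per atmosphere, in agreement with the statement that the melting-point of water falls as the pressure increases.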


3. Chemical Equilibrium in a Mixture of Ideal Gases. The Law of Mass 
Action. 

We now consider a system consisting of only one phase (the gaseous)
but of several substances, all of which are ideal gases. The gases
will react with each other chemically to a certain extent, but a chemical
reaction is never complete in the sense of the equation. For
example, if hydrogen and oxygen are allowed to combine to form
water there remains always a certain amount of the original substances
H$_2$ and O$_2$. The fraction of the original material remaining increases
with the temperature. We set ourselves the task of determining the
distribution in the state of equilibrium as a function of the pressure
and temperature. Let the volume of the system be $V$ and let the
reaction be represented by the equation

$$\nu_1 A_1 + \nu_2 A_2 + \ldots \rightleftharpoons \nu_{m+1} A_{m+1} + \nu_{m+2} A_{m+2} + \ldots \qquad (32)$$

We agree always to write the equation in exothermal form, in which
the heat of reaction is positive. In altering the numbers of mols, it
must be remembered that they must satisfy the chemical equation, i.e.
they must be in the ratio of the numbers $\nu_i$. The conversion numbers
of the substances present before the exothermal reaction (i.e. those
on the left) are positive; those on the right, negative. Hence *

$$\delta n_1 : \delta n_2 : \ldots \delta n_m : \delta n_{m+1} : \ldots = \nu_1 : \nu_2 : \ldots \nu_m : \nu_{m+1} : \nu_{m+2} \ldots \qquad (33)$$

Since we investigate the equilibrium at a given temperature and 
pressure, the variation of the function G = U + pV — TS must again 
vanish. The several terms may be computed as follows: The energy 
of the mixture is, naturally, equal to the sum of the energies of the 
components : 

$$U = n_1 u_1 + n_2 u_2 + \ldots = \sum n_i u_i. \qquad (34)$$


For the volume we have, by p. 496,

$$V = \sum n_i v_i = \frac{RT}{p}\sum n_i. \qquad (35)$$




The calculation of the entropy of the mixture as a function of T and 
p is somewhat more difficult. It is true that we know the entropy of 
a single ideal gas (§ 2, p. 537). Moreover, if we have two gases side 
by side, both at the same pressure, their entropy is equal to the sum 
of the individual entropies, but if the gases are allowed to mix, the 
very process of mixing is a self-maintaining one and involves an 
increase of entropy; this process is simply the expansion of both gases 
to the total volume of the vessel, whereby the partial pressure drops 
but the total pressure remains constant. It would seem more plausible 
that the entropy of the mixture should be equal to the sum of the 
entropies of the separate gases when each one occupies the volume of 
the mixture at a pressure equal to its partial pressure, for the process 
described on p. 523 permits the two gases to mix without change of 
entropy in such way that the partial pressure remains the same after 
mixing. 

In order to simplify the calculation, we assume the specific heats 
to be constant and obtain, as in equation (26') (p. 537), 

$$s_i = c_{pi}\log T - R\log p_i + s_{i0}, \qquad (36)$$

where $s_{i0}$ embraces all constant terms, including the term appearing
for constant specific heat, $c_p \log T_0$, as $T_0 \to 0$.

Expressing the partial pressure in terms of the total pressure, we
have, by p. 496,

$$s_i = c_{pi}\log T - R\log n_i - R\log p + R\log\sum n_i + s_{i0} = s_i(T, p) - R\log n_i + R\log\sum n_i \qquad (37)$$

and

$$S = \sum n_i s_i(T, p) - R\sum n_i\log n_i + R\sum n_i\log\sum n_i. \qquad (38)$$

* With this agreement as to sign, the changes $\delta n_i$ signify the numbers of mols
formed in the endothermal reaction.
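The last two terms of (38) together form the entropy of mixing, $-R\sum n_i \log(n_i/\sum n_i)$. A minimal sketch of this quantity follows, assuming natural logarithms and $R$ in joules; the function name is illustrative.

```python
import math

R = 8.314  # gas constant in J/(mol K)

def mixing_entropy(mols):
    """Entropy of mixing from (38): -R * sum n_i log(n_i / sum n_i), in J/K."""
    total = sum(mols)
    return -R * sum(n * math.log(n / total) for n in mols if n > 0)

# One mol each of two different gases at the same T and p.
print(f"{mixing_entropy([1.0, 1.0]):.3f} J/K")
```

For a single gas the expression vanishes, and for any real mixture it is positive, as the text requires of a self-maintaining process.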
The entropy of the $i$th gas at a temperature $T$ and pressure $p$ is
denoted by $s_i(T, p)$. Multiplying (35) by $p$, adding it to equation (34)
(p. 540) and subtracting (38) multiplied by $T$, and making use of the
fact that $RT = pv_i$, we obtain the Gibbs Potential of the system:

$$G = \sum n_i g_i(T, p) + RT\sum n_i\log n_i - RT\sum n_i\log\sum n_i, \qquad (39)$$


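where the $g_i(T, p)$ denote the Gibbs Potentials of the separate gases at
temperature $T$ and pressure $p$. In order to determine the equilibrium
it is necessary to take the variation of $G$ with the auxiliary conditions
(33) (p. 540), as well as with $\delta T = \delta p = 0$. Since the $g_i$ depend only
upon $T$ and $p$ their variations vanish and we obtain

$$\delta G = 0 = \sum \delta n_i\, g_i + RT\sum \delta n_i\log n_i + RT\sum \delta n_i - RT\sum \delta n_i\log\sum n_i - RT\sum \delta n_i, \qquad (40)$$

or, on account of (33),

$$\sum \nu_i\log n_i - \sum \nu_i\log\sum n_i = -\frac{1}{RT}\sum \nu_i g_i(p, T), \qquad (41)$$

or, since $p_i = p\,n_i/\sum n_i$,

$$\sum \nu_i\log p_i = -\frac{1}{RT}\sum \nu_i g_i(p, T) + \sum \nu_i\log p = \log K_p(p, T). \qquad (42)$$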


It remains only to calculate the right-hand member. By equations
(22) and (23) (p. 536) and equation (36) we have, for $c_p$ constant,

$$-\sum \nu_i g_i(p, T) + RT\sum \nu_i\log p = -\sum \nu_i u_{i0} - \sum \nu_i c_{vi} T - \sum \nu_i RT + T\log T\sum \nu_i c_{pi} - RT\log p\sum \nu_i + T\sum \nu_i s_{i0} + RT\log p\sum \nu_i$$

$$= -\sum \nu_i u_{i0} - T\sum \nu_i c_{pi} + T\log T\sum \nu_i c_{pi} + T\sum \nu_i s_{i0}. \qquad (43)$$

According to our convention regarding the sign, $\sum \nu_i u_{i0}$ represents
the difference in energy at $T = 0$ of the system before and after the
reaction (written in exothermal form), i.e. the heat of reaction $\bar q_0$ at
the absolute zero. After dividing by $RT$ we note the presence of the
combination $(s_{i0} - c_{pi})/R$, the “chemical constant” $i_i$ of the individual
gases, which we already met in connexion with the vapour
pressure curve. Hence, by analogy with equation (30) (p. 538),

$$\log K_{p\,(\text{atm})} = -\frac{\bar q_0}{4.571\,T} + \frac{\sum \nu_i c_{pi}}{1.985}\log T + \sum \nu_i i_i, \qquad (44)$$


or, using a constant $B$ which depends only upon the $i_i$, we have, similar
to (16) (p. 535),*

botwLXtic'nc^The {KiSSSSf *" Pra0ti0al whU ° < 18 > <«'> 
$$\prod p_i^{\nu_i} = \frac{p_1^{\nu_1} p_2^{\nu_2} \ldots}{p_{m+1}^{\nu_{m+1}} p_{m+2}^{\nu_{m+2}} \ldots} = B\,e^{-\bar q_0/RT}\,T^{\sum \nu_i c_{pi}/R} = K_p(p, T). \qquad (44')$$

Equation (44') expresses the celebrated Law of Mass Action, deduced 
by Guldberg and Waage from kinetic considerations. These investi- 
gators, however, were not able to give an expression for the mass 
action “ constant ” K p , which really depends on T and p, and so is a 
constant only at a given temperature and pressure. The name “ Law 
of Mass Action ” comes from the fact that a change in the mass (and 
hence the partial pressure) of one of the factors in the reaction imme- 
diately causes a change in the partial pressures of the others. If, for 
example, we have an equilibrium between an exothermal compound 
1-2, (subscript 3), with its two constituents 1 and 2, and if T and p 
remain constant, the addition of one of the constituents immediately 
causes the partial pressure of the compound to increase, since the 
fraction $p_1 p_2/p_3$ remains constant. This means that the addition of
one of the products of dissociation lowers the degree of dissociation.* 
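The lowering of the degree of dissociation can be made concrete by solving the condition $p_1 p_2/p_3 = K_p$ numerically. The sketch below assumes one mol of the compound initially, an optional excess of constituent 1, fixed total pressure, and a plain bisection; all names and numerical values are illustrative.

```python
def dissociation_degree(K_p, p, extra_A=0.0, tol=1e-12):
    """Degree of dissociation alpha of 1 mol of compound at total pressure p.

    Solves p_A * p_B / p_AB = K_p, with extra_A mols of constituent A added.
    """
    def quotient_minus_K(alpha):
        n_AB, n_A, n_B = 1.0 - alpha, alpha + extra_A, alpha
        total = n_AB + n_A + n_B
        # Partial pressures are mol fractions times the total pressure.
        return p * n_A * n_B / (n_AB * total) - K_p

    lo, hi = 0.0, 1.0
    while hi - lo > tol:          # the quotient rises monotonically with alpha
        mid = 0.5 * (lo + hi)
        if quotient_minus_K(mid) > 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

alpha_pure = dissociation_degree(1.0, 1.0)
alpha_with_excess = dissociation_degree(1.0, 1.0, extra_A=1.0)
print(alpha_pure, alpha_with_excess)
```

Without any excess the result agrees with the closed form $\alpha = \sqrt{K_p/(K_p + p)}$; with one mol of a constituent added the computed $\alpha$ is smaller.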
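For many purposes it is more useful to write the Law of Mass
Action not in terms of partial pressures but in terms of relative
concentrations $c_i$ (mol fractions) which are defined by

$$c_i = \frac{n_i}{\sum n_i}. \qquad (45)$$

Using $p_i = c_i p$ and $\nu = \sum \nu_i$, there follows

$$\prod c_i^{\nu_i} = \frac{c_1^{\nu_1} c_2^{\nu_2} \ldots}{c_{m+1}^{\nu_{m+1}} c_{m+2}^{\nu_{m+2}} \ldots} = B\,e^{-\bar q_0/RT}\,T^{\sum \nu_i c_{pi}/R}\,p^{-\nu} = K_c(p, T). \qquad (46)$$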

m + l m + 2 


For reactions which involve no change in the number of mols, $\nu$ becomes
zero and hence $K_c = K_p$ and the equilibrium does not depend
on the pressure. If one compares equation (44') with the vapour 
pressure formula (16) (p. 535) deduced under the same simplifying 
assumptions (constancy of specific heats), the similarity becomes 
evident. This is not fortuitous. In the following sections we shall 
derive a generalized law of mass action for polyphase systems, in 
which category the simple process of evaporation of a single substance 
is also included. As in the case of the vapour pressure formula we 
can set up differential relationships for the law of mass action which 
are free of approximations. Using the mol fractions $c_i$, (41) becomes

$$\sum \nu_i\log c_i = -\frac{1}{RT}\sum \nu_i g_i = \log K_c(p, T). \qquad (47)$$


* This result, as well as many others, both physical and chemical, comes within
the scope of a general law given by Le Chatelier: If some stress (e.g. a change of pressure,
temperature, concentration, &c.) is brought to bear on a system in equilibrium,
the equilibrium is displaced in the direction which tends to undo the effect of the stress.
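Since $(\partial g_i/\partial T)_p = -s_i$ (cf. p. 527),

$$\left(\frac{\partial \log K_c}{\partial T}\right)_p = \frac{1}{RT^2}\sum \nu_i g_i + \frac{1}{RT}\sum \nu_i s_i = \frac{1}{RT^2}\sum \nu_i(g_i + Ts_i) = \frac{1}{RT^2}\sum \nu_i(u_i + pv_i). \qquad (48)$$

But the expression $\sum \nu_i(u_i + pv_i)$ is the heat of reaction at constant
pressure, $\bar q_p$, by equation (38) (p. 511). Hence

$$\left(\frac{\partial \log K_c}{\partial T}\right)_p = \frac{\bar q_p}{RT^2}. \qquad (48')$$

In the same way, if we denote by $v$ the change in volume occurring in
the reaction (which is determined by holding the pressure constant),
i.e. $\sum \nu_i v_i$, then we have, since $(\partial g/\partial p)_T = v$,

$$\left(\frac{\partial \log K_c}{\partial p}\right)_T = -\frac{v}{RT}. \qquad (49)$$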
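The temperature relation $\partial \log K_c/\partial T = \bar q_p/RT^2$ can be integrated at once if the heat of reaction is treated as constant over the interval, which is an assumption made here purely for illustration. A minimal sketch with natural logarithms and $R$ in joules:

```python
import math

R = 8.314  # gas constant in J/(mol K)

def equilibrium_constant_ratio(q_p, T1, T2):
    """K_c(T2) / K_c(T1) for a constant heat of reaction q_p in J/mol.

    Obtained by integrating d(log K_c)/dT = q_p / (R T^2) from T1 to T2.
    """
    return math.exp(q_p / R * (1.0 / T1 - 1.0 / T2))

# Hypothetical reaction with q_p = 50 kJ/mol, warmed from 300 K to 310 K.
print(equilibrium_constant_ratio(50000.0, 300.0, 310.0))
```

Since the reaction is written in exothermal form, $K_c$ grows with temperature, i.e. the equilibrium shifts towards the substances on the left, in keeping with the remark that the fraction of original material remaining increases with the temperature.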



4. Chemical Equilibrium in a System consisting of Dilute Solutions and 
Ideal Gases. 

Those multiple phase systems which contain, besides the gaseous
phase, dilute solutions, i.e. phases in which one substance (subscript 0),
the solvent, is present in far larger quantity than any of the others,
lend themselves to detailed investigation. We consider a solution to be
dilute in the thermodynamical sense if further addition of solvent
causes no decrease in volume and if no heat exchange (heat of dilution)
is detectable with further dilution. The more nearly these conditions
are fulfilled, the better our conclusions will agree with experience.
We again examine the equilibrium at a given temperature and pressure
and must therefore find the extreme value of the function $G$.
The separate terms may be calculated as follows: Denoting the solvent
by the subscript 0, the internal energy is given by

$$U = U(p, T, n_0, n_1, n_2, \ldots). \qquad (50)$$

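On account of the smallness of the mol numbers $n_1, n_2, \ldots$ we can
develop this function in a Taylor's series and obtain

$$U = U_0 + n_1\left(\frac{\partial U}{\partial n_1}\right)_0 + n_2\left(\frac{\partial U}{\partial n_2}\right)_0 + \ldots \qquad (51)$$

$U_0$ and the derivatives depend only on the number of mols of the
solvent and are to be taken at $n_1 = n_2 = \ldots = 0$. Writing $u_i$ for
$\partial U/\partial n_i$ and $n_0 u_0$ for $U_0$, we have

$$U = n_0 u_0 + n_1 u_1 + n_2 u_2 + \ldots \qquad (52)$$

This presupposes that the mutual energy of the dissolved particles is
small enough for us to break off the
series after the linear terms. According to Coulomb’s Law, the mutual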
energy of charged particles (e.g. the ions in electrolytes) is large on 
account of their relatively great charge, even when the particles are 
far apart. Hence our results will not be applicable to electrolytes. 

In the same way we make the analogous abbreviation $v_i = \partial V/\partial n_i$,
and have

$$V = n_0 v_0 + n_1 v_1 + n_2 v_2 + \ldots \qquad (53)$$

According to our convention, $u_i$ and $v_i$ represent the partial energies
and partial volumes respectively of the dissolved substances per mol.

If we were to develop the entropy function in the same way, we
would have

$$S = n_0 s_0 + n_1 s_1 + n_2 s_2 + \ldots, \qquad (54)$$

which would represent the entropy of a system consisting of n 0 mols 
of solvent side by side with n { mols of the solutes, but the mixture of 
the substances would not be represented. Hence there must be, in 
addition, a function of the mol numbers which does not depend on T 
or p. This function is determined if we know its form for any one state 
of the system. According to p. 541, the value of this function is
$-R\sum n_i\log c_i$ for the gaseous state. This value must be retained for
all temperatures and pressures. Thus, to a first approximation, we
have the same expression as that derived for gases in § 3 (p. 541):

$$G = \sum n_i g_i(T, p) + RT\sum n_i\log c_i. \qquad (55)$$

If the solvent does not take part in the reaction we set $\nu_0 = 0$, and
the chemical equilibrium in a dilute solution is governed by

$$\prod c_i^{\nu_i} = K_c(p, T). \qquad (56)$$

The dependence of the equilibrium constant $K_c$ on temperature is
again given by equation (48') (p. 543). There is no variation with
pressure, since no change of volume occurs. The most important
reaction which takes place in the solution is that of electrolytic dis- 
sociation, but in this instance the inter-ionic forces cannot be 
neglected. This is why strong electrolytes, i.e. those which are highly 
dissociated, do not obey the law of mass action. In such solutions the 
interactions give the impression that the dissociation is incomplete, 
while actually it is practically complete at great dilution. On the 
other hand, weak electrolytes— especially organic ones— conform 

very well to the mass action law. 

We can now investigate the equilibrium of multiple phase systems, 

also, provided they consist of ideal gases or dilute solutions. As a 
special case, of course, individual phases may consist each of a single 
substance. We must rule out the presence of liquid and solid phases, 
which comprise several substances in approximately equal concen- 
tration, since it is then no longer allowable to terminate the Taylor 
expansion with first order terms. We again distinguish the phases
by dashes and superscripts, and have for the $G$ function of the entire
system

$$G = \sum n_i' g_i'(p, T) + \sum n_i'' g_i''(p, T) + \ldots + RT\left(\sum n_i'\log c_i' + \sum n_i''\log c_i'' + \ldots\right)$$

$$= \sum_k\sum_i n_i^{(k)} g_i^{(k)}(p, T) + RT\sum_k\sum_i n_i^{(k)}\log c_i^{(k)}. \qquad (57)$$

The determination of the extremal value of G under the usual auxiliary 
conditions leads, exactly as above, to the equation of the generalized 
law of mass action, which also includes transformations between 
individual phases: 

$$\sum_k\sum_i \nu_i^{(k)}\log c_i^{(k)} = -\frac{1}{RT}\sum_k\sum_i \nu_i^{(k)} g_i^{(k)}(p, T) = \log K_c(p, T). \qquad (58)$$

The same equations are valid for the derivatives of the function log K c 
as in the case of the special mass action rule for single-phase systems. 

We now consider a two-phase system whose liquid phase is a dilute 
solution and whose solid phase is the pure solvent. Characterizing the 
solvent by the subscript 0, we have, for the liquid phase, 

$$c_1' = \frac{n_1'}{n_0' + n_1'}, \qquad c_0' = \frac{n_0'}{n_0' + n_1'} = 1 - c_1', \qquad (59)$$

and for the solid phase

$$c_1'' = 0, \qquad c_0'' = 1. \qquad (59')$$

The reaction in question is, in the exothermal formulation, the passage
of one mol of solvent from the solution into the solid phase, whereby 
the heat of fusion is liberated as the “ heat of reaction ” q~. Hence 
the condition of equilibrium is * 

$$\log(1 - c_1') - \log 1 = \log K_c(p, T). \qquad (60)$$

At a given pressure, e.g. atmospheric pressure, this is a relation between
the equilibrium temperature $T$, i.e. the melting-point, and the
concentration of the solution. For the pure solvent alone, $c_1' = 0$ and
$T = T_0$, so that $\log K_c(p, T_0)$ becomes zero, since $\log 1$ occurs on the left.
At constant pressure, $\log K_c(p, T)$ is developed in a Taylor's series
which can be terminated after the first term; with the approximate value
$\log(1 - c_1') \approx -c_1'$ we have

$$-c_1' = (T - T_0)\left(\frac{\partial \log K_c}{\partial T}\right)_{T_0} = (T - T_0)\frac{\bar q_p}{RT_0^2},$$

$$T - T_0 = -c_1'\,\frac{RT_0^2}{\bar q_p} \approx -\frac{n_1'}{n_0'}\,\frac{RT_0^2}{\bar q_p}. \qquad (61)$$

Since the conversion of one mol of liquid solvent to the solid state
liberates heat, $\bar q_p$ is positive, so that the freezing-point is lowered.
The equation shows the depression of the freezing-point to be proportional
to the ratio of the number of mols of dissolved substance to the
number of mols of solvent. This is Raoult’s Law. The factor of
proportionality is determined by the melting-point $T_0$ of the pure solvent
and the heat of fusion $\bar q_p$. If several substances are dissolved, we have
the sum $\sum n_i'/n_0'$ in place of $n_1'/n_0'$, as may be verified at once. If, for

example, we dissolve a substance which breaks up into two con- 
stituents, say a binary Z-Z-valent electrolyte, the depression of the 
freezing-point must be twice as great as for the same number of mols 
of a substance which does not decompose. In reality the double value 
is attained only when the dilution is very great, from which we already 
concluded that equilibrium between molecules and ions exists, especi- 
ally since the degree of dissociation as determined by the lowering 
of the freezing-point agrees well with the values obtained from electro- 
lytic conduction. Actually the dissociation of strong electrolytes is 
much greater than that to be expected on this ground; the deviations 
from the value corresponding to complete dissociation are for the most 
part attributable to the electrical forces operating between the particles 
(cf. the following section). 

For substances which do not break up (non-ionogens) we obtain 
the molecular weight from the lowering of the freezing-point. It is 
readily calculated from (61) that one molecular weight of such a substance
dissolved in one litre of water causes the freezing-point to drop
to $-1.86°$ C.

The same calculations hold also for the elevation of the boiling-point,
the pressure remaining constant. The sign of $T - T_0$ is reversed,
since the solution is now the phase of lesser energy. The calculation
shows that one mol of a non-ionogen dissolved in one litre of water
raises the boiling-point to $100.52°$ C.
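Both figures can be checked from (61). The sketch below assumes rounded modern values for the heats of fusion and vaporization of water (6010 and 40660 J/mol) and takes one litre of water as 1000 g; these data are assumptions introduced for the check, not taken from the text.

```python
R = 8.314                                   # gas constant in J/(mol K)
MOLS_WATER_PER_LITRE = 1000.0 / 18.015      # about 55.5 mol, assuming 1000 g

def temperature_shift(n_solute, n_solvent, T0, q_p):
    """T - T0 from equation (61): -(n1'/n0') * R * T0^2 / q_p."""
    return -(n_solute / n_solvent) * R * T0 ** 2 / q_p

# Freezing-point depression for one mol of a non-ionogen per litre of water.
freezing_shift = temperature_shift(1.0, MOLS_WATER_PER_LITRE, 273.15, 6010.0)

# Boiling-point elevation: same formula with the sign of T - T0 reversed.
boiling_shift = -temperature_shift(1.0, MOLS_WATER_PER_LITRE, 373.15, 40660.0)

print(f"freezing-point: {freezing_shift:+.2f} deg, boiling-point: {boiling_shift:+.2f} deg")
```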

Finally, we wish to derive the formula for the osmotic pressure of 
a solution. Consider a system composed of two liquid phases, the 
solution and the solvent. They are prevented from mixing by a
separating wall. Let this wall be selectively permeable to the solvent. 
The wall may be considered non-existent for a substance whose free 
passage it permits; hence there must be the same number of mols of 
this substance per cubic centimetre on both sides of the membrane, 
as in the case of a gas. But in the system under consideration this is 
certainly not the case; for, besides the solvent, the dissolved sub- 
stance is also present on the solution side, so that the solvent will 
continue to pass through the membrane until there is no longer any 
difference in concentration on the two sides. Thus with a solution of 
finite concentration and equal pressure on the two sides, equilibrium 
cannot exist. However, by placing the solution side under higher 
pressure, the passage of solvent can be prevented. 





The two phases are thus under different pressures. A system of 
this kind is not included in our previous considerations, and the con- 
ditions for equilibrium must be derived anew. We wish to examine 
the equilibrium at a given temperature, a given pressure and a given 
pressure difference, which we call the osmotic pressure. Hence a virtual 
displacement of equilibrium is free to cause only a change in the 
number of mols in each phase, and it is evident that the condition of 
equilibrium $\delta G = 0$ again leads to the equation

$$\sum_k\sum_i \nu_i^{(k)}\log c_i^{(k)} = -\frac{1}{RT}\sum_k\sum_i \nu_i^{(k)} g_i^{(k)}.$$

The difference is that the functions $g_i^{(k)}$ refer to different pressures.
In the case at hand, the reaction consists of the transfer of one mol
of solvent to the solution side. To distinguish the solution from the
solvent, the former is denoted by a prime. Then

$$1\cdot\log c_0' - 1\cdot\log 1 = \log(1 - c_1') \approx -c_1' = \frac{1}{RT}\,g_0(p, T) - \frac{1}{RT}\,g_0(p + p_{\text{osm}}, T).$$

Developing the right member in a Taylor’s series which we terminate
with the first term, and using the relation (31) (p. 527), we obtain

$$-c_1' = -\frac{p_{\text{osm}}}{RT}\left(\frac{\partial g_0}{\partial p}\right)_T = -\frac{p_{\text{osm}}\,v_0}{RT}, \qquad p_{\text{osm}} = \frac{c_1' RT}{v_0} \approx \frac{n_1' RT}{n_0' v_0} = \frac{n_1' RT}{V}.$$

Thus the osmotic pressure is equal to the pressure which the same number
of mols of the substance would exert if they were in the form of a gas
occupying the volume of the solution at the same temperature. This was
first shown by van’t Hoff. The law will be deduced from another point
of view on p. 585.
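The order of magnitude of the osmotic pressure follows immediately from van't Hoff's law. A minimal sketch in SI units, with an illustrative concentration of one mol per litre at the freezing-point of water:

```python
R = 8.314              # gas constant in J/(mol K)
PA_PER_ATM = 101325.0

def osmotic_pressure(n_solute, volume_m3, T):
    """p_osm = n1' R T / V in pascals, valid for dilute non-electrolytes."""
    return n_solute * R * T / volume_m3

p_osm_atm = osmotic_pressure(1.0, 1.0e-3, 273.15) / PA_PER_ATM
print(f"p_osm = {p_osm_atm:.1f} atm")
```

The result, roughly 22 atmospheres, is the pressure one mol of an ideal gas would exert in one litre at the same temperature.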


5. Thermodynamic Equilibrium of Dilute Solutions of Strong Electrolytes.

As already mentioned, the older theory recognized the existence of
an equilibrium between the molecules and the ions of strong electrolytes.
However, in strong electrolytes, i.e. those which are good
conductors, the equilibrium does not satisfy the Law of Mass Action.
It is not possible to neglect the inter-ionic electrical forces;
the assumption that the individual particles may be
treated like the molecules of an ideal gas is evidently not justified
here. We must add to the previously used internal energy $U_{id}$ an
electrical contribution which, by the method of Debye, may be computed
without undue difficulty.

In the case of electrical conduction the situation is more compli- 
cated, and so we refrained from going through the very cumbersome 
calculations when we treated this phenomenon on p. 423. 

The entropy of the system is also altered; as a result the function 
G, which determines all equilibria at a given temperature and pressure, 
also has an added term: 


$$G = G_{id} + G_{el} = U + pV - TS. \qquad (64)$$

For condensed systems the term $pV$ is not of consequence, and it
is not necessary to distinguish between $(\partial G/\partial T)_p$ and $(\partial G/\partial T)_V$; hence
by equation (33) (p. 528) we have the following differential equation
for $G$:

$$G = U + T\frac{\partial G}{\partial T}.$$

Since this equation is satisfied by $G_{id}$, the added electrical term $G_{el}$
must also satisfy it; this serves to determine $G_{el}$ from $U_{el}$, for it follows
by integration of

$$G_{el} = U_{el} + T\frac{\partial G_{el}}{\partial T} \qquad (65)$$

that

$$G_{el} = -T\int \frac{U_{el}}{T^2}\,dT.^{*} \qquad (66)$$


This shows that most of the work required to find the function $G$ is
done if the electrical energy $U_{el}$ is known as a function of the temperature.
This will now be determined for a binary Z-Z-valent electrolyte.
The extension of the calculation to more complicated cases is
not difficult. Let us consider a single positive ion. As mentioned on
p. 423, there will be, averaged in time, more negative ions in its
neighbourhood than positive. The average distribution of charge may
then be replaced by a continuous space charge determined as follows:
According to Boltzmann’s Principle (cf. p. 588), the number of particles
possessing an energy $u$ is proportional to

$$e^{-u/kT},$$


* The integration is effected by starting with the homogeneous equation

$$G_{el} - T\frac{dG_{el}}{dT} = 0,$$

whose integral is $G_{el} = aT$. We now vary the parameter, putting $G_{el} = T\cdot v(T)$ and
obtaining the differential equation

$$U_{el} + T^2\frac{dv}{dT} = 0$$

for $v$. The integral is

$$v = -\int \frac{U_{el}}{T^2}\,dT.$$
where $k$ is the gas constant per molecule, $k = R/L$. At a place where
the potential is $\psi$ the charge $Ze$ has the energy $Ze\psi$, so that the number
of positive ions in a volume element $dV$ at that point is

$$dN_+ = C\,e^{-Ze\psi/kT}\,dV. \qquad (67)$$


The constant $C$ is determined from the fact that uniform distribution
must result for $\psi = 0$; $C$ is thus the average number of positive ions
per cubic centimetre, which we denote by $N_+$. In the same way, the
number of negative ions in the same volume element is

$$dN_- = N_-\,e^{+Ze\psi/kT}\,dV. \qquad (68)$$

For a Z-Z-valent electrolyte $N_+ = N_- = N$. Hence the space charge
density at a point where the potential is $\psi$ is

$$\rho = -2NZe\,\sinh\frac{Ze\psi}{kT}. \qquad (69)$$

Now $\rho$ and $\psi$ are connected by Poisson’s equation (p. 278 et seq.):

$$\Delta\psi = -\frac{4\pi\rho}{D} = \frac{8\pi NZe}{D}\,\sinh\frac{Ze\psi}{kT}, \qquad (70)$$


where $D$ is the dielectric constant of the solvent. The value of the
potential at a distance of one ionic radius from the point in question
is such that we may develop the term $\sinh(Ze\psi/kT)$ and omit the
higher powers. In this way we obtain

$$\Delta\psi = \frac{8\pi NZ^2e^2}{DkT}\,\psi = \kappa^2\psi, \qquad (71)$$

where

$$\kappa^2 = \frac{8\pi NZ^2e^2}{DkT}.$$

The quantity $\kappa$, which has the dimensions of the reciprocal of a length,
enters into all the subsequent computations. Its meaning becomes clear if we
leave for a moment the problem in hand, that of the distribution of
charge round a central point, and consider the simpler linear case of a plate at
the potential $\psi_0$ dipping into a liquid electrolyte. The differential equation is then

$$\frac{d^2\psi}{dx^2} = \kappa^2\psi \qquad (72)$$

and its integral is *

$$\psi = \psi_0\,e^{-\kappa x}. \qquad (73)$$
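By (73) the potential falls to $1/e$ of its value at the distance $1/\kappa$ from the plate, so that $1/\kappa$ measures the thickness of the charged layer. A minimal sketch of its magnitude in electrostatic units follows; the physical constants, the dielectric constant of water (78.4 at 25° C) and the concentration are assumed illustrative values.

```python
import math

E_ESU = 4.803e-10       # elementary charge in e.s.u. (assumed value)
K_ERG = 1.380649e-16    # gas constant per molecule k, in erg/K
AVOGADRO = 6.022e23     # Loschmidt's (Avogadro's) number L

def kappa_per_cm(conc_mol_per_litre, valence, dielectric, T):
    """kappa in 1/cm from kappa^2 = 8 pi N Z^2 e^2 / (D k T)."""
    ions_per_cm3 = conc_mol_per_litre * AVOGADRO / 1000.0   # N, ions of one sign
    kappa_sq = (8.0 * math.pi * ions_per_cm3 * valence ** 2 * E_ESU ** 2
                / (dielectric * K_ERG * T))
    return math.sqrt(kappa_sq)

# 0.001 mol per litre of a 1-1-valent electrolyte in water at 25 deg C.
thickness_nm = 1.0e7 / kappa_per_cm(0.001, 1, 78.4, 298.15)
print(f"1/kappa = {thickness_nm:.2f} nm")
```

Under these assumptions $1/\kappa$ is of the order of ten nanometres, and it shrinks as the square root of the concentration grows.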
For the spherically symmetric case which interests us here the
differential equation becomes

$$\frac{d^2\psi}{dr^2} + \frac{2}{r}\frac{d\psi}{dr} = \kappa^2\psi. \qquad (74)$$

As may be verified by substitution, the general solution is

$$\psi = A\,\frac{e^{-\kappa r}}{r} + B\,\frac{e^{\kappa r}}{r}. \qquad (75)$$

Since the potential must vanish at infinity, $B = 0$. The constant $A$ is
determined by the fact that the potential in the immediate neighbourhood
of the selected ion must be that of the ion alone. The resulting
value of $A$ is then $Ze/D$, and the solution becomes

$$\psi = \frac{Ze}{Dr}\,e^{-\kappa r} = \frac{Ze}{Dr} - \frac{Ze}{D}\,\frac{1 - e^{-\kappa r}}{r}. \qquad (76)$$

The second form shows how the potential is made up of one part
attributable to the central ion and another part due to the presence
of the surrounding ions of opposite sign, the “ion cloud”. At $r = 0$,
i.e. at the centre of the selected ion, the cloud generates the potential

$$\psi_0 = -\frac{Ze\kappa}{D}. \qquad (77)$$

This is equal to the work required to bring a unit charge from infinity
to the middle of the spherical cloud. Hence the potential energy
possessed by the central positive ion is

$$u_+ = +Ze\psi_0 = -\frac{Z^2e^2\kappa}{D}. \qquad (78)$$

Similarly, for a negative ion we would have

$$u_- = -\frac{Z^2e^2\kappa}{D}.$$

Since each ion is counted twice over (once as the central ion, once as
part of the ion cloud) the total energy is

$$U_{el} = -\frac{1}{2}\,\frac{2NVZ^2e^2\kappa}{D} = -\frac{NVZ^2e^2}{D}\sqrt{\frac{8\pi NZ^2e^2}{DkT}}. \qquad (79)$$

Substituting this in equation (66) (p. 548) and performing the integration,

$$G_{el} = -\frac{2}{3}\,\frac{NVZ^2e^2\kappa}{D}, \qquad (80)$$

or, since $NV = n_1 L$,

$$G_{el} = -\frac{2}{3}\,\frac{n_1 LZ^2e^2\kappa}{D}, \qquad (80')$$


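where $L$ is Loschmidt’s (Avogadro’s) Number. Then the Gibbs Potential
of the electrolytic solution at complete dissociation is

$$G = n_0 g_0 + 2n_1 g_1 + RT\,n_0\log c_0 + 2RT\,n_1\log c_1 - \frac{2}{3}\,\frac{n_1 LZ^2e^2\kappa}{D}, \qquad (81)$$

where

$$\kappa = \sqrt{\frac{8\pi NZ^2e^2}{DkT}} = \sqrt{\frac{8\pi n_1 LZ^2e^2}{VDkT}} \approx \sqrt{\frac{8\pi n_1 LZ^2e^2}{n_0 v_0 DkT}}. \qquad (82)$$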


The last form shows that $\kappa$ depends on the number of mols of the
solvent. Once the characteristic function $G$ is determined, all else is
merely a matter of computation.

We now calculate the difference between the lowering of the
freezing-point in this case and that for complete dissociation without
electrical interaction. The process consists of the conversion of one
mol of solvent from the liquid state to the solid state. The only quantities
which vary are the numbers of mols of the solvent in each phase;
it is to be remembered, however, that these enter into the quantity $\kappa$.
We shall not yet replace the variations in the equilibrium formula
by the conversion numbers $\nu_i$ which are proportional to them. Indicate
the liquid phase by one dash attached to the letters and the solid
phase by two dashes. Then we have, exactly as in § 4 (p. 545),

$$\delta n_0''\log c_0'' + \delta n_0'\log c_0' = -\frac{1}{RT}\left[g_0''\,\delta n_0'' + g_0'\,\delta n_0' - \frac{2}{3}\,\frac{n_1' LZ^2e^2}{D}\,\frac{\partial\kappa}{\partial n_0'}\,\delta n_0'\right]. \qquad (83)$$

Noticing that

$$\delta n_0'' = -\delta n_0'; \quad c_0'' = 1; \quad c_0' = 1 - 2c_1'; \quad \log c_0' \approx -2c_1', \qquad (84)$$

we obtain

$$2c_1' = \frac{1}{RT}\left(g_0' - g_0'' - \frac{2}{3}\,\frac{n_1' LZ^2e^2}{D}\,\frac{\partial\kappa}{\partial n_0'}\right). \qquad (85)$$

If we develop the functions $g_0'$ and $g_0''$ at the point $T_0$, i.e.
the freezing-point of the pure solvent, and if we remember that

$$\left(\frac{\partial g_0'}{\partial T}\right)_p = -s_0'; \quad \left(\frac{\partial g_0''}{\partial T}\right)_p = -s_0''; \quad s_0' - s_0'' = \frac{\bar q_p}{T_0}, \qquad (86)$$

we obtain, with $\Delta T = T - T_0$,

$$2c_1' = -\frac{\Delta T\,\bar q_p}{RT_0^2} - \frac{2}{3}\,\frac{n_1' LZ^2e^2}{DRT_0}\,\frac{\partial\kappa}{\partial n_0'}. \qquad (87)$$
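If the electrical forces were not taken into account we would have for
the lowering of the freezing-point, at complete dissociation,

$$\Delta T_{id} = -2c_1'\,\frac{RT_0^2}{\bar q_p}, \quad \text{i.e.} \quad 2c_1' = -\frac{\Delta T_{id}\,\bar q_p}{RT_0^2}. \qquad (88)$$

Using this expression to compute the relative departure from the
present value we obtain

$$\frac{\Delta T_{id} - \Delta T}{\Delta T_{id}} = 1 - \frac{\Delta T}{\Delta T_{id}} = -\frac{1}{3}\,\frac{n_0' LZ^2e^2}{DRT_0}\,\frac{\partial\kappa}{\partial n_0'}. \qquad (89)$$

It remains only to compute the value of $\partial\kappa/\partial n_0'$. From the approximate
expression

$$\kappa = \sqrt{\frac{8\pi n_1' LZ^2e^2}{n_0' v_0 DkT_0}}$$

we have at once

$$\frac{\partial\kappa}{\partial n_0'} = -\frac{\kappa}{2n_0'}, \qquad (90)$$

and since $n_1'/n_0' = c_1'$ and $R = kL$,

$$1 - \frac{\Delta T}{\Delta T_{id}} = \frac{1}{6}\,\frac{LZ^2e^2\kappa}{DRT_0} = \frac{\sqrt{2\pi}}{3}\sqrt{\frac{n_1'}{V_0'}}\;\frac{L^2Z^3e^3}{D^{3/2}(RT_0)^{3/2}}. \qquad (91)$$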


We then have the following result: The relative departures from the 
value of the depression of the freezing-point for complete dissociation 
and negligible electrical interaction are directly proportional to the 
square root of the molar concentration $n_1'/V_0'$, inversely proportional
to the 3/2 power of the dielectric constant and, for Z-Z-valent
electrolytes, directly proportional to the cube of the valence. This
relationship is well substantiated experimentally for very dilute 
solutions. 
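The three proportionalities can be verified numerically from the first form of (91), $1 - \Delta T/\Delta T_{id} = LZ^2e^2\kappa/6DRT_0$. The sketch below works in electrostatic units; the constants, the dielectric constant of water near 0° C (taken as 88) and the concentration are assumed illustrative values.

```python
import math

E_ESU = 4.803e-10       # elementary charge in e.s.u. (assumed value)
K_ERG = 1.380649e-16    # gas constant per molecule k, in erg/K
AVOGADRO = 6.022e23     # Loschmidt's (Avogadro's) number L
R_ERG = K_ERG * AVOGADRO  # gas constant per mol, R = kL

def kappa_per_cm(conc_mol_per_litre, valence, dielectric, T):
    """kappa in 1/cm from kappa^2 = 8 pi N Z^2 e^2 / (D k T)."""
    ions_per_cm3 = conc_mol_per_litre * AVOGADRO / 1000.0
    return math.sqrt(8.0 * math.pi * ions_per_cm3 * valence ** 2 * E_ESU ** 2
                     / (dielectric * K_ERG * T))

def relative_departure(conc_mol_per_litre, valence, dielectric, T0):
    """1 - dT/dT_id = L Z^2 e^2 kappa / (6 D R T0), first form of (91)."""
    kappa = kappa_per_cm(conc_mol_per_litre, valence, dielectric, T0)
    return AVOGADRO * valence ** 2 * E_ESU ** 2 * kappa / (6.0 * dielectric * R_ERG * T0)

departure = relative_departure(0.001, 1, 88.0, 273.15)
print(f"relative departure at 0.001 mol/litre: {departure:.4f}")
```

For a 1-1-valent salt at a thousandth of a mol per litre the departure comes out at about one per cent under these assumptions; quadrupling the concentration doubles it, and doubling the valence multiplies it by eight.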

A refinement of the calculation must take into account the finite
ionic radius; this has been done in the complete theory of Debye and
Hückel.



CHAPTER XXXIII 


The Nernst Heat Theorem 

1. Free Energy as a Measure of Chemical Affinity; Determination of 
this Quantity for the Galvanic Cell. 

In the “ classical ” thermodynamics, which employs only the two 
fundamental laws discussed in previous chapters, the absolute value 
of the entropy remains undetermined; as a result, the values of the 
functions F and G are fixed only as far as a linear function of T. This 
indeterminacy becomes very inconvenient in certain applications. 
Thus we saw in the preceding chapter (p. 634) that it is not possible 
to calculate the absolute value of the logarithm of the vapour pres- 
sure from a knowledge of the specific heat and heat of vaporization — 
that we required, in addition, a knowledge of the “ chemical constant ”, 
which is closely connected with the entropy constant. But the latter 
is determined by the two laws of thermodynamics only to the extent 
that a value once given must be preserved in all changes of the system. 
On the other hand, the Nernst theorem leads to a definite statement 
regarding the entropy constant of solid and liquid substances. The 
calculation of the chemical constant of a gas can be performed only 
with the help of the Quantum Theory. It is through a knowledge of 
the entropy constant, or of the chemical constant, that the equilibrium 
formulae of the preceding chapter become really of practical use. Only
when this is known can a chemical equilibrium be computed theoretically.

Nernst arrived at his heat theorem through the problem of calculating
chemical affinity. It was a considerable time before a practical
measure of the somewhat anthropomorphic concept of the chemical
affinity of two substances (or, as we say to-day, of a chemical
reaction) was found. One of the foremost demands to be made of such
a measure is that it should tell in which direction the reaction
proceeds. At first the heat of reaction was taken as a measure of affinity
(Berthelot's Principle), and, as a matter of fact, at room temperature
most reactions are accompanied by the evolution of heat. Nevertheless,
there are some very common processes, like the dissolving of most salts
in water, which proceed of their own accord although they are accompanied
by cooling. It was later recognized that for reactions at
constant volume the free energy F is the proper measure of affinity,
while for reactions proceeding at constant pressure it is the Gibbs
Potential G. These quantities are a minimum at equilibrium; the
greater the values of these functions compared with the minimum
values, the farther the system is from equilibrium, and hence the more
readily will the reaction occur. The free energy of a chemical reaction 
proceeding at constant pressure is easily determined experimentally 
by using the reaction in a simple reversible galvanic cell and measuring 
the electromotive force of the cell. For condensed systems, i.e. those 
not containing the gaseous phase, it is not necessary to distinguish 
between isometric and isobaric reactions and hence between F and G. 
Suppose the chemical change in this cell takes place by the migration 
of an ion bearing a charge of Z units. Then when one mol has been 
transported, a charge ZJ has passed. The Faraday equivalent charge 
is denoted by J here to avoid confusing it with the free energy. For 
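a difference of potential $\Phi$, the work done is given by $ZJ\Phi$, and this
is equal to the change in free energy:

$$f_1 - f_2 = ZJ\Phi. \tag{1}$$

Now f satisfies the differential equation

$$f = u + T\,\frac{\partial f}{\partial T},$$

so that

$$(f_1 - f_2) - T\,\frac{\partial (f_1 - f_2)}{\partial T} = u_1 - u_2, \tag{2}$$

and hence by (1),

$$\Phi - T\,\frac{\partial \Phi}{\partial T} = \frac{u_1 - u_2}{ZJ}. \tag{3}$$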


This important relationship between the potential difference, the heat
of reaction of the chemical process and the temperature coefficient of
the potential difference, may be derived also from the equation
$T\,dS = dU + dW$ if the external work is expressed in terms of the
electrical quantities (cf. Ex. 125, p. 528). Since it is usual to express
$\Phi$ in volts and J in coulombs, the heat of reaction must be expressed
in joules.
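
The use of this relationship can be illustrated by a short calculation. The sketch below assumes the form $u_1 - u_2 = ZJ(\Phi - T\,\partial\Phi/\partial T)$; the function names and the cell data in the example (an electromotive force of 1.10 volts with a temperature coefficient of $-4 \times 10^{-4}$ volt per degree) are hypothetical and serve only to show the orders of magnitude.

```python
FARADAY = 96485.0  # coulombs per mol (the quantity called J in the text)


def free_energy_change(emf_volts, charge_number):
    """Change in free energy f1 - f2 = Z J Phi, in joules per mol."""
    return charge_number * FARADAY * emf_volts


def heat_of_reaction(emf_volts, emf_temp_coeff, temperature, charge_number):
    """Heat of reaction u1 - u2 = Z J (Phi - T dPhi/dT), in joules per mol."""
    return charge_number * FARADAY * (emf_volts - temperature * emf_temp_coeff)


if __name__ == "__main__":
    # Hypothetical cell: Phi = 1.10 V, dPhi/dT = -4e-4 V/K, T = 298 K, Z = 2.
    df = free_energy_change(1.10, 2)
    du = heat_of_reaction(1.10, -4.0e-4, 298.0, 2)
    print("free energy change (J/mol):", df)
    print("heat of reaction   (J/mol):", du)
    print("relative difference:", (du - df) / du)
```

With a small temperature coefficient the two quantities differ by only a few per cent, which is the empirical observation from which the following argument starts.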

By examination of the empirical data, it is found that the electromotive
force of most cells at room temperature is fairly accurately
given by the simple formula

$$ZJ\Phi = u_1 - u_2, \tag{4}$$

obtained by setting the change in free energy equal to that of the
total energy. The lower the temperature at which the measurements
are taken, the better the agreement is found to be. One would conclude
from this that at absolute zero the differences $f_1 - f_2$ and $u_1 - u_2$
are equal, i.e. that Berthelot's Principle is strictly true at that
temperature. By equation (2), this means that

$$\frac{df_1}{dT} - \frac{df_2}{dT}$$

remains finite for T = 0, or else that it becomes infinite to so low an
order that the limit

$$\lim_{T \to 0} T\left(\frac{df_1}{dT} - \frac{df_2}{dT}\right)$$

vanishes. But since the agreement is still good at relatively high
temperatures, Nernst made the hypothesis that the curves $f_2(T) - f_1(T)$
and $u_2(T) - u_1(T)$ not only intersect at T = 0 but touch each other
there. Corresponding to the way in which he arrived at his theorem,
Nernst made a statement concerning only the difference of the
functions f and u before and after the reaction, but Planck extended
the hypothesis by assuming that it holds for $f_1$ and $f_2$ separately,
i.e. that

$$\lim_{T \to 0} f(T) = \lim_{T \to 0} u(T), \qquad \lim_{T \to 0}\left(\frac{\partial f}{\partial T}\right) = \lim_{T \to 0}\left(\frac{\partial u}{\partial T}\right). \tag{5}$$

As we shall see in the following section, this serves to determine the
entropy constant of pure, condensed substances.



2. Formulation of the Nernst Heat Theorem. 

We obtain a new form of the Nernst theorem by solving the differential
equation of free energy (deduced on p. 527) for $(\partial f/\partial T)$:

$$\frac{\partial f}{\partial T} = \frac{f - u}{T}.$$

Since $f - u$ becomes zero when T = 0, the expression $\partial f/\partial T$ assumes the
indeterminate form 0/0 under these conditions. The usual means of
evaluating such an expression is to differentiate numerator and denominator
separately and then substitute T = 0. This yields

$$\left(\frac{\partial f}{\partial T}\right)_0 = \left(\frac{\partial f}{\partial T}\right)_0 - \left(\frac{\partial u}{\partial T}\right)_0. \tag{6}$$

Now, by the Nernst-Planck relation, the entire right member is zero, so
that each term on the right must be separately zero in this case. The
character of the function is represented in fig. 1.

According to equation (30), p. 526,

$$s = -\left(\frac{\partial f}{\partial T}\right)_v, \tag{7}$$

from which it follows that the entropy s also vanishes at the absolute
zero.



We may therefore state the Nernst theorem in this way: At the 
absolute zero of temperature the entropy of a pure solid or liquid has 
the value zero. This statement applies only to pure substances, 
since for mixtures there must be an additional term covering the 
increase of entropy which accompanies the process of mixing. We
must also except gases, as long as we retain the classical gas equation, 
for this equation yields an infinite entropy constant at T = 0 if the 
value of S is to remain finite. In reality, it follows from the new 
statistics (p. 626 et seq.) that the equation of state of a gas is subject 
to a correction which preserves the general validity of the Nernst 
theorem. In particular, there can be no difference in the entropy of 
two different modifications of a substance at the absolute zero. For 
instance, a paramagnetic material must have the same zero-point 
entropy whether it is magnetized or not; i.e. it must then behave 
like a permanent magnet even without the field. 

In a statistical interpretation, the Nernst theorem affirms that a
body must be in its greatest state of order at the absolute zero. It is
precisely here that the theorem has a limitation as far as its practical
meaning is concerned. To be sure, the state of maximum order is that
of equilibrium at T = 0, but as to the amount of time required for the 
attainment of that equilibrium, thermodynamics has nothing to say. 
Since the motions of ions and of molecules dwindle at low temperatures, 
states of non-equilibrium can persist indefinitely. For example, glasses 
have a state of higher order (the crystalline), and yet a piece of glass
is still glass at T = 0 ! Circumstances are otherwise in connexion with 
the motion of electrons and the orientation of the magnetic moments 
connected with them. Their motion is not hindered by any increase of 
internal friction with falling temperature. For this reason the conse- 
quences for magnetization are of especial importance. 


3. Consequences for the Specific Heats and Temperature Coefficients 
The Chemical Constant of a Gas. 

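It follows from

$$ds = \left(\frac{\partial s}{\partial T}\right)_v dT + \left(\frac{\partial s}{\partial v}\right)_T dv = \frac{1}{T}\,(du + p\,dv) \tag{8}$$

that at constant volume

$$\left(\frac{\partial s}{\partial T}\right)_v = \frac{1}{T}\left(\frac{\partial u}{\partial T}\right)_v = \frac{c_v}{T} \tag{9}$$

and

$$\frac{\partial^2 s}{\partial T\,\partial v} = \frac{1}{T}\,\frac{\partial c_v}{\partial v}. \tag{10}$$

This is a relation we shall need later. Integrating (9) from 0 to T we
obtain, by Nernst's theorem,

$$s = \int_0^T \frac{c_v}{T}\,dT. \tag{11}$$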

In order that the integral may converge, the specific heat (for con- 
densed systems it is unnecessary to distinguish between c v and c p ) 
must approach zero sufficiently rapidly as the temperature diminishes. 
Actually, the theory of the specific heats of solids (p. 612) gives a 
decrease as the third power of T. On the other hand, the value given 
by the classical statistics for the specific heat of a gas is different from 
zero even at T = 0. Hence we must leave gases out of consideration 
as long as we take the classical statistics to be correct (see preceding 
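section). It follows further from the Nernst Heat Theorem that
all temperature coefficients of condensed substances vanish at the
absolute zero. We shall demonstrate this for the isometric pressure
coefficient. It follows from

$$df = -s\,dT - p\,dv \tag{12}$$

that $(\partial p/\partial T)_v$ is given by

$$\left(\frac{\partial p}{\partial T}\right)_v = \left(\frac{\partial s}{\partial v}\right)_T \tag{13}$$

and

$$\left(\frac{\partial p}{\partial T}\right)_v = \int_0^T \frac{\partial^2 s}{\partial T\,\partial v}\,dT,$$

whence

$$\lim_{T \to 0}\left(\frac{\partial p}{\partial T}\right)_v = 0. \tag{14}$$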

As regards gases, it has already been mentioned that theories have 
been developed in recent years which yield the usual gas equation as 
an approximation at higher temperatures, while in the neighbourhood 
of absolute zero neither the equation of state nor the constancy of 
the specific heat of a monatomic gas is retained. This is called the 
degeneration of a gas. Since all gases condense at temperatures con- 
siderably above the absolute zero, attempts to verify this degeneracy 
experimentally have not yet succeeded. We shall not take up these 
theories (cf. p. 628) at this point, but shall pursue another line of 
inquiry; this will not yield the value of the entropy at absolute zero, 
but will determine the entropy constant at finite temperatures. For 
this purpose we consider the equilibrium between a gas and its con- 
densed liquid, for which system the vapour pressure formula (29) 
(p. 538) is valid. Here the entropy constant $s_{2,0}$ of the liquid is to be
set equal to zero. Again, we imagine the measurements performed
at very low temperatures, where the integrals contribute nothing.
This is because the specific heats of the liquid vanish as $T^3$ and the
part of the specific heat of the gas which depends on T also approaches 
zero rapidly, and all that is left for any gas is the specific heat of a 
monatomic gas as the part c vc which is independent of the temperature. 
The vapour pressure equation is then 


l°gp = - If, + C f lo g T + . . (15) 


Thus, by measuring the vapour pressure, the chemical constant 



(and therefore the entropy constant) of the gas may be determined.* 
On the other hand, the theoretical calculation of this quantity is 
possible only with the aid of quantum statistics. 


* The subscript “abs” is appended to i to denote that this is not the usual i in
which the logarithms of the conversion factors are also included (cf. p. 538).
4. Unattainability of the Absolute Zero.

It is a consequence of the Nernst heat theorem that the absolute
zero is an asymptotic point — it may be approached indefinitely but 
never strictly reached. Since many interesting phenomena are en- 
countered in the small range between, say, 1° absolute and 0, it would 
be better to use a logarithmic scale of temperature in this region, for 
the scale itself would then have an unattainable zero. 

In order to demonstrate the unattainability, we consider the most
feasible means of cooling, that of adiabatic demagnetization discussed
earlier. Assume that the entropy-temperature curves have the form
shown in fig. 2a, which is contrary to the Nernst theorem. In any case,
at higher temperatures the entropy curve $S_0$ of the unmagnetized
body must lie above the curve $S_M$ of the magnetized body. The reason
is that entropy, as will be explained in the following chapter, is a
measure of the disorder, and this is certainly greater in the unmagnetized
state, where the elementary magnets are oriented at random.
Now assume demagnetization to take place adiabatically and reversibly
at temperature $T_a$. The process will be isentropic, for if $\delta Q$ is zero in
the reversible process, then $\delta S$ must also be zero. In the diagram, then,
we proceed along a horizontal line. The point of intersection with $S_0$
gives the temperature reached. If we were to start from a sufficiently
low temperature, curves of this form would allow the absolute zero to be
reached. However, according to the Nernst theorem there
can be no entropy difference between the two forms at the zero point,
since the unmagnetized body must then behave as though a field were present,
and so the curves must run as shown in fig. 2b, in which case the zero
point cannot be reached. Starting from a temperature T, the steepness
of descent of the curve $S_0$ determines how close to T = 0 one may
come.

[Fig. 2]
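
The argument can be made concrete with a simple model. The sketch below assumes, purely for illustration, that both entropy curves follow a cubic law, $S_0 = aT^3$ for the unmagnetized and $S_M = bT^3$ for the magnetized body with $b < a$, so that both vanish at T = 0 as the Nernst theorem requires; the function name and the numerical values are hypothetical. Each stage consists of an isothermal magnetization followed by an isentropic demagnetization.

```python
def demagnetization_cascade(t_start, a_unmagnetized, b_magnetized, stages):
    """Temperatures reached after successive magnetize/demagnetize stages.

    Assumed model: S_0 = a T^3 (field off), S_M = b T^3 (field on), b < a.
    Isothermal magnetization at T lowers the entropy to b T^3; the isentropic
    demagnetization then ends at T' with a T'^3 = b T^3.
    """
    ratio = (b_magnetized / a_unmagnetized) ** (1.0 / 3.0)
    temperatures = [t_start]
    for _ in range(stages):
        temperatures.append(temperatures[-1] * ratio)
    return temperatures


if __name__ == "__main__":
    for stage, temp in enumerate(demagnetization_cascade(1.0, 8.0, 1.0, 10)):
        print(stage, temp)
```

In this model every stage lowers the temperature by the same factor, so the temperatures form a geometric sequence which approaches zero but never attains it in a finite number of stages.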

No matter what adiabatic process is considered, the same con- 
clusion results. If, for instance, in analogy with the expansion of a 
gas, we seek to cool a solid * by sudden extension, we get similar 
curves for two different pressures, since at any given pressure s must
be zero for T = 0. 

Despite the fact that the unattainability of the absolute zero can 
thus be deduced from the Nernst law, the two must not be considered 
equivalent formulations. This is the case only if we add the experi- 
mental fact that the specific heats of all substances approach zero at 
least as T 3 (see p. 612). 


* At low temperatures gases no longer exist as such.



CHAPTER XXXIV


The Elementary Kinetic Theory of Matter 


1. Bernoulli’s Formula; Boyle’s Law. 

As already indicated on p. 496 , an ideal gas is defined from the 
atomistic point of view as an aggregate of particles which have no 
extension, and between which no forces operate. These particles are 
in motion amongst each other with all possible velocities. The ap- 
parently steady pressure exerted by the gas is due to the myriad 
impacts of these particles on the walls, from which they rebound 
elastically. Collisions between molecules are not possible for an ideal 
gas, since the particles have no extension. We wish to calculate the 
pressure. Resolve the velocity c of a molecule into its rectangular
components $c_x$, $c_y$, $c_z$. Consider a portion of a wall normal to the x-axis.
Each time a molecule collides with this wall it suffers a change in the
x-component of its momentum amounting to $-2mc_x$, while the other
components are not altered by the elastic collision. But according to
Newton's Third Law, the impulse communicated to the wall is also
of this magnitude. The impacts follow one another in such rapid
succession that the series of impulses is felt as a constant force. If
each impact lasts a short time $\Delta t$, the average force $\bar F_x$ which the wall
applies to the molecule is given by (p. 108):

$$\bar F_x = \frac{1}{\Delta t}\int F_x\,dt = -\frac{1}{\Delta t}\,2mc_x. \tag{1}$$

By Newton's Third Law, the force exerted by the molecule on the wall is

$$\bar F_x' = +\frac{1}{\Delta t}\,2mc_x. \tag{1'}$$

If $\Delta N$ molecules strike unit area of the wall in the time $\Delta t$, the pressure is

$$p = \frac{\Delta N}{\Delta t}\,2mc_x. \tag{2}$$

The quotient $\Delta N/\Delta t$ is the number of molecules striking each unit
area of the wall per second.
area of the wall per second. 

All possible velocities occur, with respect to both direction and 
magnitude. With regard to the former, there is no uniquely indicated 
direction in space; we may therefore consider that, on the average, 
one-third of all the molecules move in the direction of each axis. Of 
each of these groups, half move in the direction of the positive axis, 
half in the negative direction. Further, we assume here that all the 
molecules have the same speed. In the next second, all those mole- 
cules within a cylinder of base unity and altitude c normal to the wall 
which are moving toward the wall will collide with it, for a molecule 
distant c from the wall and approaching it will just reach it in one 
second, while all those farther away will not. If there are N molecules
in each cubic centimetre, N/6 of these move toward the wall which
is perpendicular to the x-axis. In all, Nc/6 molecules collide with this
wall each second, so that

$$p = \frac{Nc}{6}\cdot 2mc = \frac{1}{3}\,Nmc^2. \tag{3}$$


Since all possible velocities occur, $c^2$ is to be replaced by its average
value $\overline{c^2}$, the mean square velocity. The square root of this quantity
(the so-called “root mean square” velocity) is only approximately
equal to the mean velocity (cf. p. 593).

Let us add here a more exact deduction of formula (3), one that takes into
account the distribution of the velocities of the molecules. In reality, there is no
molecule that moves with exactly a given speed in a given direction at any
instant. Rather, we must assume that in each cubic centimetre there are $d^3N$
molecules * whose velocities have magnitudes falling between c and c + dc, and
directions lying within a solid angle $d\Omega$.†

Since all directions are on a par with each other, the number of molecules
whose directions fall within a solid angle $d\Omega$ will be proportional to the size of
this angle; the number of molecules whose speeds fall between c and c + dc will,
however, depend on c through an as yet undetermined distribution function f(c).
Thus we may write

$$d^3N = f(c)\,dc\,d\Omega = f(c)\,dc\,\sin\theta\,d\theta\,d\phi. \tag{4}$$

The total number of molecules in a cubic centimetre, given by integrating over
the unit sphere and over all speeds, is equal to

$$N = 4\pi\int_0^\infty f(c)\,dc. \tag{4'}$$

The number of molecules in $d\Omega$ hitting unit area of the wall in time $\Delta t$ is the same
as the number contained in a cylinder whose base has unit area and whose length
is $c\,\Delta t$. Since the axis of this cylinder makes an angle $\theta$ with the normal to the
wall, the number is $d^3N \cdot c\cos\theta \cdot \Delta t$. Each molecule exerts a force $2mc\cos\theta/\Delta t$ on
collision, so that by integrating over all directions and speeds the pressure is found
to be

$$p = \int_0^\infty\!\int_0^{\pi/2}\!\int_0^{2\pi} f(c)\,dc\,\sin\theta \cdot 2mc^2\cos^2\theta\,d\theta\,d\phi = \frac{4\pi}{3}\,m\int_0^\infty f(c)\,c^2\,dc. \tag{5}$$

If we deal with any function $\phi$ of c, it is logical to define the mean
value $\bar\phi$ as

$$\bar\phi = \frac{\int \phi(c)\,f(c)\,dc}{\int f(c)\,dc} = \frac{4\pi}{N}\int \phi(c)\,f(c)\,dc. \tag{6}$$

Hence, (5) becomes

$$p = \frac{1}{3}\,Nm\,\overline{c^2}. \tag{7}$$

* The notation $d^3N$ is used to indicate that we are dealing with a third-order
differential.

† The assumptions made here are not hard to understand in terms of an analogy.
If one asks in a large assemblage of people, “Who among those present is exactly
forty-five years old this minute?” it is almost certain that no one will respond. But
if one asks instead, “Who is between forty-five and forty-six years old?” several
individuals may make themselves known, and the greater the interval the greater
the number of expected responses.


Consider, now, one mol of gas. This contains L molecules, so that
the number of molecules in one cubic centimetre is N = L/v. If we
substitute this value in Bernoulli's equation, we obtain

$$pv = \frac{Lm\,\overline{c^2}}{3}. \tag{8}$$

This is the law of Boyle and Mariotte. Identifying the right member
of (8) with the right member of the gas equation (p. 495), we obtain
a relation between temperature and mean square molecular velocity:

$$RT = \frac{Lm\,\overline{c^2}}{3} \qquad \text{or} \qquad \overline{c^2} = \frac{3RT}{Lm}. \tag{9}$$

This shows that the mean square velocity of the molecules is directly
proportional to the absolute temperature and inversely proportional
to the molecular weight Lm.
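
As a numerical illustration of the relation between temperature and mean square velocity, the sketch below evaluates $\sqrt{3RT/(Lm)}$ in SI units. The function name is an assumption, and the molar mass used in the example (0.028 kg per mol, roughly that of nitrogen) is merely a convenient round value.

```python
import math

GAS_CONSTANT = 8.314462  # joules per (mol K)


def rms_speed(temperature, molar_mass_kg):
    """Root mean square speed sqrt(3 R T / (L m)) in metres per second."""
    return math.sqrt(3.0 * GAS_CONSTANT * temperature / molar_mass_kg)


if __name__ == "__main__":
    # Nitrogen-like gas (about 0.028 kg/mol) at the ice point.
    print("rms speed at 273.15 K:", rms_speed(273.15, 0.028))
    # Quadrupling the absolute temperature doubles the rms speed.
    print(rms_speed(4 * 273.15, 0.028) / rms_speed(273.15, 0.028))
```

The result, somewhat under 500 metres per second, is of the order of the speed of sound in the gas, as one would expect from the molecular picture.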

A complete kinetic theory derivation of the entire equation of state
requires statistical methods, and will be carried out in the following
chapter.

2. Number of Collisions and Mean Free Path for Real Gases.

The assumption that the molecules are particles with no extension
is, of course, a drastic simplification of the actual situation.
We obtain a better representation of the facts if we attribute
a finite size to the molecules, considering them to be elastic spheres of
radius r. At this time we wish to consider
only the collisions between the moving molecules. If all the molecules
with the exception of a single one were at rest, it would be a very
simple matter to calculate the average number of collisions 
experienced by this molecule. This computation will be carried 
out here. There is a collision whenever the distance between the
centre of the selected molecule and the centre of any other molecule
is less than d = 2r. The sphere of radius d, into which the
centre of no other molecule can penetrate, is called the “sphere
of influence” of the molecule. If the average velocity of the
selected molecule is c, the sphere of influence sweeps out a cylinder
of volume $\pi d^2 c$ in one second. Let there be N molecules per cubic
centimetre; this cylinder will then contain $N\pi d^2 c$ molecules and,
under the assumptions made, there will be

$$Z' = \pi N d^2 c \tag{10}$$

collisions per second.

In reality, the other molecules are also in motion, and so the mean
velocity of a single molecule must be replaced by the mean relative
velocity $\bar c_r$ of the molecules with respect to each other. This velocity
is readily computed if, for simplicity, we neglect the distribution of
velocities and calculate with a single velocity c. But since all possible
directions between velocity vectors occur, we still obtain a distribution
of relative velocities, the mean value of which we shall calculate.
If $\theta$ is the angle between the velocity vectors of two molecules,
the relative velocity is

$$c_r = \sqrt{c^2 + c^2 - 2c^2\cos\theta} = 2c\sin\frac{\theta}{2}.$$

The spatial average of the trigonometric function $\sin(\theta/2)$ must be
determined. In general, the mean of a function of the spherical co-ordinates
is obtained by multiplying the function by the element of
area of the unit sphere, integrating over the sphere, and dividing by
the area of the sphere, i.e. by $4\pi$:

$$\bar F = \frac{1}{4\pi}\int_0^{2\pi}\!\int_0^{\pi} F(\theta,\phi)\,\sin\theta\,d\theta\,d\phi.$$

In the present case we have

$$\bar c_r = \frac{2c}{4\pi}\int_0^{2\pi}\!\int_0^{\pi}\sin\frac{\theta}{2}\,\sin\theta\,d\theta\,d\phi = \frac{4}{3}\,c.$$

Hence the number of collisions per second becomes

$$Z = \frac{4}{3}\,\pi N d^2 c. \tag{11}$$
The more rigorous calculation, taking into account the distribution
of velocities, leads to the formula

$$Z = \sqrt{2}\,\pi N d^2 c. \tag{11'}$$

The difference in the numerical factors 4/3 = 1.33 and $\sqrt{2}$ = 1.41
is about 6 per cent. In view of the inaccuracy inherent in the nature
of the calculation, and in particular the indefiniteness of the concept
of atomic sphere of influence, the difference is of no practical account.
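
The factor 4/3 and its comparison with $\sqrt{2}$ may be verified by direct numerical averaging. The sketch below, whose function name is an assumption, averages the relative speed $2c\sin(\theta/2)$ (in units of c) over the unit sphere by the midpoint rule; since the integrand does not depend on the azimuth, that integration contributes only the factor $2\pi$.

```python
import math


def mean_relative_speed_factor(steps=100000):
    """Average of 2 sin(theta/2) over the unit sphere, in units of c."""
    d_theta = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * d_theta
        # Integrand times the polar part sin(theta) of the surface element.
        total += 2.0 * math.sin(theta / 2.0) * math.sin(theta) * d_theta
    # Azimuthal integration gives 2 pi; divide by the sphere's area 4 pi.
    return total * 2.0 * math.pi / (4.0 * math.pi)


if __name__ == "__main__":
    factor = mean_relative_speed_factor()
    print("single-speed factor:", factor)
    print("rigorous factor    :", math.sqrt(2.0))
    print("relative difference:", (math.sqrt(2.0) - factor) / math.sqrt(2.0))
```

The relative difference between the two factors comes out a little under 6 per cent, in agreement with the statement above.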

The mean free path l is defined as the average distance traversed by
a molecule between two collisions, i.e.

$$l = \frac{c}{Z} = \frac{3}{4\pi N d^2}, \tag{12}$$

or, according to the more rigorous calculation,

$$l = \frac{1}{\sqrt{2}\,\pi N d^2}. \tag{12'}$$

Let us investigate in more detail the significance of the mean free
path. Take W(x) to be the probability that a molecule travels a distance
x without suffering a collision. Then the probability of traversing the
distance x + dx is W(x + dx). This is composed of the probability
that the path x will be covered and the probability that the added
segment dx will be traversed also without collision. But according
to the Theory of Probability, the probability that both events will
occur is given by the product of the separate probabilities, since the
two are independent. Then

$$W(x + dx) = W(x) + \frac{dW}{dx}\,dx = W(x)\,W(dx). \tag{13}$$

Now the probability of free passage through a layer of thickness dx
is readily found. Imagine a cylinder whose length dx is parallel to
the direction of motion and whose cross-sectional area is unity. The
probability of a collision within the distance dx is obviously given by
the ratio of the combined areas of the front surface, through which
molecules which later suffer collisions pass, to the entire front surface
of unit area. The former quantity is equal to the sum of the projections,
on the front surface, of all the spheres of influence lying
within the cylinder,* i.e. $\pi N d^2\,dx$. Hence the probability that the
segment dx is traversed without collision is given by

$$W(dx) = 1 - \pi N d^2\,dx. \tag{13'}$$


the projections of the 
We then have

$$W(x) + \frac{dW}{dx}\,dx = W(x) - W(x)\,\pi N d^2\,dx.$$

Integrating,

$$W = Ce^{-\pi N d^2 x}.$$

The constant C is determined from the fact that the probability is
unity for a path of zero length. This makes C = 1, so that we have

$$W(x) = e^{-\pi N d^2 x} = e^{-x/l'}.$$

The expression $\pi N d^2$ is, by (10), the reciprocal of the free path l'
when all molecules but one are at rest, as we assume is the case
in the present instance also. If the motions of all the molecules are
taken into account, the reciprocal of the free path l is $4\pi N d^2/3$, and
the relationship

$$W(x) = e^{-x/l} \tag{15}$$


remains valid. Thus we may look upon the mean free path as that
path over which the probability of passage without collision is $e^{-1}$.
The distribution function of the free path is of simple exponential
form, hence the mean free path is not a free path of particularly frequent
occurrence, but merely the mean value, as may be verified by averaging
over all free paths, i.e. by evaluation of the expression

$$\frac{\int_0^\infty x\,W(x)\,dx}{\int_0^\infty W(x)\,dx} = l.$$
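
The exponential law for the free paths lends itself to a simple sampling experiment. The sketch below is illustrative only: it draws free paths from an exponential distribution with a chosen mean (the function name, the seed and the sample size are arbitrary assumptions) and then checks that the average path equals the mean free path and that the fraction of paths longer than l is close to $e^{-1}$.

```python
import math
import random


def sample_free_paths(mean_free_path, count, seed=0):
    """Draw free paths whose survival probability is W(x) = exp(-x / l)."""
    rng = random.Random(seed)
    return [rng.expovariate(1.0 / mean_free_path) for _ in range(count)]


if __name__ == "__main__":
    l = 2.0
    paths = sample_free_paths(l, 200000)
    mean_path = sum(paths) / len(paths)
    fraction_longer = sum(1 for x in paths if x > l) / len(paths)
    print("sample mean of the free paths:", mean_path)
    print("fraction exceeding l:", fraction_longer, "vs", math.exp(-1.0))
```

Short paths are the most frequent in such a sample, and only about 37 per cent of the paths exceed the mean, which illustrates that the mean free path is not a path of particularly frequent occurrence.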





3. Viscosity and Heat Conduction in Gases. Determination of Avogadro's
Number and of the Size of the Molecules.

(a) Viscosity at moderate pressures

Inasmuch as the results are very different, depending upon whether
the mean free path is small or large compared with the dimensions of
the vessel, we treat the case of moderate pressures and that of low
pressures separately. It should be remarked that the low-pressure
region for this purpose is that below a few hundredths of a millimetre
of mercury.

Let a gas be in motion parallel to the x-axis in such manner that
all the molecules in a given layer have a common forward speed u.
Assume a velocity gradient du/dz along the z-axis normal to the x-axis.
The hydrodynamics of viscous fluids gives for the force between two
layers of area S, in contact along the plane z = 0, the expression

$$F = \eta S\,\frac{du}{dz}. \tag{16}$$
We wish to compute the coefficient of viscosity $\eta$ on the basis of kinetic
theory concepts. According to Newton’s Second Law, the force which 
the upper layer exerts on the lower is equal to the rate of change of 
momentum of the latter. But the change of momentum can be calcu- 
lated on the molecular theory. As a result of the thermal agitation, 
individual molecules pass from the upper layer — where the flow 
velocity is greater — into the lower layer. These molecules possess 
forward momentum corresponding to that existing in the layer where 
they experienced their last impact. In the same way, molecules dif- 
fusing from the lower, less rapidly moving layer into the upper stratum 
cause a decrease of forward momentum there. The process may be 
likened to the case of two trains moving in the same direction along 
parallel tracks but with somewhat different speeds. If the passengers 
jump from one train to another, the faster one will be slowed down, 
the slower one will be accelerated. 

We shall now express this idea in mathematical form. Let the
number of molecules per unit volume having the velocity components
$v_x$ and $v_z$ be given by $\nu(v_x, v_z)$, where v is the resultant of the
gas-kinetic and flow velocities. Then the number of molecules passing
through unit area of the xy-plane per second is $\nu v_z$; these molecules
transport momentum whose x-component amounts to $\nu(v_x, v_z)\,v_z\,mv_x$.
In all, the force on each square centimetre is

$$\frac{F}{S} = \sum \nu(v_x, v_z)\,v_z\,mv_x, \tag{17}$$

where the summation is to extend over all values of $v_x$ and $v_z$ which
occur. The evaluation of this sum is accomplished as follows. First
we have

$$v_x = c_x + u, \qquad v_z = c_z. \tag{18}$$

If a molecule has experienced its last collision in the layer z, then

$$u(z) = u_0 + z\,\frac{du}{dz}, \tag{19}$$

where $u_0$ is the flow velocity in the stratum z = 0. Using this value, we
obtain

$$\sum \nu(v_x, v_z)\,mv_xv_z = \sum \nu(v_x, v_z)\,m\left(c_x + u_0 + z\,\frac{du}{dz}\right)c_z. \tag{20}$$

The terms $\sum \nu m c_x c_z$ and $\sum \nu m u_0 c_z$ cancel out, since there are just as many
molecules with positive $c_x$ and $c_z$ as with negative. However, $\sum \nu m z c_z\,(du/dz)$
does not cancel out, as may be seen by putting $z = r\cos\theta$,
$c_z = c\cos\theta$, in which case the factor $\cos^2\theta$ appears. If we replace r,
c and $\cos^2\theta$ in

$$\sum \nu(v_x, v_z)\,mv_xv_z = \sum \nu(v_x, v_z)\,m\,rc\cos^2\theta\,\frac{du}{dz} \tag{21}$$

by their mean values l,* $\bar c$ and 1/3 respectively, we obtain, on account
of $\sum \nu(v_x, v_z) = N$,

$$\frac{F}{S} = \frac{Nml\bar c}{3}\,\frac{du}{dz}, \tag{22}$$

whence, by comparison with (16),

$$\eta = \frac{Nml\bar c}{3}. \tag{23}$$


Inserting the value of the mean free path from equation (12) (p. 567),

$$\eta = \frac{m\bar c}{4\pi d^2} = \frac{M\bar c}{4\pi Ld^2}, \tag{24}$$

where M is the molecular weight of the gas and L is Avogadro's
Number. This formula is of importance on account of the fact that N,
the number of molecules per cubic centimetre, does not appear in it.
This means that the viscosity is independent of the density and hence
of the pressure of the gas. Although this result may seem contrary
to intuitive views, it is well substantiated by experiment. It is only
at very low pressures, where the mean free path becomes of the order
of magnitude of the dimensions of the container, that the theory must
be modified. In this case there is a dependence on density. Qualitatively,
it is easy to see why the viscosity is independent of the density
under normal conditions: despite the fact that the number of momentum
carriers increases with the pressure, their mutual hindrance
also increases in the same direction, thus tending to offset any
change.
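
The independence of the viscosity from the density can be exhibited numerically. The sketch below computes the viscosity in two ways, first from $Nml\bar c/3$ with the free path $l = 3/(4\pi Nd^2)$, and then from the closed form $M\bar c/(4\pi Ld^2)$. The function names are assumptions, as are the molecular data in the example (molecular weight 28, diameter $3.7 \times 10^{-8}$ cm, mean speed $4.5 \times 10^{4}$ cm per second), which are only typical values for a gas such as nitrogen.

```python
import math

AVOGADRO = 6.022e23  # molecules per mol (L in the text)


def viscosity_from_density(number_density, molecule_mass, diameter, mean_speed):
    """Viscosity N m l c / 3 with the free path l = 3 / (4 pi N d^2)."""
    free_path = 3.0 / (4.0 * math.pi * number_density * diameter**2)
    return number_density * molecule_mass * free_path * mean_speed / 3.0


def viscosity_closed_form(molar_mass, diameter, mean_speed):
    """Viscosity M c / (4 pi L d^2); the number density has dropped out."""
    return molar_mass * mean_speed / (4.0 * math.pi * AVOGADRO * diameter**2)


if __name__ == "__main__":
    molar_mass, d, c = 28.0, 3.7e-8, 4.5e4     # assumed CGS values
    m = molar_mass / AVOGADRO
    for n in (2.7e19, 2.7e18, 2.7e17):          # densities a factor 100 apart
        print(n, viscosity_from_density(n, m, d, c))
    print("closed form:", viscosity_closed_form(molar_mass, d, c))
```

The three densities give the same value, of the order of $10^{-4}$ gram per centimetre per second, which is the right order of magnitude for common gases; closer agreement is not to be expected from so rough a calculation.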

(6) Conduction of heat at moderate pressures 

Instead of a velocity gradient, assume that there is now a tem- 
perature gradient in the gas in the direction of the 2 -axis. According 
to p. 489, the flow of heat through the sy-plane is given by 

( 25 ) 

dz 


•At first sight, it may seem questionable whether the mean free path l may be 
substituted for the average value of r, since the next collisions may not take place 
precisely in the plane z - 0. In this connexion it must be remembered that there is 
no propagation of probability for the collisions; i.e regardless of which point of^h 
path we reckon from, the probability of traveling the segment r is ^othatr^ 
lienee the mean value of all paths described by the molecules after passing the plane 
z - o is l, and the same is true, of course, for the original question. 
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where the temperature is denoted by T. We can calculate the co- 
efficient of heat conduction on the basis of the kinetic theory, just as 
we did for viscosity. Instead of a transport of momentum, the passage 
of molecules from one layer to another effects a transport of thermal 
energy. Again, each molecule is assumed to possess the energy of 
thermal agitation corresponding to the layer in which it experienced 
its last collision. The thermal energy per molecule is 

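$$u = \frac{c_v}{L}\,T, \tag{26}$$

where $c_v$ is the specific heat (per mol). If there are $\nu(u, c_z)$ molecules per cubic
centimetre with the velocity component $c_z$ and thermal energy $u$,
the flow of heat is given by

$$-W = \sum \nu(u, c_z)\,c\,\frac{c_v}{L}\,r\cos^2\vartheta\,\frac{dT}{dz}. \tag{27}$$

Replacing $c$, $r$ and $\cos^2\vartheta$ by their average values, we obtain

$$-W = \frac{N l\,\bar c\,c_v}{3L}\,\frac{dT}{dz} = \frac{c_v\,\bar c}{4\pi L d^2}\,\frac{dT}{dz}, \tag{28}$$

i.e.

$$\lambda = \frac{c_v\,\bar c}{4\pi L d^2}. \tag{29}$$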



Thus the coefficient of thermal conduction is also independent of the 
pressure. 

It is, of course, not quite correct to replace the quantities $c$, $r$ and
$\cos^2\vartheta$ by their mean values in the sum, but again the rigorous theory
yields a numerical factor which differs but little from that obtained
here. The resulting factor is 0.247, which replaces 1/4.


(c) Viscosity and heat conduction at low pressures

Consider next the important case of very low pressures, where the
free path is large compared with the dimensions of the vessel and so
essentially only collisions with the wall take place.*

Imagine the gas placed between two parallel walls, one of which
moves past the other with a speed u. The moving plate exerts a force
[...] gas molecules that hit the moving plate
[...] the other plate at the [...]. Among all
directions of motion, consider a zone forming the angles $\vartheta$ and
$\vartheta + d\vartheta$ with the normal to the wall. Since we are concerned only with
those molecules that move toward the wall, we must take half the
number. The number in each cubic centimetre will then be

$$dN = C\sin\vartheta\,d\vartheta,$$

* [...] notice that at room temperature [...] in this instance [...] and that free paths [...]


where the proportionality factor C, evaluated by integration over a
hemisphere, turns out to be N/2. If the mean speed is $\bar c$, the number
of impacts per second per square centimetre becomes

$$\frac{N\bar c}{2}\int_0^{\pi/2}\sin\vartheta\cos\vartheta\,d\vartheta = \frac{N\bar c}{4}.$$
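The hemisphere integral can be checked numerically. The following is a minimal sketch, not part of the original derivation: the function name `impacts_fraction` and the midpoint rule are illustrative choices. It evaluates the factor multiplying $N\bar c$, which should come out close to 1/4.

```python
import math

def impacts_fraction(steps: int = 100_000) -> float:
    """Midpoint-rule estimate of (1/2) * integral_0^{pi/2} sin(t) cos(t) dt.

    The factor 1/2 is C/N, i.e. the share of molecules moving toward the wall.
    """
    h = (math.pi / 2) / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * h          # midpoint of the k-th strip
        total += math.sin(t) * math.cos(t)
    return 0.5 * total * h

fraction = impacts_fraction()
print(f"impacts per cm^2 per second = {fraction:.6f} * N * c_mean")
```

The printed factor multiplies the number density and the mean speed, so no physical constants are needed for the check.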


Since each molecule gets an additional speed u, the momentum transfer
per second, and hence the viscous force per unit area, amounts to

$$F = \frac{\rho\,\bar c}{4}\,u,$$

where $\rho$ is the density and is equal to $pM/RT$. Then, by (9), p. 565,
we have approximately

$$F \approx \frac{p\,u}{4}\sqrt{\frac{3M}{RT}}. \tag{30}$$

Inasmuch as there is no velocity gradient in the space occupied by 
the gas itself, we cannot speak of a coefficient in the sense of equation 
(16); rather, the momentum transfer is independent of the distance 
apart of the walls. 

In the same way, one finds that the heat transfer per unit area at
very low pressures amounts to

$$W = \frac{N\bar c}{4}\,\frac{c_v}{L}\,\Delta T = \frac{p\,\bar c\,c_v}{4RT}\,\Delta T, \tag{31}$$

where T is to be set equal to $\tfrac{1}{2}(T_1 + T_2)$. As long as the mean free path
is large compared with the dimensions of the vessel, viscosity and
heat conductivity are proportional to the pressure but independent of
the dimensions of the container.


(d) Kinetic Theory determination of Avogadro’s Number and of molecular diameter

Since we may replace $\bar c$ approximately by $\sqrt{\overline{c^2}} = \sqrt{3RT/M}$ (the
exact relation, as given on p. 592, is $\bar c = \sqrt{8/3\pi}\,\sqrt{\overline{c^2}}$), the only
unknowns remaining in equations (24) and (29) are L and d. Since both
occur only in the combination $Ld^2$, a further relation is needed to
determine these quantities separately. Such an equation is that for
the van der Waals b which, as we already saw on p. 497, is equal to
four times the space occupied by the spherical molecules:

$$b = 4L\,\frac{4\pi}{3}\,\frac{d^3}{8}. \tag{32}$$




Solution of the two equations for L and d yields numbers of the order
of $10^{-8}$ cm. for the molecular diameters and about $6 \times 10^{23}$ for L,
which was first determined in this way by Loschmidt. However, the
accuracy of this kinetic theory determination of L is not as great as
that involving the electronic charge $e = F/L$, from which is obtained
the generally accepted value $6.02 \times 10^{23}$. With reference to the
gas-kinetic diameter, it must be remembered that this is not the actual
diameter of a small, impenetrable sphere, but rather, according to the
way it was defined, the radius of the sphere of influence for molecular
collisions. In view of the electrical structure of atoms and molecules,
this sphere of influence is a result of the mutual electrical influences
of the colliding bodies. This does not mean that other methods
of determining atomic dimensions would necessarily yield the same
result. We shall find that the linear dimensions of the space occupied
by the electrons are of the order of magnitude of $10^{-8}$ cm. All experiments
which depend on the electron shells lead to dimensions of this
magnitude, but experiments on the passage of α-particles through
matter lead to values of the order of $10^{-12}$ cm. for the diameter of the
“centre”, i.e. the nucleus of the atom.


4. Derivation of some Properties of Crystals on the Molecular Theory. 

(a) Density and lattice constant 

While the beginnings of a molecular theory of liquids have been 
obtained by starting with a van der Waals gas and assuming the 
cohesive forces to be very large, there 
is no complete theory which explains 
all the main properties, as there is for 
gases. However, for crystals — which 
we may regard as the normal form of 
the solid state — the theory has been 
developed very extensively. On account 
of the elaborate mathematical equipment 
needed, we can give only the simplest 
parts of the theory. M. von Laue’s 
discovery of X-ray interference in crystals 
(p. 385) proved the existence of the lattice 
structure which mineralogists had long 
assumed. To-day a large number of lattice types are known. We shall 
here consider only the simplest case, that of sodium chloride.

Fig. 1

Measurement of the intensity of X-rays reflected from different lattice planes
shows that the corners of this lattice are occupied not by the neutral
atoms, as is the case for diamond, for example, but by the same Na⁺ and
Cl⁻ ions as are encountered in electrolysis. The ions of each kind are
distributed upon a cubic lattice, the two lattices being interspaced,
so that along any line the ions of the two varieties alternate
(“face-centred” lattice). If no distinction is made between the two kinds
of ion, the crystal represents a simple cubic lattice, in which only the
corners of the cubes are occupied (cf. fig. 1). The lattice constant a
is the distance between two homologous ions of the same kind. An
elementary cell is the smallest unit by whose translation the entire lattice
can be generated. In the present case it is a cube of edge a. In order
to calculate the density all that is necessary is to count the number
of Na and Cl ions in the elementary cell. It must be borne in mind,
however, that each corner ion belongs to eight cubes, and so is to be
counted only as 1/8; similarly, a point on an edge is to be counted
as 1/4, one on a bounding face, 1/2. Thus there are (8/8 + 6/2)
Na⁺ ions and (12/4 + 1/1) Cl⁻ ions corresponding to each elementary
cell. Representing the masses of the individual atoms by $M_{\mathrm{Na}}/L$ and
$M_{\mathrm{Cl}}/L$ respectively, the density is


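$$\rho = \frac{4\,(M_{\mathrm{Na}} + M_{\mathrm{Cl}})}{L a^3} = \frac{M}{L a^3/4} = \frac{M}{v}. \tag{33}$$

Hence the lattice constant is given by

$$a = \sqrt[3]{\frac{4\,(M_{\mathrm{Na}} + M_{\mathrm{Cl}})}{L\rho}}. \tag{34}$$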





This quantity is used to measure the wave-length of X-rays. The 
absolute value is again limited by the accuracy to which the Avogadro 
constant is known. Recently it has been found possible to use ruled 
gratings, of known grating constant, for X-rays, thus measuring the 
wave-length directly. The value of the wave-length may then be used 
to determine the lattice constant; this, together with the density, 
leads to a new determination of L. The value thus found agrees admir- 
ably with that obtained from recent oil-drop determinations of e. 

The method given here for obtaining an equation connecting the 
density and the lattice constant is, of course, perfectly general although 
equation (33) is not. 
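As a worked illustration of the relation between density and lattice constant, the sketch below evaluates the cube-root formula for sodium chloride. The numerical inputs (atomic weights, density, Avogadro constant) are assumed modern handbook values, not figures taken from the text, and the function name `lattice_constant` is purely illustrative.

```python
AVOGADRO = 6.022e23   # L, per mol (assumed modern value)
M_NA = 22.99          # atomic weight of Na, g/mol (assumed)
M_CL = 35.45          # atomic weight of Cl, g/mol (assumed)
RHO_NACL = 2.165      # density of rock salt, g/cm^3 (assumed handbook value)

def lattice_constant(molar_mass: float, density: float,
                     pairs_per_cell: int = 4,
                     avogadro: float = AVOGADRO) -> float:
    """Edge a (in cm) of the elementary cube: a^3 = pairs_per_cell * M / (L * rho)."""
    return (pairs_per_cell * molar_mass / (avogadro * density)) ** (1.0 / 3.0)

a_nacl = lattice_constant(M_NA + M_CL, RHO_NACL)
print(f"a = {a_nacl:.3e} cm, nearest-neighbour distance r0 = {a_nacl / 2:.3e} cm")
```

With these inputs the edge comes out near 5.6 × 10⁻⁸ cm. The factor 4 is specific to the NaCl cell count, which is why the formula itself is not general even though the counting method is.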


(b) Nature of the cohesive forces in a crystal. Lattice energy of polar crystals

What holds a crystal lattice together? What forces hinder a further
decrease in volume, so that even very great pressures succeed in producing
only a slight contraction of a solid body? The first question
is easy to answer for crystals which are built up of ions, the so-called
polar crystals: the electrostatic attraction between the ions holds
the structure together. The second question corresponds to that concerning
the nature of the interatomic or intermolecular forces which
give rise to the spheres of influence. Instead of regarding the ions as
spheres of a definite radius, it is more natural to introduce forces of
repulsion varying as a high inverse power of the distance. The
higher the power, the greater the relative effect at small distances, and
the better the correspondence to the collision of two rigid spheres,
where the large forces which come into play at the instant of impact
prevent a closer approach. The Quantum Theory has made clear the
nature of the interatomic forces: they must arise when the electron
shells interpenetrate. For the alkali halides the inverse tenth power
of the separation proves to be the appropriate law of force. In fact,
the Quantum Theory shows that a single power is really not sufficient:
the exponent itself depends upon the distance.

The Quantum Theory has also thrown light on the nature of the
forces of attraction in non-polar crystals, like diamond. These forces
correspond to the binding forces between uncharged atoms, e.g. [...]
the hydrogen molecule, and are explainable on the
basis of p. 71[...].

If there were but two ions present, the mutual potential energy
of an Na ion and a Cl ion, say, would be given by

$$u = -\frac{e^2}{r} + \frac{\beta}{r^9},$$

where $\beta$ is a factor not yet determined. It would be incorrect, however,
to say that the potential energy of the entire lattice is simply
the above energy u multiplied by the number of atom pairs present,
for since the electrostatic potential decreases only with 1/r, the
more remote ions also contribute to the potential at any point. The
somewhat troublesome computation of the potential due to alternate
positive and negative charges was made by Madelung, but cannot be
reproduced here. The result is that the potential energy per ion bond
is $-0.2905\,(e^2/r)$. We therefore put

$$u = -0.2905\,\frac{e^2}{r} + \frac{\beta}{r^9}. \tag{35}$$

If we make a graph of this potential energy function in terms of the
distance r, we obtain a curve having a minimum, since the negative
term preponderates for r large, the positive term for r small (fig. 2).
According to the laws of mechanics, this minimum corresponds to the
equilibrium distance $r_0$. But the minimum condition $du/dr = 0$ can
be interpreted also to mean that the force, which is obtained by
taking the potential gradient, vanishes at the equilibrium distance;
i.e. the forces of attraction and repulsion are of equal magnitude.
This serves to determine $\beta$, for

$$\frac{9\beta}{r_0^{10}} = 0.2905\,\frac{e^2}{r_0^2},$$

whence

$$\beta = \tfrac{1}{9}\,(0.2905)\,e^2 r_0^8, \tag{36}$$

and thus, in general,

$$u = -0.2905\,e^2\left(\frac{1}{r} - \frac{r_0^8}{9r^9}\right). \tag{37}$$

We obtain the total potential energy of the lattice in the position of
equilibrium ($r = r_0$) in the following way. In the NaCl lattice, six
bonds connect each ion with the neighbouring ions of opposite sign.
Since we count each bond but once, we may consider only the ions
of one sign, say the Na⁺ ions. The lattice contains L Na ions per
mol, so that the energy of each pair must be multiplied by 6L. We
then obtain the lattice energy per mol as

$$U = -1.743\,L e^2\,\frac{8}{9r_0}. \tag{38}$$

Using the fact that $r_0 = a/2$ and employing equation (34) (p. 574),
we have

$$U = -547 \times 10^3\,\sqrt[3]{\frac{\rho}{M_{\mathrm{Na}} + M_{\mathrm{Cl}}}};$$

in general,

$$U = -547 \times 10^3\,\sqrt[3]{\frac{\rho}{M}}, \tag{39}$$

where the energy is expressed in calories.* This lattice energy, valid
for all salts of the NaCl type, is negative, since energy is liberated
when the oppositely charged ions approach each other. This is then
the work which must be done to separate the parts of the lattice and
scatter them infinitely far apart. It is not equal to the heat of vaporization,
for in that case the lattice is converted into molecules.
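The numerical coefficient in the lattice energy can be reproduced from the bond energy at equilibrium. The sketch below is an illustration under stated assumptions: the constants are modern values in electrostatic (CGS) units, the nearest-neighbour distance for NaCl is an assumed handbook figure, and the function names are hypothetical. Each of the 6L bonds per mol contributes $-0.2905\,e^2\,(8/9)/r_0$, where the factor 8/9 is what the inverse-ninth-power repulsion leaves at the minimum of the potential curve.

```python
E_CHARGE = 4.803e-10    # electronic charge in esu (assumed modern value)
AVOGADRO = 6.022e23     # L, per mol (assumed modern value)
ERG_PER_CAL = 4.184e7   # erg per calorie
R0_NACL = 2.82e-8       # nearest-neighbour distance in NaCl, cm (assumed)

def lattice_energy_cal(r0_cm: float) -> float:
    """Lattice energy per mol in calories for 6 L bonds at equilibrium spacing r0."""
    bond_energy_erg = -0.2905 * E_CHARGE**2 * (8.0 / 9.0) / r0_cm
    return 6.0 * AVOGADRO * bond_energy_erg / ERG_PER_CAL

def density_prefactor_cal() -> float:
    """K in U = -K * (rho / M)**(1/3), using r0 = a/2 = (M / (2 L rho))**(1/3)."""
    return (6.0 * 0.2905 * (8.0 / 9.0) * AVOGADRO * E_CHARGE**2
            * (2.0 * AVOGADRO) ** (1.0 / 3.0) / ERG_PER_CAL)

print(f"K = {density_prefactor_cal():.3e} cal (g/cm^3 per g/mol)^(-1/3)")
print(f"U(NaCl) = {lattice_energy_cal(R0_NACL):.3e} cal/mol")
```

The prefactor comes out close to 547 × 10³ only when the factor 8/9 is included, which supports reading the lattice energy as the full potential (attraction plus repulsion) at $r_0$.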

(c) The compressibility of polar crystals 

The potential curve shows why a crystal resists changes in volume.
As soon as the distance between two ions is increased from its equilibrium
value, a large increase in potential energy occurs, so that a large
amount of work must be expended. The force per ion pair is

$$F = -0.2905\,\frac{e^2}{r^2} + 0.2905\,\frac{e^2 r_0^8}{r^{10}}. \tag{40}$$

This force vanishes in the equilibrium position. We develop F in a
Taylor’s series at the point $r_0$ and obtain

$$F = 0 + \left(\frac{dF}{dr}\right)_{r_0}(r - r_0) + \dots = 0.2905\,e^2\left(\frac{2}{r_0^3} - \frac{10}{r_0^3}\right)(r - r_0) + \dots = -\frac{2.324\,e^2}{r_0^3}\,(r - r_0) + \dots \tag{41}$$

The work done by the internal forces when one mol of crystal is so
compressed that all interionic distances are changed from $r_0$ to $r$ is

$$W = 6L\int_{r_0}^{r} F\,dr = -6 \times 2.324\,\frac{L e^2}{r_0^3}\int_{r_0}^{r}(r - r_0)\,dr = -13.94\,\frac{L e^2}{r_0^3}\,\frac{(r - r_0)^2}{2}. \tag{42}$$

[...] this work [...] supplied from without. The work
done by the external forces is given by the integral

$$\int p\,dv.$$

* [...] cal/mol, since it represents a [...]

Since the pressure is zero at the start and P at the end, we obtain
an approximation to the integral by multiplying the total change in
volume by the mean pressure P/2. But according to equation (33)
(p. 574) the change in volume is given by

$$\Delta v = \Delta\!\left(\frac{L a^3}{4}\right) = \Delta(2Lr^3) = 6Lr_0^2\,(r - r_0). \tag{43}$$

Equating the two expressions for the work,

$$-\frac{13.94\,e^2 L}{2r_0^3}\,(r - r_0)^2 = \frac{P}{2}\,6Lr_0^2\,(r - r_0), \tag{44}$$

from which

$$P = -\frac{13.94\,e^2\,(r - r_0)}{6r_0^5} = -\frac{13.94\,e^2\,\Delta v}{36Lr_0^5 \cdot r_0^2} = -\frac{13.94\,e^2}{18r_0^4}\,\frac{\Delta v}{v} = -\frac{1}{\kappa}\,\frac{\Delta v}{v},$$

and the compressibility $\kappa$ becomes

$$\kappa = \frac{18r_0^4}{13.94\,e^2} = 5.60 \times 10^{18}\,r_0^4 = 3.50 \times 10^{17}\,a^4. \tag{45}$$

This relation is very well substantiated experimentally, but it should
be noted that the inverse tenth power, which has no other basis at
present, is fitted to the compressibility.
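The two numerical coefficients in the compressibility formula follow from $\kappa = 18 r_0^4 / (13.94\,e^2)$ and $r_0 = a/2$. The sketch below checks them; the value of the electronic charge and the NaCl spacing are assumed modern figures in CGS units, and the function name is illustrative only.

```python
E_CHARGE = 4.803e-10   # electronic charge in esu (assumed modern value)
R0_NACL = 2.82e-8      # nearest-neighbour distance in NaCl, cm (assumed)

def compressibility(r0_cm: float) -> float:
    """kappa in cm^2/dyne from kappa = 18 r0^4 / (13.94 e^2)."""
    return 18.0 * r0_cm**4 / (13.94 * E_CHARGE**2)

coeff_r0 = compressibility(1.0)   # coefficient of r0^4
coeff_a = coeff_r0 / 16.0         # coefficient of a^4, since r0 = a/2
kappa_nacl = compressibility(R0_NACL)

print(f"kappa = {coeff_r0:.3e} * r0^4 = {coeff_a:.3e} * a^4")
print(f"kappa(NaCl) = {kappa_nacl:.2e} cm^2/dyne")
```

For the assumed spacing the result is of the order of a few times 10⁻¹² cm²/dyne, i.e. a pressure of 10⁹ dyne/cm² (roughly a thousand atmospheres) changes the volume by a few tenths of one per cent.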


(d) Qualitative explanation of the thermal expansion of crystals 

The thermal motions of the particles constituting a crystal can
consist only of vibrations of these particles about the equilibrium
positions to which they are bound. When an atom is in the position
of equilibrium it is subject to no force, but as soon as it leaves this
position, restoring forces arise which, by equation (41), are proportional
to the displacement $r - r_0$, and so give rise to harmonic vibrations.
This proportionality between force and elongation, however,
represents only a very rough first approximation. If we consider the
potential curve (fig. 2, p. 575), we see that for larger displacements,
where the higher terms of the power series must be taken into account,
the forces become unsymmetric, since the curve is much steeper on
the approaching side than on the receding side. Hence if an atom has
a given amount of kinetic energy when it passes through the position
of equilibrium, this will become equal to the potential energy at a
smaller distance from the rest position on the near side than on the
far side. This means that the reversal of motion will take place sooner
on the near side. On the average, then, the distance between two
atoms will be increased, and this is the reason for the thermal expansion
of crystals (cf. Ex. 40, p. 103).



CHAPTER XXXV 

The Classical Statistics of Boltzmann

1. Entropy and Probability. 

We wish now to treat molecular phenomena somewhat more abstractly
by starting from a perfectly general relation which bridges
the gap between thermodynamics and statistics, and deducing from
it a large number of consequences.

From the example of the diffusion of two gases through each other,
a process accompanied by an increase of entropy, we recognize that there
must be an intimate connexion between the entropy of a system in
a given state and the probability of finding it in this state. If a box
containing equal numbers of white and red balls is thoroughly shaken,
[...] that of finding only white balls in the upper half and
[...] the probability has increased to a
[...] gases, for which this model is intended? Is it possible
[...] the respective given conditions
is, by the law of probability, [...] the product
of the separate probabilities [...]:

$$S_{12} = S_1 + S_2, \tag{1}$$

$$W_{12} = W_1 W_2. \tag{2}$$

What function satisfies the condition that

$$f(x_1 x_2) = f(x_1) + f(x_2)? \tag{3}$$


Obviously the function k log x is appropriate. We therefore take

$$S = k\log W. \tag{4}$$


This is the celebrated Boltzmann Relation— one of the most impor- 
tant equations in physics. We shall find later that the factor of pro- 
portionality k is the “ gas constant per molecule ” R/L. 

How is the probability of occurrence of a thermodynamical state 
to be expressed? The probability of an event is usually defined in the 
Theory of Probability as the ratio of the number of cases favourable 
to the occurrence of the event to the total number of possible cases. 
This probability has a maximum value of unity, corresponding to 
certainty. In thermodynamics, however, it has been found more con- 
venient to take simply the number of favourable cases as the measure 
of thermodynamic probability. Thus the thermodynamic probability 
is a large number. 

In counting the number of cases favourable to a given thermodynamical
state, the expression N! (“N factorial” = N(N − 1)(N − 2) . . . 1)
occurs, as in other branches of combinatorial analysis.
Since we deal here with numbers of the order of magnitude of L, i.e.
$10^{23}$, the evaluation of N! would be out of the question.

Fortunately, however, there is an approximate formula due to
Stirling which gives a good approximation when N is large. If we
write

$$\log N! = \log 1 + \log 2 + \log 3 + \log 4 + \dots + \log N,$$

we may look upon the series as a sum of rectangular strips whose
breadth is $\Delta x = 1$ and whose height is $\log x$. When x is large the
intervals of unit width may be regarded as differentials, so that the
sum may be replaced by the integral

$$\int_1^N \log x\,dx = \Big[x\log x - x\Big]_1^N = N\log N - N + 1.$$

Neglecting 1 as compared with N and taking the antilogarithm of
both members, we have the useful approximate formula for large
values of N:*

$$N! \approx \left(\frac{N}{e}\right)^N. \tag{5}$$
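How good the approximation $\log N! \approx N\log N - N$ is can be seen numerically. The sketch below compares it, and the more exact form quoted in the footnote, with the exact value obtained from the standard-library function `math.lgamma`; the helper names are illustrative.

```python
import math

def log_factorial_exact(n: int) -> float:
    """log(n!) via the log-gamma function, lgamma(n + 1) = log(n!)."""
    return math.lgamma(n + 1)

def stirling_simple(n: int) -> float:
    """The approximation used in the text: N log N - N."""
    return n * math.log(n) - n

def stirling_refined(n: int) -> float:
    """The more exact form: (N + 1/2) log N - N + (1/2) log(2 pi)."""
    return (n + 0.5) * math.log(n) - n + 0.5 * math.log(2 * math.pi)

for n in (10, 1000, 10**6):
    exact = log_factorial_exact(n)
    rel_err = (exact - stirling_simple(n)) / exact
    abs_err = exact - stirling_refined(n)
    print(f"N={n:>8}: relative error of simple form {rel_err:.2e}, "
          f"absolute error of refined form {abs_err:.2e}")
```

The relative error of the simple form falls steadily as N grows, so for N of the order of 10²³ it is entirely negligible; the residual error of the refined form is close to 1/(12N).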



2. Calculation of the most Probable Distribution of Density in an Ideal 
Gas. 

We shall calculate the thermodynamic probability and its maximum 
value, corresponding to the state of equilibrium, for the density dis- 
tribution in an ideal gas. Here we already know the result: at equili- 
brium the density must, of course, be the same at all points. Although 
the case is trivial, it is — on account of its simplicity— suitable as 
an explanatory example of the method of computing probabilities. 

* Stirling’s Theorem states more exactly that $\log(N!) \approx (N + \tfrac{1}{2})\log N - N + \tfrac{1}{2}\log(2\pi)$,
the error being approximately $1/(12N)$.

In the thermodynamic, macroscopic sense we characterize a given
state by giving the density as a function of position. But for an
observer capable of seeing and distinguishing between individual
molecules, a given state would be defined only by stating the co-ordinates
of each molecule. It is clear that any particular “macro-state”
can be realized by a large number of different “micro-states”,
for the interchange of two molecules does not alter the density distribution
in the least. We therefore look upon the number of micro-distributions
corresponding to a given macro-state as a measure of the
probability of occurrence of the thermodynamic state. In order to compute
this number, we divide the volume into a number of equal cells
of volume $\Delta\tau$. The density is then given by the ratio of the number
of molecules $N_i$ in each cell to the volume of the cell. The cells are to
be small, but nevertheless large enough to include a great number
of molecules, so that the density $N_i/\Delta\tau$ remains a continuous function
of position. If we were to make each cell of the order of magnitude
of the average space available to a molecule, we would notice relatively
large discontinuities in density from cell to cell, according as 0, 1,
2, 3 . . . molecules happen to be in each.

Let the cells and the molecules be numbered, and suppose that
the molecules are thrown into the subdivided volume much like a
handful of glass beads. Since the cells are all of equal size, it is equally
probable whether a molecule X falls into the cell y or the cell z. That
is, the division into equal cells has assured equal a priori probability
of the cells, as it is usually expressed. This equal a priori probability
is not disturbed if there are already several molecules in a cell; it is
assumed that there is sufficient room for others.* Thus, in order to
compute the thermodynamic probability, we need only count the
number of ways in which N molecules can be distributed among Z
cells in such a way that there are $N_i$ molecules in the ith cell. How
often this state can be produced is best seen from an example. Let
the following distribution be given:

Cell:                  1  2  3  4  5  6
Number of molecules:   1  0  2  0  3  4

This distribution of ten molecules among six cells may be realized as follows:

Molecule number:  1  2  3  4  5  6  7  8  9  10
In cell number:   6  1  6  5  6  5  3  3  5  6

[...] there are permutations
of the cell numbers of the molecules. It is to be noticed that the change
in position of the molecules within one cell does not yield a new micro-state
(or “complexion” as it was termed by Gibbs and Boltzmann),
for the micro-state is determined solely by giving the cell numbers.
Thus the probability of the distribution is equal to the number of
permutations of the lower row of the preceding table. If all the cell
numbers were different this would be N!, but since, for example, the
number six occurs four times, all permutations of these four numbers
among each other are omitted. On account of the fourfold occurrence
of the number six we then have only the 1/4! part. In general,

* [...] forms the chief [...] difference between the classical statistics and a
new [...]

$$W = \frac{N!}{N_1!\,N_2!\,\dots} = \frac{N!}{\prod_i N_i!}. \tag{6}$$

Applying the Stirling approximation and remembering that $\sum N_i = N$,
we have

$$W = \frac{N^N}{\prod_i N_i^{N_i}}. \tag{7}$$
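For the example of ten molecules in six cells the count of micro-states can be carried out exactly. The sketch below is illustrative (the function name `microstates` is not from the text); it evaluates $N!/\prod N_i!$ and compares its logarithm with the Stirling form $N\log N - \sum N_i\log N_i$.

```python
from math import factorial, log

def microstates(occupation):
    """Number of micro-states N! / (N_1! N_2! ...) for given cell occupations."""
    w = factorial(sum(occupation))
    for n_i in occupation:
        w //= factorial(n_i)   # each partial quotient is an exact integer
    return w

example = [1, 0, 2, 0, 3, 4]   # occupations of cells 1..6 from the table
W = microstates(example)

n_total = sum(example)
log_w_stirling = n_total * log(n_total) - sum(n * log(n) for n in example if n > 0)

print("W =", W)
print("log W (exact)    =", log(W))
print("log W (Stirling) =", log_w_stirling)
```

The exact count is 12600. With only ten molecules the Stirling form is still rather crude, which illustrates why the cells must contain a great number of molecules before the approximation is used.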




We now introduce f(i), a density function of the cell index, defined by

$$N_i = f(i)\,\Delta\tau_i = f(i)\,\Delta\tau.$$

As long as all the cells have the same volume, we can omit the subscript
on $\Delta\tau$. The entropy then becomes

$$S = k\Big(N\log N - \sum N_i\log N_i\Big) = k\Big\{N\log N - \sum f(i)\,\Delta\tau\,\log f(i) - \sum f(i)\,\Delta\tau\,\log\Delta\tau\Big\}. \tag{8}$$

The equilibrium value is characterized by the vanishing of the first
variation:

$$-\frac{\delta S}{k} = 0 = \sum\delta f(i)\,\Delta\tau\,\log f(i) + \sum\delta f(i)\,\Delta\tau + \sum\delta f(i)\,\Delta\tau\,\log\Delta\tau. \tag{9}$$

It is to be remembered that an auxiliary condition is given by the
fact that the total number of molecules is to remain constant:

$$\sum f(i)\,\Delta\tau = N, \quad\text{i.e.}\quad \sum\delta f(i)\,\Delta\tau = 0. \tag{10}$$

The variations of the distribution function are therefore not arbitrary;
equation (10) must always be satisfied. If we apply the method of
undetermined coefficients (p. 115), multiplying equation (10) by $\alpha$ and
adding it to equation (9), the coefficients of the variations may be
set equal to zero separately, and there results

$$\log f(i) + \alpha + 1 + \log\Delta\tau = 0 \quad\text{or}\quad f(i) = \text{const}. \tag{11}$$

As expected, the most probable distribution is that of uniform density
throughout. From equation (10), the value of the constant turns out
to be N/V, where V is the total volume.
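For small numbers the maximum of the thermodynamic probability can be found by direct enumeration instead of by the variational argument. The sketch below is an illustration with assumed toy values (12 molecules, 3 equal cells) and hypothetical function names; it lists every occupation compatible with the fixed total and picks the one with the largest $N!/\prod N_i!$.

```python
from itertools import product
from math import factorial

def microstates(occupation):
    """Number of micro-states N! / (N_1! N_2! ...)."""
    w = factorial(sum(occupation))
    for n_i in occupation:
        w //= factorial(n_i)
    return w

def most_probable(n_molecules, n_cells):
    """Occupation with the largest W among all those with the given total."""
    best = None
    for occupation in product(range(n_molecules + 1), repeat=n_cells):
        if sum(occupation) != n_molecules:
            continue   # auxiliary condition: total number of molecules fixed
        w = microstates(occupation)
        if best is None or w > best[1]:
            best = (occupation, w)
    return best

occupation, w_max = most_probable(12, 3)
print("most probable occupation:", occupation, "with W =", w_max)
```

The enumeration singles out the uniform occupation, in agreement with f(i) = const. It is feasible only for toy sizes; for realistic N the Stirling approximation and the method of undetermined coefficients are indispensable.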

The division into regions of equal a priori probability is too narrow
a limitation. In many instances, e.g. in the distribution over the surface
of a sphere, a division into zones of equal size is quite impossible;
moreover, in quantum statistics it often happens that the individual
cells must be given different a priori probabilities (weights), which is
naturally equivalent to making the sizes of the cells different.* How
are we to proceed in this more general case in which each cell has
a volume $\Delta\tau_i$ of a priori probability $g_i$? In any case, we set the total
weight $G_i$ equal to the cell volume multiplied by $g_i$. If the distribution
is such that one of the cells has twice the volume of any of the others
(which have equal volumes), the probability that a molecule a should
fall in this cell will be doubled, as will that for molecules b and c. Then
the probability that all three molecules fall in this cell is the product
of the individual values, and so increases to $2^3$. In general, the factor
to be applied to a combination in which $N_i$ molecules fall in a cell
having a weight $G_i$ is $G_i^{N_i}$. Thus a combination in which there are $N_1$
molecules in the first cell, $N_2$ in the second, &c., has a factor $G_1^{N_1} G_2^{N_2} \dots$,
and we obtain

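$$W = \frac{N!\,\prod G_i^{N_i}}{\prod N_i!} = \frac{N^N\prod G_i^{N_i}}{\prod N_i^{N_i}} = \frac{N^N\prod G_i^{\,f(i)\Delta\tau_i}}{\prod\,[f(i)\Delta\tau_i]^{f(i)\Delta\tau_i}}, \tag{12}$$

$$S = k\Big[N\log N + \sum f(i)\,\Delta\tau_i\log G_i - \sum f(i)\,\Delta\tau_i\log f(i) - \sum f(i)\,\Delta\tau_i\log\Delta\tau_i\Big]. \tag{13}$$

The calculation of the most probable distribution is carried out exactly
as in the previous example. We have immediately

$$\frac{\delta S}{k} = 0 = \sum\delta f(i)\,\Delta\tau_i\log G_i - \sum\delta f(i)\,\Delta\tau_i\log f(i) - \sum\delta f(i)\,\Delta\tau_i - \sum\delta f(i)\,\Delta\tau_i\log\Delta\tau_i, \tag{14}$$

with the auxiliary condition

$$\delta N = \sum\delta f(i)\,\Delta\tau_i = 0. \tag{15}$$

Multiplying by $-\alpha$ and adding equation (14), we obtain

$$\log G_i - \log f(i) - 1 - \log\Delta\tau_i - \alpha = 0, \tag{16}$$

$$f(i) = \text{const}\cdot\frac{G_i}{\Delta\tau_i}. \tag{17}$$

Since $\sum f(i)\,\Delta\tau_i = N$, the constant is $N/\sum G_i$, so that the most probable
distribution is represented by the function

$$f(i) = \frac{N G_i}{\Delta\tau_i\sum G_i} \quad\text{or}\quad f(i) = \frac{N g_i}{\sum g_i\,\Delta\tau_i}. \tag{18}$$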


* In [...] instances, a priori probabilities have been obtained in such a manner
that [...] with experience was realized after their introduction, so that here
we are [...] deduced from experience, a posteriori probabilities,
whose [...] foundation (and hence transformation into true a priori
probabilities) must be left to further development.

Thus even if the cells differ in size, as long as they otherwise have no
special a priori probabilities, the most probable distribution is that of
uniform density.
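The same enumeration can be repeated for cells of unequal weight. In the sketch below the weights (1, 2, 3) stand for cells of unequal volume with equal $g_i$; these numbers, like the function names, are assumptions made for illustration. The probability of an occupation is taken as $N!\,\prod G_i^{N_i}/\prod N_i!$.

```python
from itertools import product
from math import factorial

def weighted_probability(occupation, weights):
    """W = N! * prod(G_i**N_i) / prod(N_i!) for integer weights G_i."""
    w = factorial(sum(occupation))
    for n_i, g_i in zip(occupation, weights):
        w = w * g_i**n_i // factorial(n_i)   # division is exact at every step
    return w

def most_probable_weighted(n_molecules, weights):
    """Occupation maximizing W under the fixed total number of molecules."""
    best = None
    for occupation in product(range(n_molecules + 1), repeat=len(weights)):
        if sum(occupation) != n_molecules:
            continue
        w = weighted_probability(occupation, weights)
        if best is None or w > best[1]:
            best = (occupation, w)
    return best

weights = (1, 2, 3)   # cell volumes in arbitrary units (assumed)
occupation, _ = most_probable_weighted(12, weights)
densities = [n / g for n, g in zip(occupation, weights)]
print("most probable occupation:", occupation)
print("density in each cell:   ", densities)
```

The occupations come out proportional to the weights, so the density is the same in every cell, as the text concludes for cells that differ only in size.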


Ex. 128. It follows from the formulae of p. 530 that, if a paramagnetic substance
could exist in an unmagnetized state at absolute zero (this is denied by
the Nernst theorem), its entropy per mol would in the simplest case be greater
than in the magnetized condition by R log 2. Deduce this result statistically,
remembering that in the simplest case only the orientations parallel and
antiparallel to the field are permitted.


3. Representation of the Distribution of Position and Velocity by means 
of Phase Space. Liouville’s Theorem. 

We wish to know not only the position, but also the distribution 
of the velocities of the gas molecules in the state of equilibrium. In 
order to compute the probability of a given state, it is natural to seek 
to apply the procedure of the last section to this more complicated 
case. To do this, we need only replace actual three-dimensional space 
by a hypothetical six-dimensional space in which the co-ordinates of 
a point are the three space co-ordinates of a molecule together with 
its three velocity components.* Thus every molecule in actual three- 
dimensional space has a representative point in this six-dimensional 
space. One might raise an objection to this procedure. In ordinary 
space it is immediately evident that the a priori probability of a cell 
is proportional to its size, but how can we be certain about the a priori 
probability of a cell in the hyperspace? If, instead of taking the velocity
components, we choose the general momentum components p k (cf. 
p. 121), the space built up of the position co-ordinates q k and the 
momentum co-ordinates p k is called a phase space. For this space it 
can be shown that the cells of equal a priori probability are of equal 
size or — what is the same thing — that the a priori probability of a 
cell of phase space is proportional to the size of this cell. For if we 
consider a volume element At of this space, representative points will 
enter and leave through the bounding surface of the element, and we 
can compute the “source-strength” (divergence) for the element 
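exactly as in the three-dimensional case. If

$$\mathbf{r} = q_1\mathbf{e}_1 + q_2\mathbf{e}_2 + q_3\mathbf{e}_3 + p_1\mathbf{e}_4 + p_2\mathbf{e}_5 + p_3\mathbf{e}_6 \tag{20}$$

is the position vector of a representative point, its velocity is

$$\mathbf{C} = C_1\mathbf{e}_1 + C_2\mathbf{e}_2 + C_3\mathbf{e}_3 + C_4\mathbf{e}_4 + C_5\mathbf{e}_5 + C_6\mathbf{e}_6 = \frac{dq_1}{dt}\mathbf{e}_1 + \frac{dq_2}{dt}\mathbf{e}_2 + \frac{dq_3}{dt}\mathbf{e}_3 + \frac{dp_1}{dt}\mathbf{e}_4 + \frac{dp_2}{dt}\mathbf{e}_5 + \frac{dp_3}{dt}\mathbf{e}_6. \tag{21}$$

* For the present, we limit our considerations to monatomic molecules, whose
condition is uniquely determined by the position and velocity of the centre of gravity.
For polyatomic molecules we have angular co-ordinates and velocities also.

Choosing a small six-dimensional cube as the element of volume, we
have, exactly as in the three-dimensional case,

$$\operatorname{div}\mathbf{C} = \frac{\partial C_1}{\partial q_1} + \frac{\partial C_2}{\partial q_2} + \frac{\partial C_3}{\partial q_3} + \frac{\partial C_4}{\partial p_1} + \frac{\partial C_5}{\partial p_2} + \frac{\partial C_6}{\partial p_3} = \frac{\partial}{\partial q_1}\frac{dq_1}{dt} + \frac{\partial}{\partial q_2}\frac{dq_2}{dt} + \frac{\partial}{\partial q_3}\frac{dq_3}{dt} + \frac{\partial}{\partial p_1}\frac{dp_1}{dt} + \frac{\partial}{\partial p_2}\frac{dp_2}{dt} + \frac{\partial}{\partial p_3}\frac{dp_3}{dt}. \tag{22}$$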


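Now the molecules themselves obey the fundamental laws of mechanics,
which we may write in the Hamiltonian form:

$$\frac{dp_k}{dt} = -\frac{\partial H}{\partial q_k}, \qquad \frac{dq_k}{dt} = +\frac{\partial H}{\partial p_k}, \tag{23}$$

whence it follows that

$$\frac{\partial}{\partial p_k}\frac{dp_k}{dt} + \frac{\partial}{\partial q_k}\frac{dq_k}{dt} = 0. \tag{24}$$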


On account of this last equation, every term in (22) cancels out the
other term having the same subscript, and so the entire right-hand
member vanishes. Now the vanishing of the divergence of the velocity
is characteristic of the flow of an incompressible fluid; if a given
number of particles (in our case, representative points) occupy an
element of volume $\Delta\tau$ at a certain time, they will occupy another
volume of equal size at a later time. Thus the relationship between
two equal volume elements in the phase space is such that if one is
occupied by $N_i$ points the other is also filled with $N_i$ points. Since,
with the passing of time, all elements of the phase space are traversed
by the flow, all elements of equal size have the same a priori probability.
This is Liouville’s Theorem. We have thus found a basis for computing
the probability of a distribution in position and velocity. These
considerations [...]
[...] a diatomic molecule may be visualized by means of a model having
[...] sphere of influence deduced in the elementary
[...]. In the case of
[...] the axis, so that
the appropriate [...] number of dimensions. But
since Hamilton’s Equations hold for any two canonically conjugate
variables, regardless of how many degrees of freedom the system
has, the Liouville Theorem concerning the equal probability of equal
elements is always valid as long as the particles whose representative
points move in the phase space follow the laws of mechanics.
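The cancellation of terms that underlies Liouville’s Theorem can be checked numerically for one degree of freedom. The sketch below is an illustration only: the Hamiltonian is an arbitrary, deliberately non-separable example chosen for this purpose, and the derivatives are approximated by central differences with assumed step sizes. It forms the phase-space velocity from Hamilton’s equations and evaluates its divergence.

```python
import math

def hamiltonian(q: float, p: float) -> float:
    """Illustrative Hamiltonian (an assumption): H = p^2 (1 + q^2) / 2 + 1 - cos q."""
    return 0.5 * p * p * (1.0 + q * q) + 1.0 - math.cos(q)

def phase_velocity(q: float, p: float, h: float = 1e-5):
    """(dq/dt, dp/dt) = (dH/dp, -dH/dq), by central differences."""
    dq_dt = (hamiltonian(q, p + h) - hamiltonian(q, p - h)) / (2 * h)
    dp_dt = -(hamiltonian(q + h, p) - hamiltonian(q - h, p)) / (2 * h)
    return dq_dt, dp_dt

def divergence(q: float, p: float, h: float = 1e-4) -> float:
    """d(dq/dt)/dq + d(dp/dt)/dp at the phase point (q, p)."""
    d_qdot_dq = (phase_velocity(q + h, p)[0] - phase_velocity(q - h, p)[0]) / (2 * h)
    d_pdot_dp = (phase_velocity(q, p + h)[1] - phase_velocity(q, p - h)[1]) / (2 * h)
    return d_qdot_dq + d_pdot_dp

for q, p in [(0.3, 0.7), (1.5, -2.0), (-2.0, 0.1)]:
    print(f"q={q:+.1f}, p={p:+.1f}: div C = {divergence(q, p):+.2e}")
```

Each of the two partial derivatives is individually of order unity at these points, yet their sum vanishes to within rounding error, which is the statement that the flow of representative points is incompressible.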


4. The Maxwell-Boltzmann Energy Distribution. 

We are now in a position to apply the method of § 2 (p. 580) literally 
to the phase space. By so doing, we can answer the question of how 
the total energy U of a system is distributed among the individual 
particles. The total energy of an ideal, monatomic gas, for example, 
is made up of the kinetic energy of translation of all the atoms. How 
is this energy distributed among the various atoms? Do all atoms 
move with the same speed, as we assumed for simplicity in the pre- 
ceding chapter? A definite energy u(i) corresponds to each cell of the 
phase space; this is the energy possessed by a molecule whose repre- 
sentative point falls within this cell. For generality, assume that the 
cells are of various sizes. Then, by Liouville’s theorem, each will have
an a priori probability proportional to its size. If, as in § 2 (p. 582),
we again define a distribution function f(i) by $N_i = f(i)\,\Delta\tau_i$, the
probability becomes

$$W = \frac{N^N\prod\Delta\tau_i^{\,f(i)\Delta\tau_i}}{\prod\,[f(i)\Delta\tau_i]^{f(i)\Delta\tau_i}} \tag{25}$$

and the entropy is

$$S = kN\log N - k\sum f(i)\,\Delta\tau_i\log f(i). \tag{26}$$

The maximum value of $S$ is to be determined under the auxiliary
conditions that both the number of particles and the total energy $U$
are to remain constant. The former condition may be stated

$$N = \sum_i f(i)\,\Delta\tau_i, \quad\text{i.e.}\quad \delta N = \sum_i \delta f(i)\,\Delta\tau_i = 0. \tag{27}$$

The energy of the particles whose representative points lie in the $i$th
cell of energy $u(i)$ is obviously $N_i u(i)$, so that the total energy is

$$U = \sum_i u(i)\,f(i)\,\Delta\tau_i, \quad\text{whence}\quad \delta U = \sum_i \delta f(i)\,u(i)\,\Delta\tau_i = 0. \tag{28}$$

The variation of the entropy again leads to the expression

$$-\frac{\delta S}{k} = 0 = \sum_i \delta f(i)\,\Delta\tau_i \log f(i) + \sum_i \delta f(i)\,\Delta\tau_i. \tag{29}$$

If we again apply the method of undetermined multipliers by multiplying
the first auxiliary condition by $\alpha$, the second by $\beta$, and adding
the resulting equations to (29), we obtain

$$\log f(i) + 1 + \alpha + \beta u(i) = 0, \tag{30}$$

or, setting $e^{-(1+\alpha)} = A$,

$$f(i) = A\,e^{-\beta u(i)}. \tag{31}$$

The constant $A$, which is connected with the multiplier $\alpha$, is again
determined from the total number of particles, for

$$N = A\sum_i e^{-\beta u(i)}\,\Delta\tau_i, \tag{32}$$

so that

$$f(i) = \frac{N e^{-\beta u(i)}}{\sum_i e^{-\beta u(i)}\,\Delta\tau_i}. \tag{33}$$

The sum appearing in the denominator is called the partition sum of
the system. We shall set this extremely important quantity equal
to $\sigma$:

$$\sigma = \sum_i e^{-\beta u(i)}\,\Delta\tau_i. \tag{34}$$

To determine the multiplier $\beta$ we must return to the entropy. Putting

$$f(i) = \frac{N}{\sigma}\,e^{-\beta u(i)}, \tag{33'}$$

the entropy becomes

$$S = kN\log N - k\sum_i \frac{N}{\sigma}\,e^{-\beta u(i)}\,\Delta\tau_i \left(\log N - \beta u(i) - \log\sigma\right). \tag{35}$$

On account of (34), and since

$$\frac{N}{\sigma}\sum_i e^{-\beta u(i)}\,u(i)\,\Delta\tau_i = U, \tag{36}$$

our expression simplifies to

$$S = k\beta U + kN\log\sigma. \tag{37}$$

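In order to express $\beta$ as a function of $T$, we use the thermodynamic
relation (cf. p. 526)

$$\frac{1}{T} = \left(\frac{\partial S}{\partial U}\right)_V. \tag{38}$$

Further, equation (36) shows that $U$ and $\beta$ are functionally related:

$$\left(\frac{\partial S}{\partial U}\right)_V = \frac{dS}{d\beta}\left(\frac{d\beta}{dU}\right)_V = \frac{dS}{d\beta}\,\frac{1}{\left(\dfrac{dU}{d\beta}\right)_V}. \tag{39}$$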

p .1 IS. whe i^^aantjferdeDot^^ere by fwe“ 6 f^otto m '? ti P H8ra on 

two simple equations aw £ £ “ £ P on «« other Kf 

subscript, since it was wmovS lon ** «*** thTSSi 
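Carrying out the differentiation, we have by equation (37):

$$\frac{dS}{d\beta} = kU + k\beta\frac{dU}{d\beta} + \frac{kN}{\sigma}\frac{d\sigma}{d\beta}, \tag{40}$$

and from (34),

$$\frac{d\sigma}{d\beta} = -\sum_i u(i)\,e^{-\beta u(i)}\,\Delta\tau_i = -\frac{U\sigma}{N}. \tag{41}$$

Hence, by (39) there remains

$$\frac{1}{T} = k\beta, \qquad \beta = \frac{1}{kT}. \tag{42}$$

The distribution function then becomes

$$f(i) = \frac{N}{\sigma}\,e^{-u(i)/kT}. \tag{43}$$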



This is the celebrated Maxwell-Boltzmann Law of Energy Distri- 
bution. In the next section we shall learn several of its applications. 
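
The result (43) can be tried out numerically. The following is only a minimal sketch: the four cells, their energies and sizes, and the helper name `boltzmann_populations` are illustrative assumptions, not taken from the text. It fills the cells according to (43), and then checks that the entropy computed directly from (26) agrees with the closed form (37) with $\beta = 1/kT$ (both in units of $k$).

```python
import math

def boltzmann_populations(energies, cell_sizes, n_particles, kT):
    """Occupation numbers N_i = f(i) * dtau_i following eqs. (34) and (43)."""
    weights = [math.exp(-u / kT) * dtau for u, dtau in zip(energies, cell_sizes)]
    sigma = sum(weights)                      # partition sum, eq. (34)
    populations = [n_particles * w / sigma for w in weights]
    return populations, sigma

# Hypothetical four-cell system; energies are measured in units of kT.
energies = [0.0, 1.0, 2.0, 3.0]
cell_sizes = [1.0, 1.0, 2.0, 1.0]
n_particles, kT = 1000.0, 1.0

populations, sigma = boltzmann_populations(energies, cell_sizes, n_particles, kT)
total_energy = sum(n * u for n, u in zip(populations, energies))

# Entropy in units of k: once from eq. (26), once from eq. (37) with beta = 1/kT.
entropy_direct = n_particles * math.log(n_particles) - sum(
    n * math.log(n / dtau) for n, dtau in zip(populations, cell_sizes)
)
entropy_closed = total_energy / kT + n_particles * math.log(sigma)

print(populations, total_energy, entropy_direct, entropy_closed)
```

The third cell is twice as large as the others, so it holds twice as many particles as a unit cell of the same energy would, which is the a priori weighting required by Liouville's theorem.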

Having determined $\beta$, we obtain a simple thermodynamical
interpretation of the quantity $\sigma$: putting the value $1/kT$ for $\beta$
in equation (37), there results

$$S = \frac{U}{T} + kN\log\sigma.$$


However, the dimensions are not yet correct in this formula. As is
seen from equation (34), $\sigma$ has the dimensions of a volume of the phase
space. On the other hand, the argument of a logarithm must be
dimensionless. Now, in the classical thermodynamics $S$ is determined
only to an additive constant, so that we are at liberty to add constants
to the right member in order to make its dimension zero. Such
a constant is $-kN\log\Delta\tau$, where $\Delta\tau$ is, say, an arbitrary measure of
the volume of the phase space element. Since the product of a position
co-ordinate by a momentum co-ordinate has the dimensions of energy
× time, which we call action, $[h]$, $\Delta\tau$ has the dimension $[h]^s$ if
there are $s$ position co-ordinates. If we combine this added term
with the term $kN\log\sigma$, we see that the unit of phase space volume
must be taken to be $[h]^s$, where the magnitude of $[h]$ is still
undetermined. But the Quantum Theory will show that $[h]$ has a perfectly
definite numerical value. We must therefore write

$$S = \frac{U}{T} + kN\log\frac{\sigma}{[h]^s}. \tag{44}$$

Then the free energy becomes

$$F = U - TS = -kNT\log\frac{\sigma}{[h]^s}. \tag{45}$$

5. Applications of the Maxwell-Boltzmann Energy Distribution.

(a) Halley's Law and the equilibrium of sedimentation

Assume that the energy of a particle of mass $m$ depends only upon
one position co-ordinate. For example, let the particle be in the earth's
gravitational field and have a potential energy $mgz$. Then, if we take
as cells the parallelepipeds of base unity and altitude $\Delta z$, the number
of particles between the levels $z$ and $z + \Delta z$ is given by (43):

$$\Delta N(z) = \frac{N}{\sigma}\,e^{-mgz/kT}\,\Delta z, \tag{46}$$

or the density is given by

$$\frac{\Delta N}{\Delta z} = \frac{N}{\sigma}\,e^{-mgz/kT}. \tag{46'}$$

Instead of the function $\sigma$ we introduce here simply the density $\rho_0$ at
the level $z = 0$, obtaining

$$\rho = \rho_0\,e^{-mgz/kT}. \tag{47}$$

This formula includes the Halley formula (p. 192) for the variation
of density (or pressure) with altitude in an isothermal
atmosphere, for if we replace $\rho_0$ in the exponent of equation (4)
(p. 192) by the value $M/v_0$, where $M$ is the molecular weight,
and if we use the relation $p_0 v_0 = RT$, the exponent becomes
identical with that of equation (47), since it will be shown later that
$M/R = m/k$.

This formula is valid also for the distribution of visible colloid
particles. If, for example, we are dealing with a mastic emulsion in
water, the mass $m$ is to be replaced by the actual mass of each particle
diminished by the mass of the water it displaces (the hydrostatic
buoyancy). This means that in an emulsion, too, the number of particles
must decrease exponentially with the height, and this is actually
observed. Thus, if the masses of individual particles can be found in
any way, counting of the numbers of particles at various levels will
serve to determine the Boltzmann constant $k$, and hence the Avogadro
constant $L$ by means of the relation $k = R/L$ which will be derived below.
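
The counting method just described amounts to inverting (47) for $k$. The sketch below shows the arithmetic only; the particle mass, level separation and counts are invented for illustration (they are not measured data), and the function names are hypothetical. CGS units are used, as in the text.

```python
import math

G = 981.0  # gravitational acceleration in cm/s^2

def density_ratio(m_eff, dz, temperature, k):
    """n(z + dz) / n(z) from eq. (47); m_eff is the buoyancy-corrected mass in g."""
    return math.exp(-m_eff * G * dz / (k * temperature))

def boltzmann_constant_from_counts(n_low, n_high, m_eff, dz, temperature):
    """Invert eq. (47): k = m_eff * g * dz / (T * log(n_low / n_high))."""
    return m_eff * G * dz / (temperature * math.log(n_low / n_high))

# Illustrative numbers only (assumptions, not measurements).
m_eff = 1.0e-14        # g, effective mass of one colloid particle
dz = 1.0e-3            # cm, vertical separation of the two levels
temperature = 293.0    # K
k_true = 1.380e-16     # erg/deg, value quoted in eq. (70)

# Simulate the counts at the two levels, then recover k from them.
n_low = 1000.0
n_high = n_low * density_ratio(m_eff, dz, temperature, k_true)
k_recovered = boltzmann_constant_from_counts(n_low, n_high, m_eff, dz, temperature)
avogadro = 8.313e7 / k_recovered   # L = R / k

print(n_high, k_recovered, avogadro)
```

In a real experiment the statistical scatter of the counts would limit the accuracy; the round trip here only confirms that the inversion is consistent with (47).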

(b) Angular distribution of the axes of elementary magnets in a 

magnetic field 

It is now possible to justify the formula used on p. 465 for the
distribution of the angle $\theta$ which the axis of a dipole makes with the

<£nTa™ r effeTonfh ^ •» ■» points, the die- 

90 thttt th9 * tn. ..oo^^X on be of 
field. The potential energy of a small magnet of moment $p$ in a field
$H$ is given by

$$u = -pH\cos\theta. \tag{48}$$

The phase space is, in this case, the surface of a unit sphere upon
which we can mark the several axial positions by points. The division
into cells is best made by taking zones between the angles $\theta$ and
$\theta + \Delta\theta$. These zones, however, are not of equal size, since their areas,
for equal intervals $\Delta\theta$, are proportional to $\sin\theta\,\Delta\theta$. We then have for
the number of magnets whose axes make an angle with the field which
lies between $\theta$ and $\theta + \Delta\theta$:

$$\Delta N = \frac{2\pi N\,e^{pH\cos\theta/kT}\sin\theta\,\Delta\theta}{\sigma}. \tag{49}$$

Since there is no restriction on the size of the intervals in classical
statistics, we may make them so small that the sum may be replaced
by an integral. Hence in the denominator we have

$$\sigma = 2\pi\int_0^{\pi} e^{pH\cos\theta/kT}\sin\theta\,d\theta. \tag{50}$$


The rest of the computation has already been carried out on p. 456 
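
As a numerical cross-check of (49) and (50), the mean value of $\cos\theta$ can be computed directly from the zone weights $e^{pH\cos\theta/kT}\sin\theta$. The sketch below is hedged: the function names are hypothetical, and the closed form $\coth a - 1/a$ with $a = pH/kT$ (the Langevin function) is quoted as a standard result which is presumably the one referred to above; that identification is an assumption, since the earlier page is not part of this section.

```python
import math

def mean_cos_theta(a, steps=20_000):
    """Average of cos(theta) with weights exp(a cos(theta)) sin(theta); a = pH/kT."""
    h = math.pi / steps
    num = den = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h                  # midpoint of the i-th zone
        weight = math.exp(a * math.cos(theta)) * math.sin(theta)
        num += math.cos(theta) * weight
        den += weight
    return num / den

def langevin(a):
    """Closed form coth(a) - 1/a (assumed to be the earlier result)."""
    return 1.0 / math.tanh(a) - 1.0 / a

for a in (0.03, 1.0, 5.0):
    print(a, mean_cos_theta(a), langevin(a))
```

For small $a$ the mean is close to $a/3$, i.e. the alignment is proportional to the field and inversely proportional to the temperature.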


(c) Distribution of velocity in an ideal gas 

We now investigate the distribution of the velocities of the mole- 
cules of an ideal gas, limiting our considerations to the translatory 
motion of the molecular centres of gravity. Any rotational motions 
of polyatomic molecules about their centres of gravity will not be 
considered. These rotations can have no effect on the distribution of 
translational velocities, so that the results hold not only for monatomic 
gases, but for any molecular structures. The momenta are, in
rectangular co-ordinates,

$$p_x = m\dot{x}, \quad p_y = m\dot{y}, \quad p_z = m\dot{z}, \quad\text{i.e.}\quad p = m\sqrt{\dot{x}^2 + \dot{y}^2 + \dot{z}^2} = mc, \tag{51}$$

and we have

$$u = \tfrac{1}{2}mc^2 = \frac{1}{2m}\left(p_x^2 + p_y^2 + p_z^2\right) = \frac{p^2}{2m}. \tag{52}$$

The number of molecules whose centres of gravity lie between $x$ and
$x + \Delta x$, $y$ and $y + \Delta y$, $z$ and $z + \Delta z$, and whose momenta are
between $p_x$ and $p_x + \Delta p_x$, $p_y$ and $p_y + \Delta p_y$, $p_z$ and $p_z + \Delta p_z$
is thus given by

$$\Delta N = \frac{N}{\sigma}\,e^{-(p_x^2 + p_y^2 + p_z^2)/2mkT}\,\Delta x\,\Delta y\,\Delta z\,\Delta p_x\,\Delta p_y\,\Delta p_z. \tag{53}$$

Dividing both sides by the element of volume, we obtain the number
$\Delta n$ of molecules per cubic centimetre whose momenta lie between the
given limits. If we again replace the summation by an integration,
the denominator becomes

$$\sigma = \int\!\cdots\!\int e^{-(p_x^2 + p_y^2 + p_z^2)/2mkT}\,dx\,dy\,dz\,dp_x\,dp_y\,dp_z = V\iiint e^{-(p_x^2 + p_y^2 + p_z^2)/2mkT}\,dp_x\,dp_y\,dp_z = VI^3. \tag{54}$$

The integral $I$, which occurs here three times (with different variables,
but of the same form), cannot be evaluated as an indefinite integral
in terms of elementary functions, but the definite integral between
the limits $-\infty$ and $+\infty$ can be computed with the aid of a simple
artifice. Set

$$\frac{p_x}{\sqrt{2mkT}} = \xi, \qquad dp_x = \sqrt{2mkT}\,d\xi. \tag{55}$$

The integral is then

$$I = \sqrt{2mkT}\int_{-\infty}^{+\infty} e^{-\xi^2}\,d\xi. \tag{56}$$

If the same integral is now written with the variable of integration $\eta$
and multiplied by the integral in $\xi$, we obtain

$$I^2 = 2mkT\int_{-\infty}^{+\infty} e^{-\xi^2}\,d\xi \int_{-\infty}^{+\infty} e^{-\eta^2}\,d\eta = 2mkT\int_{-\infty}^{+\infty}\!\!\int_{-\infty}^{+\infty} e^{-(\xi^2 + \eta^2)}\,d\xi\,d\eta. \tag{57}$$

Now the double integral is readily evaluated in polar co-ordinates.
If we allow the polar angle to go from 0 to $2\pi$ and the radius vector
$\rho$ from 0 to $\infty$, the plane will be completely covered, just as if the
rectangular co-ordinates $\xi$ and $\eta$ were allowed to vary between the limits
$-\infty$ and $+\infty$. Hence

$$I^2 = 4\pi mkT\int_0^{\infty} e^{-\rho^2}\rho\,d\rho = -2\pi mkT\left[e^{-\rho^2}\right]_0^{\infty} = 2\pi mkT. \tag{58}$$

Then

$$\sigma = V(2\pi mkT)^{3/2}. \tag{59}$$
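
The value of the definite integral used in (56) to (59) is easy to confirm numerically. The following sketch uses a plain midpoint rule from the standard library; the nitrogen mass, temperature and unit volume are illustrative assumptions, and the function names are hypothetical.

```python
import math

def gaussian_integral(limit=8.0, steps=100_000):
    """Midpoint-rule value of the integral of exp(-xi^2) over (-limit, +limit)."""
    h = 2.0 * limit / steps
    return h * sum(math.exp(-((-limit + (i + 0.5) * h) ** 2)) for i in range(steps))

def momentum_integral(mass, kT):
    """I of eq. (56): sqrt(2 m k T) times the integral of exp(-xi^2)."""
    return math.sqrt(2.0 * mass * kT) * gaussian_integral()

def sigma_ideal_gas(volume, mass, kT):
    """Partition sum of eq. (54), sigma = V * I^3."""
    return volume * momentum_integral(mass, kT) ** 3

# Assumed example: one nitrogen molecule (28 g/mol) at 273 K, V = 1 cm^3, CGS units.
mass = 28.0 / 6.02e23
kT = 1.380e-16 * 273.0
sigma_numeric = sigma_ideal_gas(1.0, mass, kT)
sigma_closed = (2.0 * math.pi * mass * kT) ** 1.5     # eq. (59) with V = 1

print(gaussian_integral(), math.sqrt(math.pi), sigma_numeric, sigma_closed)
```

The cut-off at $|\xi| = 8$ is harmless because the integrand is already of order $e^{-64}$ there.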

We are less interested in the distribution of the components of
momenta (or of the velocities) than in that of the magnitudes. We
therefore ask for the number of molecules per unit volume whose
momentum has a magnitude between $p$ and $p + \Delta p$. Obviously, this number
is obtained by summing over all the elements of momentum space
included in the spherical shell between the radii $p$ and $p + \Delta p$,
since, according to (53), the density of representative points depends
only on $p_x^2 + p_y^2 + p_z^2 = p^2$, and so is constant over the surface of a sphere.
We thus have the sum given by the distribution function multiplied
by the total number of elements contained in the spherical shell of
this momentum space, i.e. simply the distribution function times the
volume of the spherical shell:

$$\Delta n = \frac{N\,e^{-p^2/2mkT}\cdot 4\pi p^2\,\Delta p}{V\,(2\pi mkT)^{3/2}}. \tag{60}$$

If we express $p$ in terms of $c$ and also set $N/V = n$ we obtain the final
formula

$$\frac{\Delta n}{n} = \frac{4}{\sqrt{\pi}}\left(\frac{m}{2kT}\right)^{3/2} c^2\,e^{-mc^2/2kT}\,\Delta c = f(c)\,\Delta c. \tag{61}$$


The distribution function $f(c)$ is plotted in fig. 1 for the case of
nitrogen. Since this function vanishes at $c = 0$ and at $c = \infty$, it must
have an extremum at some point. In order to locate this extremum,
which in this case is a maximum, we introduce the notation

$$\alpha = \sqrt{\frac{2kT}{m}} \tag{62}$$

for convenience. Setting the first derivative equal to zero gives

$$0 = \frac{\partial f}{\partial c} = \frac{8c}{\alpha^3\sqrt{\pi}}\left(1 - \frac{c^2}{\alpha^2}\right)e^{-c^2/\alpha^2},$$

from which

$$c_m = \alpha = \sqrt{\frac{2kT}{m}}. \tag{63}$$

This may be termed the most probable molecular velocity. Moreover,
we also wish to compute the mean velocity. This is given by

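$$\begin{aligned}
\bar{c} = \frac{1}{n}\sum c\,\Delta n &= \frac{4}{\alpha^3\sqrt{\pi}}\int_0^{\infty} c^3 e^{-c^2/\alpha^2}\,dc = \frac{4\alpha}{\sqrt{\pi}}\int_0^{\infty} x^3 e^{-x^2}\,dx \\
&= \frac{2\alpha}{\sqrt{\pi}}\int_0^{\infty} x^2 e^{-x^2}\,d(x^2) = \frac{2\alpha}{\sqrt{\pi}}\left\{\left[-x^2 e^{-x^2}\right]_0^{\infty} + \int_0^{\infty} e^{-x^2}\,2x\,dx\right\} \\
&= \frac{2\alpha}{\sqrt{\pi}} = \frac{2}{\sqrt{\pi}}\,c_m.
\end{aligned} \tag{64}$$

Finally, the mean square velocity is

$$\begin{aligned}
\overline{c^2} = \frac{1}{n}\sum c^2\,\Delta n &= \frac{4}{\alpha^3\sqrt{\pi}}\int_0^{\infty} c^4 e^{-c^2/\alpha^2}\,dc = \frac{2\alpha^2}{\sqrt{\pi}}\int_0^{\infty} x^3 e^{-x^2}\,d(x^2) \\
&= \frac{3\alpha^2}{2\sqrt{\pi}}\int_{-\infty}^{+\infty} e^{-x^2}\,dx = \frac{3\alpha^2\sqrt{\pi}}{2\sqrt{\pi}} = \frac{3\alpha^2}{2} = \tfrac{3}{2}c_m^2; \qquad \sqrt{\overline{c^2}} = 1.225\,c_m.
\end{aligned} \tag{65}$$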



We have thus derived the relation between average velocity and root 
mean square velocity which was given without proof in Chap. XXXIV. 
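
The three characteristic velocities can be checked by integrating the distribution function numerically. This is a rough sketch only: the molar mass of nitrogen (28 g/mol) and the temperature of 273 K are assumptions chosen for illustration, the constants are those quoted in (70), and the function names are hypothetical.

```python
import math

K = 1.380e-16   # Boltzmann constant in erg/deg, eq. (70)
L = 6.02e23     # Avogadro constant per mol, eq. (70)

def alpha(molar_mass, temperature):
    """Most probable velocity of eq. (63), in cm/s."""
    return math.sqrt(2.0 * K * temperature / (molar_mass / L))

def speed_distribution(c, a):
    """f(c) of eq. (61), written with a = sqrt(2kT/m)."""
    return 4.0 * c * c * math.exp(-(c / a) ** 2) / (a ** 3 * math.sqrt(math.pi))

def moment(power, a, steps=20_000):
    """Midpoint-rule value of the integral of c**power * f(c) from 0 to 10 a."""
    h = 10.0 * a / steps
    return h * sum(
        ((i + 0.5) * h) ** power * speed_distribution((i + 0.5) * h, a)
        for i in range(steps)
    )

a = alpha(28.0, 273.0)              # assumed: nitrogen at 273 K
mean_speed = moment(1, a)
rms_speed = math.sqrt(moment(2, a))

print(a, mean_speed / a, rms_speed / a)
```

Under these assumptions the most probable velocity comes out at roughly 400 m/s, and the two ratios reproduce the factors $2/\sqrt{\pi}$ and $\sqrt{3/2}$ of (64) and (65).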


(d) The equation of state of an ideal gas; Boltzmann Constant and
Gas Constant

If the value obtained above for the function $\sigma$ of an ideal gas is
inserted in the equation for the free energy (p. 588), one obtains

$$F = -kNT\log\frac{\sigma}{[h]^3} = -kNT\log\left\{V(2\pi mkT)^{3/2}\right\} + kNT\log[h]^3. \tag{66}$$

Using the second equation of (34) (p. 527), viz.

$$p = -\left(\frac{\partial F}{\partial V}\right)_T, \tag{67}$$

we obtain

$$p = \frac{kNT}{V} \tag{68}$$

or for one mol ($N = L$),

$$pv = kLT = RT. \tag{69}$$

Therefore, as already mentioned, $k$ is the gas constant per molecule:

$$k = \frac{R}{L} = \frac{8.313 \times 10^{7}\ \text{erg/deg/mol}}{6.02 \times 10^{23}\ \text{mol}^{-1}} = 1.380 \times 10^{-16}\ \text{erg/deg}. \tag{70}$$
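
The arithmetic of (68) to (70) can be reproduced in a few lines. This is an illustrative sketch; the molar volume of 22 400 cm³ used in the example is an assumed round figure, not a value taken from the text, and the function name is hypothetical.

```python
R = 8.313e7     # gas constant in erg/deg/mol, as quoted in eq. (70)
L = 6.02e23     # Avogadro constant per mol, as quoted in eq. (70)

k = R / L       # gas constant per molecule, eq. (70)

def pressure(n_molecules, volume_cm3, temperature):
    """Ideal-gas pressure of eq. (68) in dyn/cm^2."""
    return k * n_molecules * temperature / volume_cm3

# One mol in an assumed volume of 22 400 cm^3 at 273 K.
p_one_mol = pressure(L, 22400.0, 273.0)

print(k, p_one_mol)
```

The pressure comes out close to $1.013 \times 10^6$ dyn/cm², i.e. about one atmosphere, as expected for one mol under these conditions.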
(e) Osmotic pressure

On p. 547 this phenomenon was treated from a thermodynamic
point of view. It is possible to attain the same result by a statistical
approach.

Unlike the pressure exerted by a gas, evident as the result of molecular
impacts with the walls, osmotic pressure is less readily visualized.
We can think of the membrane as a field of force affecting only the
molecules of the solute, an assumption not in the least artificial since,
in the last analysis, the effect of the walls of a vessel of gas is
determined by the field of force of the molecules constituting the walls.
This assumption is particularly apt in the case of a membrane, for there
must be a certain thickness of membrane material within which the
selective permeability comes about. A particle approaching the wall
will then obtain a potential energy $u$ which rises sharply toward
infinity as penetration occurs. The force exerted on the particle by the
wall is given by

$$F = -\operatorname{grad} u. \tag{71}$$

Inasmuch as u increases as the particle approaches the wall, F must 
be directed into the solution, and according to Newton’s Third Law 
the particle exerts an equal and opposite force on the wall. Quali- 
tatively at least, the situation is the same as for the phenomenon of 
gas pressure. It remains to show that the gas laws actually hold for 
osmotic pressure. 

For the present, assume that equilibrium has not yet been reached 
on the solution side. In this condition the full pressure excess does not 
yet exist on the solution side, and the force field of the membrane 
repels the particles. Now, it is known that when a molecule or an ion 
moves in a liquid, viscosity causes some of the solvent to be dragged 
along (see p. 419). The liquid thus streaming away from the membrane 
is replaced by fresh solvent which passes through the membrane, for 
no barrier exists in so far as the solvent is concerned. The solvent 
streams in until the pressure excess equals the osmotic pressure. At 
equilibrium the Boltzmann distribution law holds, which means that 
the number of particles of energy u in each cubic centimetre is given by 

$$n = n_0\,e^{-u/kT}. \tag{72}$$

Here $n_0$ is the number of particles per cubic centimetre at a point in
the solution far from the membrane. The force on an element of volume
$dv$ of the solution containing $n$ particles of energy $u$ per cubic
centimetre is

$$f\,dv = -n\operatorname{grad} u\;dv. \tag{73}$$

It follows from (72) that

$$\operatorname{grad} n = -\frac{n_0}{kT}\,e^{-u/kT}\operatorname{grad} u = -\frac{n}{kT}\operatorname{grad} u.$$

Substituting into (73), the resulting expression for the force per unit
volume becomes

$$f = kT\operatorname{grad} n.$$

According to the fundamental hydrostatic equation (p. 191), we have,
at equilibrium,

$$\operatorname{grad} p = f = kT\operatorname{grad} n. \tag{75}$$

Integrating,

$$p = kTn + p_0, \tag{76}$$

where $p_0$ is the pressure corresponding to $n = 0$, i.e. the pressure on
the solvent side. The osmotic pressure is $p - p_0$. If one mol of solute
is dissolved in a volume $v$ of solution, then $n = L/v$ and

$$p_{\text{osm}} = \frac{kLT}{v} = \frac{RT}{v}. \tag{77}$$

Thus we have deduced the van't Hoff law (p. 547) on a statistical basis.
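
The argument leading to (76) implies that the pressure taken up by the membrane is $n_0 kT$ whatever the detailed shape of its repulsive potential. The sketch below tests this by integrating the force density of (73) through two invented wall potentials; the potentials, the arbitrary units and the function name are assumptions for illustration only. The last line evaluates (77) for one mol per litre at 273 K.

```python
import math

def wall_pressure(n0, kT, potential, gradient, x_max, steps=100_000):
    """Integrate the force density n * du/dx of eq. (73) through the wall region."""
    h = x_max / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += n0 * math.exp(-potential(x) / kT) * gradient(x)
    return total * h

n0, kT = 2.5, 1.7   # arbitrary units

# Two hypothetical wall potentials rising from 0 in the solution towards infinity.
p_quadratic = wall_pressure(n0, kT, lambda x: 3.0 * x**2, lambda x: 6.0 * x, x_max=10.0)
p_quartic = wall_pressure(n0, kT, lambda x: x**4, lambda x: 4.0 * x**3, x_max=6.0)

R = 8.313e7                                       # erg/deg/mol
osmotic_one_mol_per_litre = R * 273.0 / 1000.0    # eq. (77), in dyn/cm^2

print(p_quadratic, p_quartic, n0 * kT, osmotic_one_mol_per_litre)
```

Both potentials give the same value $n_0 kT$, and the van't Hoff pressure for one mol per litre corresponds to about 22.4 atmospheres.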


6. The Law of Equipartition and its Application to the Specific Heats. 

The energy of a single particle is, in most cases occurring in practice,
a homogeneous function of the squares of a number of position
or momentum co-ordinates to which we give the unified designation
$x_1, x_2, \ldots, x_f$. The total number $f$ of parameters upon which the energy
depends is called the number of degrees of freedom. This is unlike
the usage in mechanics, where the number of position co-ordinates
alone was called the number of degrees of freedom of the system.
Under the condition that we have a homogeneous quadratic energy
function, Euler's theorem (p. 531) is valid:

$$x_1\frac{\partial u}{\partial x_1} + x_2\frac{\partial u}{\partial x_2} + \ldots + x_f\frac{\partial u}{\partial x_f} = 2u. \tag{78}$$

Passing directly to integration instead of summation, the total energy
of the system is given by

$$U = \int u\,dN = N\,\frac{\int\!\cdots\!\int e^{-u/kT}\,u\,dx_1\,dx_2 \ldots dx_f}{\int\!\cdots\!\int e^{-u/kT}\,dx_1\,dx_2 \ldots dx_f} = N\frac{Z}{\sigma}. \tag{79}$$

The numerator $Z$ may be transformed with the help of Euler's theorem.
We have

$$\begin{aligned}
2Z = {} & \int\!\cdots\!\int e^{-u/kT}\,x_1\frac{\partial u}{\partial x_1}\,dx_1\,dx_2 \ldots dx_f \\
& + \int\!\cdots\!\int e^{-u/kT}\,x_2\frac{\partial u}{\partial x_2}\,dx_1\,dx_2 \ldots dx_f \\
& + \ldots + \int\!\cdots\!\int e^{-u/kT}\,x_f\frac{\partial u}{\partial x_f}\,dx_1\,dx_2 \ldots dx_f.
\end{aligned} \tag{80}$$

In the first integral we integrate with respect to $x_1$ by parts:

$$\int e^{-u/kT}\,x_1\frac{\partial u}{\partial x_1}\,dx_1 = -kT\left[x_1 e^{-u/kT}\right] + kT\int e^{-u/kT}\,dx_1. \tag{81}$$

In general, the limits are $-\infty$ and $+\infty$; at any rate, there are two
limits of equal magnitude in $x_1$, but of opposite sign. Then, since
$u$ is proportional to $x_1^2$, the bracketed expression vanishes when the
limits are substituted, so that only the integral remains on the right
side. Substituting this in (80), the first term yields $kT\sigma$. If we proceed
in this way with every term, we obtain

$$2Z = fkT\sigma. \tag{82}$$

Hence

$$U = \frac{NfkT}{2} = Nf\bar{u}. \tag{83}$$


This is the celebrated Law of the Equipartition of Energy in classical 
statistical mechanics: Each degree of freedom which enters quadrati- 
cally into the expression for the energy of the entire system contributes , 
on the average , kT/2 to this energy. 
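
The statement can be checked numerically for a single quadratic degree of freedom $u = a x^2$: the Boltzmann average of $u$ should equal $kT/2$ independently of the coefficient $a$. The following sketch uses a midpoint rule; the coefficients and temperatures are arbitrary illustrative values and the function name is hypothetical.

```python
import math

def mean_quadratic_energy(coefficient, kT, steps=50_000):
    """Boltzmann average of u = coefficient * x**2 over the line, cf. eq. (79)."""
    x_max = 12.0 * math.sqrt(kT / coefficient)   # weight is ~exp(-144) at the ends
    h = 2.0 * x_max / steps
    num = den = 0.0
    for i in range(steps):
        x = -x_max + (i + 0.5) * h
        u = coefficient * x * x
        weight = math.exp(-u / kT)
        num += u * weight
        den += weight
    return num / den

# Arbitrary coefficients and temperatures (illustrative units).
for coefficient, kT in [(0.5, 1.0), (3.0, 2.5), (40.0, 0.3)]:
    print(coefficient, kT, mean_quadratic_energy(coefficient, kT), kT / 2.0)
```

A stiff coordinate (large $a$) is confined to a narrower range, but its average energy is the same as that of a soft one, which is the content of the theorem.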

This theorem may be verified in a simple manner by applying it
to the calculation of the specific heats of gases. Since

$$c_v = \left(\frac{\partial U}{\partial T}\right)_v, \tag{84}$$

the molecular heat becomes, according to (83),

$$c_v = \frac{Lfk}{2} = \frac{fR}{2}. \tag{85}$$

Since $R$ is approximately equal to 2 calories per degree,

$$c_v \approx f. \tag{86}$$

XXXV.] CLASSICAL STATISTICS OF BOLTZMANN

The degrees of freedom of a monatomic gas are the three components
of the translational velocity; the specific heat per mol is then

$$c_v = \frac{3R}{2}, \tag{87}$$

whence

$$c_p = c_v + R = \frac{5R}{2}, \qquad \frac{c_p}{c_v} = \frac{5}{3} \tag{88}$$


for any monatomic gas. These relations have been verified from the
lowest to the highest attainable temperatures.

As a model of a diatomic molecule we take a rigid dumb-bell.* In
addition to translational energy, the molecule has rotational energy.
The rotation, too, may be resolved into components about three axes.
The rotation about the figure axis itself does not contribute to the
kinetic energy, since the moment of inertia about this axis is zero;
hence there are two additional degrees of freedom of rotation for a
diatomic gas, and we have

$$c_v = \frac{5R}{2}, \qquad \frac{c_p}{c_v} = \frac{7}{5}. \tag{89}$$

This relation is also generally valid at room temperature. Nevertheless,
it has been observed, especially for hydrogen, that the specific
heat drops to the value for monatomic gases as the temperature is
lowered. This does not agree with the Law of Equipartition.

* The two atoms may also vibrate in the line joining them. The reason
why this vibration does not contribute to the specific heat at ordinary
temperatures will be discussed later.

Molecules consisting of more than two atoms have three degrees
of rotational freedom, so that as long as we consider such molecules
rigid (internal vibrations absent), the specific heat at constant volume
is given by

$$c_v = \frac{6R}{2}, \qquad \frac{c_p}{c_v} = \frac{4}{3}, \tag{90}$$

which holds for any such polyatomic molecule.

As we saw on p. 575, the atoms of a solid body are bound to their
equilibrium positions by quasi-elastic restoring forces. Thus an
atom possesses not only kinetic energy but also potential energy
which, by the results of p. 577, is also a quadratic function of the
position co-ordinates. This means that for an atom of a solid there are six
degrees of freedom contributing to the energy, whence

$$C_v = \frac{6R}{2}. \tag{91}$$


This is the same as the empirical Dulong-Petit Law of Atomic Heats. 
At room temperature this law is in general well obeyed — only the 
lighter elements (e.g. diamond) show departures. At lower tempera- 
tures, however, the specific heats decrease — as a matter of fact, to 
zero — which is what the Nernst Heat Theorem requires. But this 
decrease is also contrary to the Law of Equipartition. Thus it seems 
as if this law is valid at higher temperatures, but that at lower tem- 
peratures an ankylosis * of degrees of freedom takes place. This indi- 
cates that the classical statistics must be modified in some way. 
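
The classical predictions of this section reduce to simple arithmetic on the number $f$ of quadratic degrees of freedom. The short sketch below tabulates them; the value $R = 1.987$ cal/deg/mol is an assumed figure consistent with the "approximately 2 calories per degree" quoted above, and the function and variable names are hypothetical.

```python
R_CAL = 1.987   # gas constant in cal/deg/mol (assumed value, roughly 2 as in the text)

def molar_heats(f):
    """Classical molar heats of an ideal gas with f quadratic degrees of freedom."""
    c_v = f * R_CAL / 2.0        # eq. (85)
    c_p = c_v + R_CAL            # c_p = c_v + R
    return c_v, c_p, c_p / c_v

gases = {
    "monatomic gas": 3,          # eqs. (87), (88)
    "rigid diatomic gas": 5,     # eq. (89)
    "rigid polyatomic gas": 6,   # eq. (90)
}
table = {name: molar_heats(f) for name, f in gases.items()}
dulong_petit = 6 * R_CAL / 2.0   # atomic heat of a solid, eq. (91)

for name, (c_v, c_p, ratio) in table.items():
    print(f"{name:22s} c_v = {c_v:5.2f}  c_p = {c_p:5.2f}  c_p/c_v = {ratio:.3f}")
print(f"solid (Dulong-Petit)   C_v = {dulong_petit:5.2f}")
```

These are the equipartition values only; as noted above, measured specific heats fall below them at low temperatures.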


9. Fluctuations. Transition from Microscopic to Macroscopic Motion. 

If thermodynamical equilibrium, corresponding to the maximum 
value of the entropy, is of a statistical nature only, departures from 
the most probable values are to be expected when very small regions 
are observed. For example, if it were possible to observe an element 
of volume containing but one hundred molecules we would observe 
relatively large fluctuations of density in this space ; if but one addi- 
tional molecule should enter, the density would be increased by 1 
per cent. These variations in density are responsible for the scattering 
of light in the atmosphere. A knowledge of the theory of this scattering
makes it possible to calculate Avogadro's Number from the spectral
energy distribution of skylight. Again, imagine a small, movable 
surface element in the wall of a vessel containing a gas. Let the element 
be so small that but a few hundred molecules strike it each second. 
While the element would not respond to each impact separately, the 
likelihood of considerable pressure fluctuations is no longer small, and 
an irregular fluttering of the element is to be expected. This pheno- 
menon was discovered about a century ago by the English botanist 
Brown, but its nature remained unexplained for a long time. If we 
observe very small, barely visible particles suspended in a liquid 
(colloid particles), these exhibit an irregular vibratory motion because 
of the fact that the impacts of the molecules of the liquid do not exactly 
balance at each instant, so that the particle is urged sometimes in one 
direction, sometimes in another. Looking upon the colloid particle as 
a large molecule, its kinetic energy must, on the average, be equal to 
$kT/2$, according to the Law of Equipartition. What we observe in
the microscope, however, are velocities of much lower order of
magnitude, if we take the velocity to be the distance covered divided by the
time.

* Poincaré borrowed this very apt term from pathology to denote the "freezing
up" of a degree of freedom.

In reality, however, the velocity alters its direction so often
that the observed motion is merely a smoothed-over version of the 
actual zigzag path. The path of such a particle is suggestive of a type 
of curve occurring in higher mathematics — a curve which is everywhere 
continuous but yet has no tangent. It is quite hopeless to attempt — 
say by cinematography — to ascertain the details of the motion in order 
to determine the actual mean kinetic energy. Rather, if we wish to 
give a theory of the phenomenon which is amenable to experimental 
test, we must consider the mean displacement instead of the velocity. 
Consider only displacements in one dimension. Let there be $N$ particles
per unit volume between $x$ and $x + dx$ at the time $t = 0$. After
a given time $\tau$ has elapsed, we consider a volume element of the same
size at the point $x'$. Particles not within this volume element at first,
but now inside it, have entered from the neighbouring elements. The
probability that a particle enter from an adjoining element is, of
course, a function of the distance $x' - x$, and the difference in
time $\tau$ of the two observations. Accordingly, we denote it by
$f_\tau(x' - x)$. The function $f_\tau$ includes also the case in which the
particles were previously in the element $x'$ if we set $x' - x = 0$.
Since the particles must come from some volume element, the
density at $x'$ at the time $\tau$ is

$$N(x', \tau) = \int_{-\infty}^{+\infty} N(x, 0)\,f_\tau(x' - x)\,dx. \tag{92}$$

Introducing the displacement $X = x - x'$,

$$N(x', \tau) = \int_{-\infty}^{+\infty} N(x' + X, 0)\,f_\tau(X)\,dX. \tag{93}$$

The function $f_\tau$ is even, since positive and negative displacements are
equally probable:

$$f_\tau(X) = f_\tau(-X). \tag{94}$$

We now develop the left member in powers of $\tau$, the right in powers
of $X$:

$$N(x', 0) + \tau\frac{\partial N}{\partial t} + \ldots = \int_{-\infty}^{+\infty}\left\{N(x', 0) + X\frac{\partial N}{\partial x} + \frac{X^2}{2}\frac{\partial^2 N}{\partial x^2} + \ldots\right\} f_\tau(X)\,dX. \tag{95}$$

The derivatives $\partial N/\partial t$, $\partial N/\partial x, \ldots$ have fixed values for the time $t = 0$
and the point $x'$, and may therefore be taken outside the integral. On
account of the symmetry of the function $f_\tau(X)$, the integral

$$\int_{-\infty}^{+\infty} X f_\tau(X)\,dX$$

vanishes; further, since each particle which enters must come from
some other element, the integral

$$\int_{-\infty}^{+\infty} f_\tau(X)\,dX = 1.$$
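Dividing by $\tau$, we obtain from equation (95):

$$\frac{\partial N}{\partial t} = \frac{1}{2\tau}\frac{\partial^2 N}{\partial x^2}\int_{-\infty}^{+\infty} X^2 f_\tau(X)\,dX = \frac{\overline{X^2}}{2\tau}\frac{\partial^2 N}{\partial x^2}. \tag{96}$$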

The integral on the right represents the mean square displacement
$\overline{X^2}$, since it is the sum of the squares of the displacements, each being
multiplied by the probability of its occurrence. Now it is readily
shown that the mean square of the displacement is proportional to the
time interval of observation, so that $\overline{X^2}/2\tau$ is constant: if we observe
the instantaneous positions of a particle at equal time intervals $\tau$ which
are not too short, we may look upon the individual displacements as
independent, on account of the random nature of the impacts.*
After a series of $p$ observations the resulting displacement is

$$X^{(p)} = X_1 + X_2 + X_3 + \ldots + X_p. \tag{97}$$

The square of the displacement is then

$$\left(X^{(p)}\right)^2 = (X_1 + X_2 + \ldots + X_p)^2 = \sum_{r=1}^{p} X_r^2 + 2\sum_{r<s} X_r X_s. \tag{98}$$

If we now average over the displacements experienced by a large
number of particles, the double sum will vanish on account of
the equal probability of positive and negative displacements, and so
we have

$$\overline{\left(X^{(p)}\right)^2} = p\,\overline{X^2}. \tag{99}$$

This means that after the time $p\tau$ has elapsed we observe a mean
square displacement of magnitude $p\,\overline{X^2}$, i.e. $p$ times the value
corresponding to the time $\tau$. Thus $\overline{X^2}$ is proportional to the interval of
observation.
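
The proportionality expressed by (99) is readily illustrated by a simulated random walk. This is a minimal sketch under simplifying assumptions that are not in the text: every single displacement is taken to be exactly $+1$ or $-1$ with equal probability, so that $\overline{X^2} = 1$, and the sample size and seed are arbitrary.

```python
import random

def mean_square_displacement(n_steps, n_particles=20_000, seed=0):
    """Average of (X_1 + ... + X_p)^2 over many particles, each X_r being +1 or -1."""
    rng = random.Random(seed)          # fixed seed so the run is reproducible
    total = 0.0
    for _ in range(n_particles):
        x = 0
        for _ in range(n_steps):
            x += rng.choice((-1, 1))   # independent displacements, cf. eq. (97)
        total += x * x
    return total / n_particles

msd_10 = mean_square_displacement(10)
msd_40 = mean_square_displacement(40)

print(msd_10, msd_40, msd_40 / msd_10)
```

With unit steps the expected values are 10 and 40; the simulated averages scatter about them by roughly one per cent for this sample size, and quadrupling the number of observations quadruples the mean square displacement.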

Equation (96) is neither more nor less than the well-known
differential equation of diffusion, which is analogous to the equation of
heat conduction, concentration taking the place of temperature.
Thus, if the concentration varies from place to place, a stream of
particles will flow from the points of high concentration to those of
low concentration.

* If the observations are made too frequently, a given velocity of the particle will
still persist after the instant of observation and so will affect the following observation.
In a practical case, however, it is hardly possible to make observations at such short
intervals.

We set the density of this flow proportional to
the slope of the concentration, i.e. to $-D\operatorname{grad} N$, or in the
one-dimensional case, $-D(\partial N/\partial x)$. The change in the number of particles
within a volume element of thickness $dx$ is due to the entry of particles
from one side and the departure of particles from the other. At the
point $x$ a current

$$-D\left(\frac{\partial N}{\partial x}\right)_x$$

enters and at the point $x + dx$ the current

$$-D\left(\frac{\partial N}{\partial x}\right)_{x+dx} = -D\left(\frac{\partial N}{\partial x}\right)_x - D\frac{\partial^2 N}{\partial x^2}\,dx$$

leaves. The excess of the number entering over the number leaving
is the rate at which the particles are diffusing into the volume element,
so that

$$\frac{\partial N}{\partial t}\,dx = D\frac{\partial^2 N}{\partial x^2}\,dx,$$

or

$$\frac{\partial N}{\partial t} = D\frac{\partial^2 N}{\partial x^2}. \tag{100}$$

Thus (cf. (96))

$$D = \frac{\overline{X^2}}{2\tau} \tag{101}$$

represents the diffusion coefficient of the particles. If the particles
are in a field of force, e.g. the earth's gravitational field, the
Boltzmann equilibrium distribution sets in. This is given by

$$N = N_0\,e^{-u/kT}. \tag{102}$$

If the force is constant, the potential energy is given by $Fx$. We may
imagine the equilibrium to result from the motion of the particles due
to the action of the force together with a diffusion current which seeks
to satisfy (102). The velocity of a particle which is in equilibrium under
the action of the applied force and viscosity is given by Stokes's Law

$$v = \frac{F}{6\pi\eta a}, \tag{103}$$

where $a$ is the radius of the particle and $\eta$ is the coefficient of viscosity
of the solvent. The density of this stream of particles is then

$$s = \frac{NF}{6\pi\eta a}. \tag{104}$$

This is opposed by a diffusion current of equal strength in the opposite
direction, tending to keep the distribution of concentration constant. The
density of this flow is given by $-D(\partial N/\partial x)$. Equating these two flow
densities,

$$-D\frac{\partial N}{\partial x} = \frac{NF}{6\pi\eta a}. \tag{105}$$
On the other hand, it follows from (102), for $u = Fx$, that

$$\frac{1}{N}\frac{\partial N}{\partial x} = -\frac{F}{kT}. \tag{106}$$

Inserting this expression in (105) and using (101), we obtain the
important equation

$$D = \frac{kT}{6\pi\eta a}, \tag{107}$$

or

$$\frac{\overline{X^2}}{\tau} = \frac{kT}{3\pi\eta a}. \tag{108}$$

This is a general expression for the mean square of the displacement
of a particle.*
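
To give an idea of the magnitudes involved, (107) and (108) may be evaluated for a colloid particle in water. The sketch below is illustrative: the temperature, the viscosity of 0.010 poise, the radius of half a micron and the observation interval of 30 seconds are assumed values, not data from the text, and the function names are hypothetical. The last function inverts (108), which is the way an observed mean square displacement can be used to determine $k$.

```python
import math

K = 1.380e-16   # Boltzmann constant in erg/deg, eq. (70)

def diffusion_coefficient(temperature, viscosity, radius):
    """D of eq. (107) in cm^2/s (CGS units throughout)."""
    return K * temperature / (6.0 * math.pi * viscosity * radius)

def mean_square_displacement(temperature, viscosity, radius, tau):
    """Mean square displacement of eq. (108) after the time tau, in cm^2."""
    return K * temperature * tau / (3.0 * math.pi * viscosity * radius)

def boltzmann_constant_from_msd(msd, temperature, viscosity, radius, tau):
    """Invert eq. (108) for k, given an observed mean square displacement."""
    return 3.0 * math.pi * viscosity * radius * msd / (temperature * tau)

# Assumed example: particle of radius 0.5 micron in water at 293 K.
temperature, viscosity, radius, tau = 293.0, 0.010, 5.0e-5, 30.0

D = diffusion_coefficient(temperature, viscosity, radius)
msd = mean_square_displacement(temperature, viscosity, radius, tau)
k_recovered = boltzmann_constant_from_msd(msd, temperature, viscosity, radius, tau)

print(D, math.sqrt(msd), k_recovered)
```

Under these assumptions the root mean square displacement in 30 seconds is about 5 microns, which is well within reach of a microscope.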


* The above theory was developed by Einstein, and the final formula (108) was
experimentally checked by Perrin and others. Consult A. Einstein, Ann. der Physik,
17, p. 549 (1905); E. Bloch, The Kinetic Theory of Gases, English translation by P. A.
Smith (London, Methuen & Co., 1924). Einstein also developed a formula for the
rotational Brownian movement experienced by particles of irregular shape. This
formula, too, was verified experimentally. This is discussed by Bloch, loc. cit.



CHAPTER XXXVI 
The Classical Quantum Statistics 

1. The Quantum Condition for the Subdivision of the Phase Space of 
an Oscillator. 

As we saw in § 6 (p. 595) of the preceding chapter, the Law of 
Equipartition leads to conflict with experiment when applied to the 
calculation of specific heats. The results are entirely incorrect when 
the theorem is applied to a volume filled with radiation (see p. 622). 
In connexion with this problem, Planck was led to advance the hypo- 
thesis, in 1900, that a simple, linear, harmonic oscillator can absorb 
energy only in integral multiples of a minimum quantity of energy 
which is proportional to the frequency.* Thus the energy possessed 
by such a system is always an integral multiple of the quantity hv: 

u n = nhv (1) 

The factor of proportionality, h , is the most important of all con- 
stants in atomic physics. It occurs universally, and is not connected 
merely with the linear oscillator, as might be inferred from the manner 
in which it was introduced. Its numerical value is 

$$h = 6.62 \times 10^{-27}\ \text{erg sec}.$$

This constant is usually referred to as the Planck quantum of action, 
since it has the dimensions of action (energy x time). We encountered 
this constant for the first time in the theory of discharges in gases 
(p. 432). The greater the natural frequency v, the greater are the 
intervals in energy content, and the greater is the difference between 
the quantum theory and the classical statistics, where a continuous 
series of energy states was tacitly assumed.

* M. Planck, Ann. der Physik, 4, p. 553 (1901).

What is the significance of the Planck hypothesis for the phase
space of a system consisting of oscillators? It is obvious that for an
arbitrary division into cells there can no longer be equal probability
for cells of equal size, for if we mark the points in which the oscillator
has exactly the energy $nh\nu$, it will always be possible to find volume
elements containing no points (i.e. elements whose probability is zero)
if the subdivision is arbitrary. Nevertheless, it is possible to preserve
the validity of the Liouville theorem by subdividing the phase space
in a particular way. For the linear oscillator, which has but one
position co-ordinate and one momentum component, the phase space is
the $pq$-plane. The potential and kinetic energies are

$$\tfrac{1}{2}fq^2 \quad\text{and}\quad \frac{p^2}{2m}$$

respectively. In an energy state in which the total energy is $nh\nu$ we
have the relation

$$nh\nu = \tfrac{1}{2}fq^2 + \frac{p^2}{2m}$$

or

$$\frac{q^2}{2nh\nu/f} + \frac{p^2}{2mnh\nu} = 1.$$

This is the equation of an ellipse whose semi-axes are

$$a = \sqrt{\frac{2nh\nu}{f}}, \qquad b = \sqrt{2mnh\nu}.$$

This ellipse is termed the phase orbit, since the phase point traverses
this path once during a complete vibration of the oscillator, as long
as the energy remains constant. The area enclosed by the phase orbit
is given by the phase integral (cf. p. 129):

$$\oint p\,dq = \pi ab = 2\pi nh\nu\sqrt{\frac{m}{f}}.$$

Since

$$\nu = \frac{1}{2\pi}\sqrt{\frac{f}{m}},$$

this becomes

$$\oint p\,dq = nh.$$

In the case of the oscillator, the Planck hypothesis leads to the condition
that only those states occur for which the value of the phase integral is
an integral multiple of $h$. This is the general formulation of the quantum
condition. It is only by chance that for the oscillator this condition is
equivalent to the energy condition; for other systems the two conditions
are quite distinct. If we now draw the ellipses corresponding
to $n = 1, 2, 3, \ldots$ we obtain a division of the plane into elliptical
rings, each of area $h$. If we ascribe to each of these rings the energy
value corresponding to its inner boundary, the innermost ring has
the energy 0, the next one $1\,h\nu$, &c. Thus we again arrive at a division
of the phase space into elements of equal size; if all quantum
states are looked upon as equally likely, each element has the same
a priori probability and we can proceed as in the classical statistics.

If the elementary particle has several co-ordinates, the condition
that the phase integrals be integral multiples of $h$ holds for each
co-ordinate, and the element of phase space becomes $h^s$. Thus the value
of the dimensional constant $[K]$ introduced on p. 588 is determined. We
write $h$ for it in the future.
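
The statement that every phase orbit encloses the area $nh$ may be checked numerically. The following is only a minimal sketch: the function name, the choice $h = 1$ and the sample values of the mass $m$ and force constant $f$ are assumptions made for illustration, not part of the text.

```python
import math

def phase_orbit_area(n, m, f, h=1.0):
    """Area of the ellipse of energy n*h*nu in the (q, p) plane.

    m is the mass and f the force constant of the oscillator; h = 1 is an
    arbitrary illustrative choice of units (an assumption, not a measured value).
    """
    nu = math.sqrt(f / m) / (2.0 * math.pi)   # natural frequency
    a = math.sqrt(2.0 * n * h * nu / f)       # semi-axis along q
    b = math.sqrt(2.0 * m * n * h * nu)       # semi-axis along p
    return math.pi * a * b                    # area of the ellipse

# Successive orbits should differ in area by h, whatever m and f are.
for n in (1, 2, 3):
    print(n, phase_orbit_area(n, m=2.0, f=5.0))
```

Each elliptical ring between consecutive orbits then has area $h$, independently of the values assumed for $m$ and $f$.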

The chief difference between the present procedure and the classical
statistics is that the prescribed division of phase space into cells
of definite size no longer permits the replacement of the partition
sum $\sigma$ by an integral, since the latter implies a continuous change of
energy. We thus obtain for the partition sum of the oscillator (p. 587)

$$ \sigma = h\sum_{n=0}^{\infty} e^{-nh\nu/kT} = h\left(1 + e^{-h\nu/kT} + e^{-2h\nu/kT} + e^{-3h\nu/kT} + \ldots\right). \tag{5} $$

The sum of this geometrical progression is

$$ \sigma = \frac{h}{1 - e^{-h\nu/kT}}. \tag{6} $$

This expression for $\sigma$ has important applications to specific heats, as
will be shown in the following section.
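
The passage from the series (5) to the closed form (6) can be illustrated numerically. This is a sketch only; the function names are hypothetical, the common factor $h$ is omitted, and $x$ stands for $h\nu/kT$.

```python
import math

def partition_sum_series(x, terms=200):
    """Partial sum 1 + e^-x + e^-2x + ... of equation (5), factor h omitted."""
    return sum(math.exp(-n * x) for n in range(terms))

def partition_sum_closed(x):
    """Closed form 1 / (1 - e^-x) of equation (6), factor h omitted."""
    return 1.0 / (1.0 - math.exp(-x))

# x = h*nu/(k*T); the two columns should agree for every positive x.
for x in (0.5, 1.0, 5.0):
    print(x, partition_sum_series(x), partition_sum_closed(x))
```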


2. Temperature Variation of the Vibrational Contribution to the Specific 
Heats. 

(a) Diatomic molecules 

The two atoms which constitute a diatomic molecule are not 
rigidly connected to one another, but execute vibrations about their 
common centre of gravity in the direction of the axis of figure. The 
evidence for these vibrations is obtained from observations on band 
spectra (cf. p. 680). Why do the two degrees of freedom of vibrational
kinetic and potential energy not contribute to the specific heat,
which apparently comes out correctly when only the translational
and rotational degrees of freedom are taken into account? The quantum
theory furnishes information on this point. With regard to the
vibrational contribution to the total energy, we can consider the
diatomic gas to be a system of linear oscillators, since only vibrations
along the axis of figure of each molecule are possible. We have already
calculated the partition sum for an oscillator. The free energy of one
mol of such oscillators is then (p. 588)


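$$ f = -kLT\log\frac{\sigma}{h} = -RT\log h + RT\log\left(1 - e^{-h\nu/kT}\right) + RT\log h, \tag{7} $$

whence, by differentiation,

$$ \left(\frac{\partial f}{\partial T}\right)_v = R\log\left(1 - e^{-h\nu/kT}\right) - \frac{RT\,e^{-h\nu/kT}\,h\nu}{\left(1 - e^{-h\nu/kT}\right)kT^2}, \tag{8} $$

$$ u = f - T\left(\frac{\partial f}{\partial T}\right)_v = \frac{Rh\nu\,e^{-h\nu/kT}}{k\left(1 - e^{-h\nu/kT}\right)} = \frac{Rh\nu}{k\left(e^{h\nu/kT} - 1\right)}. \tag{9} $$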

For high temperatures, the exponential function in the denominator
may be developed in a power series ending with the second term, thus
giving

$$ u = \frac{Rh\nu}{k}\cdot\frac{kT}{h\nu} = RT = L\cdot 2\cdot\frac{kT}{2}. \tag{10} $$


But this is the value of the energy corresponding to the two degrees
of freedom according to the equipartition of energy. The classical
statistics remains valid when the temperature is high; at low temperatures
the finite differences of the quantum states manifest themselves
by the fact that their contributions are not as large as the classical
value. The contributions vanish entirely for very small values of $T$,
and this degree of freedom is "ankylosed" (p. 596). It is evidently
a question of the ratio of the temperature to a temperature $\Theta$ which is
characteristic of the oscillator:

$$ \Theta = \frac{h\nu}{k}. \tag{11} $$

We have

$$ c_v = \left(\frac{\partial u}{\partial T}\right)_v = R\,\frac{d}{dT}\,\frac{\Theta}{e^{\Theta/T} - 1} = R\left(\frac{\Theta}{T}\right)^2\frac{e^{\Theta/T}}{\left(e^{\Theta/T} - 1\right)^2} = R\,\Phi\!\left(\frac{\Theta}{T}\right), \tag{12} $$

where we introduce the function

$$ \Phi(x) = \frac{x^2 e^x}{\left(e^x - 1\right)^2}. \tag{12a} $$


For small values of $x$, i.e. for high temperatures, the value of $\Phi$
approaches unity, while for large values of $x$, i.e. for low temperatures,
$\Phi$ vanishes. At room temperature (about 300° abs.) the state of affairs
is this:

For $x > 10$, $\Phi(x) < 0.005$; $\Theta > 3000°$,
i.e. $\nu > 6.2 \times 10^{13}$ sec.$^{-1}$ or $\lambda < 4.8\ \mu$.

For $x < 0.5$, $\Phi(x) > 0.975$; $\Theta < 150°$,
i.e. $\nu < 3.1 \times 10^{12}$ sec.$^{-1}$ or $\lambda > 95\ \mu$.

The accompanying wave-lengths are those of electromagnetic waves
of the given frequencies; these are measured spectroscopically as
the infra-red characteristic radiation of the system. Thus we see that
an infra-red characteristic vibration whose wave-length is less than
$4.8\ \mu$ is practically absent at room temperature. This is actually the
case for diatomic gases. The function $\Phi(x)$ steadily decreases from 1 to 0
as $x$ increases from 0 to $\infty$. Hence, according to equation (12) there must
be an increase of the specific heat as the temperature is raised. This is
observed for HCl, in good agreement with the theory. If we insert
in (12) the frequency corresponding to the infra-red characteristic
vibration at $\lambda = 3.47\ \mu$, there results


    T        c_v calc.    c_v obs.

    1600°    6.41         6.46
    2000°    6.66         6.68
    2600°    6.82         6.90
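
The bounds quoted above for the function $\Phi(x)$ of equation (12a), and the conversion from a characteristic wave-length to the characteristic temperature $\Theta = h\nu/k$, can be reproduced by a short computation. The sketch below is illustrative only: the function names are hypothetical, and the constants are present-day SI values, which is an assumption and not the set of values used in the text.

```python
import math

# Present-day SI values of the constants (an assumption for illustration).
H_PLANCK = 6.62607015e-34   # J s
K_BOLTZ = 1.380649e-23      # J / K
C_LIGHT = 2.99792458e8      # m / s

def einstein_phi(x):
    """Phi(x) = x^2 e^x / (e^x - 1)^2, written with e^-x to avoid overflow."""
    if x == 0.0:
        return 1.0
    return x * x * math.exp(-x) / math.expm1(-x) ** 2

def theta_from_wavelength(wavelength_m):
    """Characteristic temperature Theta = h*nu/k for nu = c / wavelength."""
    return H_PLANCK * C_LIGHT / (wavelength_m * K_BOLTZ)

print(einstein_phi(10.0), einstein_phi(0.5))
print(theta_from_wavelength(4.8e-6), theta_from_wavelength(95e-6))
```

The vibrational contribution to the molecular heat then follows as $R\,\Phi(\Theta/T)$, as in equation (12).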


(b) The solid state 


If we represent the structure of a solid by atoms vibrating about
centres of equilibrium, we must remember that there are three directions
of vibration; we used this fact in deriving the Law of Dulong
and Petit. This means that one mol of a solid substance is equivalent
to $3L$ oscillators, and we obtain for the molecular heat of a solid,
exactly as above,

$$ c_v = 3R\left(\frac{\Theta}{T}\right)^2\frac{e^{\Theta/T}}{\left(e^{\Theta/T} - 1\right)^2} = 3R\,\Phi\!\left(\frac{\Theta}{T}\right). \tag{13} $$

In this instance the characteristic temperature $\Theta$ is generally so low
that all vibrations are fully stimulated at room temperature, and so
the limiting value $3R$ holds for $c_v$. Since the mass occurs in the
denominator of the formula for $\nu$, it is only for the lightest elements that
the characteristic frequency is so great that the full value of $c_v$ is not
attained at room temperature. […] characteristic frequencies, we obtain
so many free constants […] to one lattice point is transmitted to […]
we are dealing with a system of coupled oscillators. The calculation for
this case is rather troublesome, but has been carried out by Born and
Thirring for simple lattices.*

Ex. 129. Determine, according to the conditions given above, the quantum
levels of the rotational motion of a rigid dumb-bell (diatomic molecule) of
moment of inertia $I$. Investigate the falling off in the rotational part of the
specific heat of H$_2$.


3. The Debye Theory of the Specific Heats of Solids. 


The law of variation of the specific heats of solids is obtained with 
much less calculation by a method due to Debye. In Debye’s theory 
the atomistic structure is thrust into the background and the coupled 
vibrations of the system of oscillators are considered to be elastic 
waves which traverse the body and are reflected at its boundaries, 
thus giving a system of standing waves. A wave corresponding to a 
given frequency has the same determining properties as a linear 
oscillator; firstly, a frequency v, and secondly, two energy components, 
corresponding to the potential and kinetic energy of the oscillator. 
For elastic waves there is a part corresponding to compression and 
one to motion; for electromagnetic waves we have the energy den- 
sities of the electric and magnetic fields. These properties completely 
determine a longitudinal wave, but for transverse waves the plane 
of polarization of the vibrations must be specified also. This may be 
determined by resolving the amplitude along two mutually perpen- 
dicular directions. Thus an unpolarized transverse wave corresponds 
to two oscillators, one for each component of the vibration. 

Following a method given by Rayleigh and Jeans, we now proceed 
to calculate the number of standing waves which can be contained in 
a body whose volume is V. For simplicity, let this be a cube of edge l. 

Imagine a wave travelling parallel to the x-axis. This wave will 
travel to and fro between the walls which are parallel to the yz- plane. 
A system of standing waves results from the superposition of the inci- 
dent and reflected waves. There is either a node or a loop (antinode) 
at the wall, depending upon the nature of the waves and of the walls. 
It is immaterial, in the following computation, whether one or the 
other condition exists. For electromagnetic waves and metallic re- 
flecting walls, for example, there must be a node of the vector E at 
the wall. In the problem of the specific heats we deal with elastic 
waves representing the vibrations of the lattice points. Only those 
waves can travel in the x-direction for which l is an integral multiple 
of half the wave-length (cf. fig. 1). Calling the phase velocity $c$, we have

$$ \frac{2l}{\lambda} = \frac{2l\nu}{c} = n \qquad (n = 1, 2, 3, \ldots). \tag{14} $$

* Consult M. Born, The Mechanics of the Atom, Eng. trans. and revision by J.
Fisher and D. Hartree (London, G. Bell & Sons, 1927).

For a cubical body, the same relationships are valid for waves
in the y- and z-directions. But these are not the only possible
standing waves. Consider, for example, a wave whose normal
$\mathbf{n} = \mathbf{j}\cos\beta + \mathbf{k}\cos\gamma$ lies in the yz-plane.
This wave will be reflected from the xy-plane,
and we must consider the reflected wave as
well as the incident wave. Now the superposition
of the two is, by p. 64, equivalent to
a wave moving parallel to the y-axis with the
phase velocity $c' = c/\cos\beta$, whose amplitude in the direction of the
z-axis is modulated with the wave-length $\lambda' = \lambda/\cos\gamma$. The boundary
condition for the planes parallel to the xz-plane thus demands that

$$ \frac{2l\nu}{c'} = \frac{2l\nu\cos\beta}{c} $$

should be a whole number, and the boundary condition for the faces
parallel to the xy-plane requires similarly that

$$ \frac{2l}{\lambda'} = \frac{2l\nu\cos\gamma}{c} $$

should be an integer.

[Fig. 1]

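Consider now a wave whose normal direction is quite arbitrary:
$\mathbf{n} = \mathbf{i}\cos\alpha + \mathbf{j}\cos\beta + \mathbf{k}\cos\gamma$.
The superposition of the incident wave
and that reflected from the yz-plane yields

$$ A + A' = A_0\,e^{i\omega\left(t - (x\cos\alpha + y\cos\beta + z\cos\gamma)/c\right)} + A_0\,e^{i\omega\left(t + x\cos\alpha/c - (y\cos\beta + z\cos\gamma)/c\right)} $$

$$ = 2A_0\cos\left(\frac{\omega x\cos\alpha}{c}\right) e^{i\omega\left(t - (y\cos\beta + z\cos\gamma)/c\right)}. $$

This wave, whose front is modulated in the direction of the x-axis, is now
reflected at the zx-plane; superposition of the two waves then gives

$$ A + A' + A'' + A''' = 2A_0\cos\left(\frac{\omega x\cos\alpha}{c}\right) e^{i\omega\left(t - y\cos\beta/c - z\cos\gamma/c\right)} + 2A_0\cos\left(\frac{\omega x\cos\alpha}{c}\right) e^{i\omega\left(t + y\cos\beta/c - z\cos\gamma/c\right)} $$

$$ = 4A_0\cos\left(\frac{\omega x\cos\alpha}{c}\right)\cos\left(\frac{\omega y\cos\beta}{c}\right) e^{i\omega\left(t - z\cos\gamma/c\right)}. \tag{15} $$

The fulfilment of the boundary conditions at the walls requires that

$$ \frac{2\nu l\cos\alpha}{c} = n_1, \qquad \frac{2\nu l\cos\beta}{c} = n_2, \qquad \frac{2\nu l\cos\gamma}{c} = n_3, \tag{16} $$

or

$$ \cos\alpha = \frac{n_1 c}{2\nu l}, \qquad \cos\beta = \frac{n_2 c}{2\nu l}, \qquad \cos\gamma = \frac{n_3 c}{2\nu l}. \tag{16'} $$

Squaring and adding,

$$ n_1^2 + n_2^2 + n_3^2 = \left(\frac{2\nu l}{c}\right)^2. \tag{17} $$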

To every frequency $\nu$ there correspond triads of positive integers which
satisfy (17), and to each triad there corresponds one direction which
may serve as a wave normal. We now seek to determine the number
of waves between the frequency limits $\nu$ and $\nu + d\nu$. In order to
compute this number we need only look upon $n_1$, $n_2$, and $n_3$ as
rectangular co-ordinates. Then (17) represents the equation of a sphere
of radius $2\nu l/c$. Since one integral lattice point corresponds to each
cube of edge unity, the number of allowed vibrations in the range $\nu$ to
$\nu + d\nu$ is evidently equal to the volume of one octant * of the spherical
shell included between the radii $2\nu l/c$ and $2(\nu + d\nu)l/c$. This is

$$ dZ = \frac{1}{8}\cdot 4\pi\cdot\frac{4\nu^2 l^2}{c^2}\cdot\frac{2l\,d\nu}{c} = \frac{4\pi\nu^2 l^3}{c^3}\,d\nu, \tag{18} $$

or, per cubic centimetre of the body, there will be

$$ \frac{dZ}{l^3} = \frac{4\pi\nu^2}{c^3}\,d\nu \tag{18'} $$

vibrations between the limits $\nu$ and $\nu + d\nu$, and this number is
independent of the particular form of the body, which fact may be rigorously
demonstrated.
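
The replacement of the count of integer triads by the volume of an octant of a spherical shell can be tested directly for moderate radii. The sketch below is an illustration under stated assumptions: the function names are hypothetical, and the radii 40 and 44 (values of $2\nu l/c$) are chosen arbitrarily. For such small radii the direct count falls a few per cent below the volume, because triads lying on the co-ordinate planes are excluded; the discrepancy becomes negligible as the radius grows.

```python
import math

def count_triads(r_lo, r_hi):
    """Count triads of positive integers with r_lo^2 < n1^2 + n2^2 + n3^2 <= r_hi^2."""
    lo2, hi2 = r_lo * r_lo, r_hi * r_hi
    n_max = int(r_hi)
    count = 0
    for n1 in range(1, n_max + 1):
        for n2 in range(1, n_max + 1):
            for n3 in range(1, n_max + 1):
                if lo2 < n1 * n1 + n2 * n2 + n3 * n3 <= hi2:
                    count += 1
    return count

def octant_shell_volume(r_lo, r_hi):
    """Volume of one octant of the shell between radii r_lo and r_hi."""
    return (math.pi / 6.0) * (r_hi ** 3 - r_lo ** 3)

exact = count_triads(40.0, 44.0)
approx = octant_shell_volume(40.0, 44.0)
print(exact, approx, exact / approx)
```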

For the elastic waves corresponding to the thermal agitation of
the atoms of a solid we have (p. 181) longitudinal as well as transverse
waves. These have different velocities, which we denote by $c_l$ and $c_t$
respectively. On account of their two azimuths of polarization, the
transverse waves must be counted twice. Accordingly, we have for
the total number of waves in the volume $v$ of one mol, between the
frequencies $\nu$ and $\nu + d\nu$,

$$ dz = 4\pi v\,\nu^2\left(\frac{1}{c_l^3} + \frac{2}{c_t^3}\right)d\nu. \tag{19} $$

According to this formula, the number of waves increases as the square
of $\nu$ and without limit, so that the number of oscillators becomes
infinite, while in reality one mol of a solid contains only $3L$ oscillators.
Debye arbitrarily assumed that the total number of waves should be
precisely $3L$. This assumption, which at first sight seems rather
forced, is justified by its results. Moreover, it results quite automatically
from the rigorous computation for a system of coupled oscillators
(see end of the preceding section). The total number of waves is obtained
by integrating over all frequency intervals from 0 to the
limiting frequency $\nu_g$. Thus $\nu_g$ is determined by the equation

$$ 3L = 4\pi v\left(\frac{1}{c_l^3} + \frac{2}{c_t^3}\right)\int_0^{\nu_g}\nu^2\,d\nu = \frac{4\pi v}{3}\left(\frac{1}{c_l^3} + \frac{2}{c_t^3}\right)\nu_g^3. \tag{20} $$

* Since only positive integral values are allowed.

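If we treat each wave as an oscillator we can proceed exactly as in
§ 2 (p. 605). The intrinsic energy of a system of $N$ oscillators of
frequency $\nu$ is, by equation (9) (p. 606),*

$$ u = \frac{Nh\nu}{e^{h\nu/kT} - 1}. $$

Thus for $dz$ oscillators we have an amount of energy

$$ du = 4\pi v\left(\frac{1}{c_l^3} + \frac{2}{c_t^3}\right)\frac{h\nu^3\,d\nu}{e^{h\nu/kT} - 1} = \frac{9L}{\nu_g^3}\,\frac{h\nu^3\,d\nu}{e^{h\nu/kT} - 1}, \tag{21} $$

and the total energy of one mol is

$$ u = \frac{9Lh}{\nu_g^3}\int_0^{\nu_g}\frac{\nu^3\,d\nu}{e^{h\nu/kT} - 1}. \tag{22} $$

We introduce the following variables into the integrand:

$$ \xi = \frac{h\nu}{kT}, \qquad \frac{h\nu_g}{k} = \Theta, \qquad x = \frac{h\nu_g}{kT} = \frac{\Theta}{T}. \tag{23} $$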


Thus $\nu_g$ enters into the characteristic temperature in place of the
characteristic frequency $\nu$ of the preceding section. We then have

$$ u = 9RT\,\frac{1}{x^3}\int_0^x\frac{\xi^3\,d\xi}{e^{\xi} - 1}. \tag{24} $$

For small values of $x$, i.e. high temperatures, development of the
denominator in a power series yields

$$ \frac{1}{x^3}\int_0^x\frac{\xi^3\,d\xi}{\xi + \frac{\xi^2}{2} + \frac{\xi^3}{6} + \ldots} = \frac{1}{x^3}\int_0^x\left(\xi^2 - \frac{\xi^3}{2} + \frac{\xi^4}{12} - \ldots\right)d\xi = \frac{1}{3} - \frac{x}{8} + \frac{x^2}{60} - \ldots \tag{25} $$

* In equation (9) we were dealing with $L$ oscillators.

and

$$ u = 3RT - \frac{9}{8}R\Theta + \frac{3}{20}\,\frac{R\Theta^2}{T} - \ldots \tag{26} $$

If the series is terminated with the first term, differentiation again
gives the specific heat in harmony with the Law of Dulong and Petit.
The higher terms thus give the drop in the specific heat at lower
temperatures.

For the lower portion of the curve $c_v = f(T)$, i.e. for large values of
$x$, we can write $\infty$ as the upper limit, since the high values of $\xi$
contribute very little to the integral on account of the occurrence of $e^{\xi}$
in the denominator. The power series development of $\xi^3(e^{\xi} - 1)^{-1}$
yields

$$ \xi^3\left(e^{\xi} - 1\right)^{-1} = \xi^3 e^{-\xi}\left(1 - e^{-\xi}\right)^{-1} = \xi^3 e^{-\xi} + \xi^3 e^{-2\xi} + \xi^3 e^{-3\xi} + \ldots $$

and term by term integration between 0 and $\infty$ gives the series *

$$ \int_0^\infty\frac{\xi^3\,d\xi}{e^{\xi} - 1} = 6\left(1 + \frac{1}{2^4} + \frac{1}{3^4} + \frac{1}{4^4} + \ldots\right). $$

The sum of the series $\sum(1/n^4)$ is $\pi^4/90 = 1.0823$. Finally, in the
neighbourhood of the zero point, we have

$$ u = 58.45\,\frac{RT^4}{\Theta^3}, \tag{27} $$

and for $c_v$ near $T = 0$,

$$ c_v = 233.8\,R\,\frac{T^3}{\Theta^3}. \tag{28} $$

This is the empirically well substantiated $T^3$ law for the specific heat
of a solid.

For further details, consult P. Debye, Ann. der Physik, 39, p. 789 
(1912). 
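
Both limiting forms of the Debye energy can be compared with a direct numerical evaluation of the integral. The following is a sketch under stated assumptions: the function names are hypothetical, the integration variable is written `xi`, and Simpson's rule with a fixed number of steps is an arbitrary choice of method.

```python
import math

def debye_integral(x, steps=2000):
    """Simpson's rule for the integral of xi^3 / (e^xi - 1) from 0 to x.

    steps must be even; the integrand tends to 0 as xi tends to 0.
    """
    def g(xi):
        return xi ** 3 / math.expm1(xi) if xi > 0.0 else 0.0

    h = x / steps
    total = g(0.0) + g(x)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * g(i * h)
    return total * h / 3.0

def energy_over_3RT(x):
    """u / (3RT) = (3 / x^3) * integral, with x = Theta / T."""
    return 3.0 / x ** 3 * debye_integral(x)

# High temperatures (small x): compare with 1 - 3x/8 + x^2/20.
x = 0.1
print(energy_over_3RT(x), 1.0 - 3.0 * x / 8.0 + x * x / 20.0)

# Low temperatures (large x): the integral approaches pi^4 / 15.
print(debye_integral(50.0, steps=20000), math.pi ** 4 / 15.0)
print(9.0 * math.pi ** 4 / 15.0)   # numerical coefficient of R T^4 / Theta^3
```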


* Since $\int_0^\infty \xi^3 e^{-n\xi}\,d\xi = 6/n^4$.




CHAPTER XXXVII 

The Theory of Thermal Radiation 

1. Analogy between a Gas and Radiation filling an Enclosure. Radia- 
tion Pressure. 

The experiences of everyday life show that a hot body radiates 
light, i.e. electromagnetic waves. The problem of calculating the total 
amount of radiation emitted by unit area of a surface per second, and 
of determining its spectral distribution, is thus as much a matter for 
the Theory of Heat as for Electromagnetism. 

First, we must ask whether the radiation of a hot body depends
only upon its own properties and not also upon those of the environment.
At first sight it might appear that, in analogy with thermal
conduction, the radiation sent to a cool neighbouring body is more
intense than that sent to a warmer body. An important advance in
the solution of the problem formulated above was made by Prévost
when he recognized that the rate of radiation is independent of the
surroundings, and that the apparent dependence upon the environment
is due to the fact that the cooler body in turn sends energy back
to the radiator but at a smaller rate than does a hot body, so that the
[…] which is presented to the colder body cools off
more rapidly. In this manner the number of variables upon which
the radiation of a hot body depends is considerably reduced. Further,
we shall find that the radiation from any body may be computed if
[…] the problem of calculating the radiation from a perfectly
[…] by electromagnetic waves […]
be changed at will so that […] enclosed in a volume $V$ may
[…] in addition to the […]
energy density of the confined radiation is already completely fixed
by the temperature. But just as the atoms of a gas exert pressure on
the walls, so likewise does the confined radiation. This radiation
pressure, whose magnitude will be calculated below, is very small
under conditions available in the laboratory, but may attain values,
in the interior of the stars, comparable with the gas pressure.

We know from experience that the spectrum of a glowing “ black ” 
body is continuous, i.e. all frequencies are present in the radiation. 
The problem of calculating the distribution of energy among the 
several frequencies is very similar to that of determining the distri- 
bution of the energy of a gas among the various molecular velocities. 
The superficial similarity between the experimentally determined 
energy distribution curve and the Maxwell distribution curve (fig. 1, 
p. 592) was recognized from the very first. No atom of a gas neces- 
sarily possesses exactly any selected velocity; we can only specify 
the number of atoms whose velocities fall within a given velocity 
interval $\Delta c$. In the same way, there is not a given amount of radiant
energy corresponding to a single frequency $\nu$, but only an energy
interval $\Delta U$ corresponding to a prescribed range $\Delta\nu$, i.e. it is only
the quotient $\Delta U/\Delta\nu$ which is finite. To prevent misunderstandings,
which occasionally appear in the literature, it must be pointed out 
that these considerations hold only for the continuous spectra result- 
ing from temperature radiation. The statement occasionally made that 
there can be no sharp spectral lines, i.e. finite quantity of energy at 
a single frequency v, on account of thermodynamic reasons, is erroneous. 
If the excitation is not thermal, there is no reason for believing that 
a sharp line cannot be emitted. The fact that even in this case the line 
will really have a finite (but small) width is due to other causes. 

Before attacking the actual problem of computing the distribution
of energy in a radiation-filled enclosure, we shall deduce the value of
the radiation pressure from the electrical field equations. We consider
only the simplest case, that of a plane wave incident normally
upon the plane surface of a perfect conductor. The ideal case $\sigma = \infty$
implies a perfectly reflecting surface. An electric field cannot exist
within a perfect conductor, for if a field did exist, it would cause
infinite currents to flow; we may suppose, however, for convenience of
calculation that $\sigma$ is merely very great, not infinite, in which case
there will be currents very near the surface of the conductor. Allow
the incident wave to travel in the direction of the positive x-axis,
and let the surface of the conductor be the yz-plane. The plane of
vibration of $\mathbf{E}$, which is still undetermined, is taken to be the xy-plane.
The equation of the incident wave is then

$$ E_y = A\,e^{i\omega(t - x/c)}. \tag{1} $$

The boundary condition concerning the continuity of the tangential
components of $\mathbf{E}$ requires that the electric field strength be always
zero for $x = 0$. (For perpendicular incidence there exists no normal
component of $\mathbf{E}$ whose discontinuity would give rise to a surface
charge. There is such a component, however, for oblique incidence.)
The boundary conditions are satisfied by adding a reflected wave of
equal amplitude whose phase differs from that of the original wave
by $\pi$,

$$ E_y' = -A\,e^{i\omega(t + x/c)}. \tag{2} $$

The perfect reflection of the mirror is represented by the equality of
$|E_y'|$ and $|E_y|$. Exactly as on p. 330, the corresponding magnetic
vectors are determined from Maxwell's equations:

$$ H_z = A\,e^{i\omega(t - x/c)}, \qquad H_z' = +A\,e^{i\omega(t + x/c)}, \qquad H_x = H_x' = H_y = H_y' = 0. \tag{3} $$

This indicates that the magnetic field does not vanish at the surface
of the conductor, inasmuch as $H_z$ and $H_z'$ have the same sign. Since
currents are flowing very close to the surface of the conductor, the
magnetic field will exert a force on the conductor. By the results of
p. 310, the force on a volume element $d\tau$ traversed by a current whose
density is $\mathbf{i}$ is given by

$$ d\mathbf{F} = \frac{1}{c}\,[\mathbf{i}\times\mathbf{H}]\,d\tau. \tag{4} $$

Now the vector $\mathbf{i}$ may be expressed in terms of curl $\mathbf{H}$. Since $\mathbf{E}$ and
$\partial\mathbf{E}/\partial t$ vanish within the conductor, the first field equation yields

$$ \mathbf{i} = \frac{c}{4\pi}\,\mathrm{curl}\,\mathbf{H}. $$

Since only $H_z$ differs from zero, curl $\mathbf{H}$ has only a y-component, and
we have

$$ i_y = -\frac{c}{4\pi}\,\frac{\partial H_z}{\partial x}. $$

Substituting this in equation (4), there remains only an x-component
of the force:

$$ dF_x = \frac{1}{c}\,i_y H_z\,d\tau = -\frac{1}{4\pi}\,H_z\frac{\partial H_z}{\partial x}\,d\tau. $$

Take as a volume element a cylinder of unit cross-section and altitude
$dx$. To find the total force acting on this cylinder, we integrate from
0 to $\infty$:

$$ F_x = -\frac{1}{4\pi}\int_0^\infty H_z\frac{\partial H_z}{\partial x}\,dx = \frac{1}{8\pi}\left(H_z^2\right)_{x=0}. $$

Since even for finite conductivity of the metal the wave penetrates
but a very small distance, the magnetic field strength becomes zero for
$x = \infty$, while, according to (3), it becomes $|2Ae^{i\omega t}| = 2|\mathbf{H}|$ for $x = 0$.
This is the force acting on unit area of the surface; we may therefore
call it the pressure of the radiation. Since for a plane wave in a vacuum

$$ |\mathbf{E}| = |\mathbf{H}| \quad\text{and}\quad |\mathbf{S}| = \frac{c}{4\pi}\,|\mathbf{E}|\,|\mathbf{H}|, $$

this pressure is, for normal incidence,

$$ p = \frac{|\mathbf{H}|^2}{2\pi} = \frac{2\,|\mathbf{S}|}{c}. $$

In this formula $|\mathbf{E}|$, $|\mathbf{H}|$ and $|\mathbf{S}|$ signify the instantaneous values
of the periodically varying quantities. What we observe, however,
is the average in time, $\bar{p}_n$. This is obtained by inserting the time
average of the Poynting vector on the right side of the equation.
According to equation (60) (p. 341), this is equal to half the maximum
amplitude, so that we have

$$ \bar{p}_n = \frac{2\bar{S}}{c} = \frac{|\mathbf{S}|_{\max}}{c}. $$


Thus the pressure of radiation is proportional to the normally incident
energy flux $S$. For oblique incidence we have, in addition, an electrical
force acting on the surface charge. The final formula has $\cos^2\theta$ as an
additional factor, where $\theta$ is the angle of incidence.

The existence of such a pressure was first demonstrated experimentally
and measured by Lebedew (1900), and in an extensive series
of experiments conducted by Nichols and Hull (1903). The pressure
found is in close agreement with the theoretical value.
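
To give an idea of the magnitudes involved, the time-averaged pressure $2\bar{S}/c$ on a perfect mirror, with the factor $\cos^2\theta$ for oblique incidence, may be evaluated for a given flux. This is a sketch only: the function name is hypothetical, SI units are used, and the flux of 1360 watts per square metre (roughly that of sunlight at the Earth) is an assumed illustrative value, not a figure taken from the text.

```python
import math

C_LIGHT = 2.99792458e8  # velocity of light in m/s

def radiation_pressure(mean_flux, angle=0.0):
    """Time-averaged pressure (Pa) on a perfectly reflecting surface.

    mean_flux is the time-averaged energy flux of the beam in W/m^2 and
    angle the angle of incidence in radians.
    """
    return 2.0 * mean_flux / C_LIGHT * math.cos(angle) ** 2

# Assumed illustrative flux: about 1360 W/m^2.
print(radiation_pressure(1360.0))
print(radiation_pressure(1360.0, angle=math.radians(60.0)))
```

The result is of the order of $10^{-5}$ pascal, which illustrates why the pressure is so small under laboratory conditions.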


2. The Connexion between Energy Density and Surface Brightness. 

As already mentioned, the surface brightness is the radiation 
quantity which is experimentally measurable, while it is the energy 
density which is calculated by the theory. We therefore begin by 
finding the connexion between these two quantities. Consider an 
enclosure bounded by walls which are perfect heat insulators. Assume 
that bodies of any kind may be inserted into this space. Assume 
further that the dimensions of these bodies are large compared with 
the wave-length of the radiation, so that diffraction and scattering 
need not be taken into consideration. As a matter of fact, we wish to 
consider even differential segments large compared with the wave- 
length. Likewise, we wish to assume that the time intervals and their 
differentials are large compared with the period of the waves, so that 
the mean values of the radiation quantities are sufficiently defined. 
On account of the finite velocity of propagation of the electromagnetic
waves streaming through the space, there is a definite electromagnetic
energy $du$ in each element of volume $d\tau$. The limit $du/d\tau$ is called the
energy density $\rho$. This is spectrally analysed, so that the energy
density $\rho_\nu\,d\nu$ corresponds to the frequency interval $d\nu$. The task of
the theory is to determine $\rho_\nu$ for each interval as a function of the
temperature of the enclosure. The surface brightness is defined as the
amount of energy within the frequency interval $d\nu$ radiated perpendicularly
from an element of surface, per second per square centimetre,
within unit solid angle. This quantity is represented by
$2K_\nu\,d\nu$; the factor 2 is present because the radiation can be resolved
in two mutually perpendicular planes of polarization. Thus $K_\nu$ is the
surface brightness for linearly polarized radiation. Since we are
considering isotropic bodies only, the surface brightness for unpolarized
radiation is twice this quantity. The introduction of an angle
of divergence (solid angle) of the pencil of rays is necessary on
account of the fact that a source of light of finite surface brightness
can produce no exactly parallel pencil. In order to obtain such a
pencil one must have a perfect point source at the focus of a lens. If the
beam coming from the lens is to have a finite intensity, the surface
brightness of the point (i.e. extensionless) source must be infinite.

[Fig. 1]

If the normal to the surface which is emitting the
radiation makes an angle $\theta$ with the axis of the elementary cone, then,
as may be seen from fig. 1, the energy flux per square centimetre is
reduced to $\cos\theta$ times the value for normal emergence, for at oblique
emergence the surface concerned in the radiation is increased by the
factor $1/\cos\theta$. This is known as Lambert's Law. It is a consequence
of this law, for example, that a glowing cylinder appears to be a uniformly
illuminated rectangle and the sun gives the impression of being
a uniformly bright disc. The portions of the surface near the rim are
foreshortened, but radiate correspondingly less in the oblique direction.
Lambert's Law holds only for "black" thermal radiation and
is no longer valid, for example, for the X-radiation produced by the
impact of electrons upon the anticathode; in this case it is possible
to obtain increased intensity by observing obliquely.

Accordingly we have for the energy within the frequency range
$d\nu$ radiated in the time $dt$ by the surface $d\sigma$ in a solid angle

$$ d\Omega = \sin\theta\,d\theta\,d\phi, $$

whose axis forms an angle $\theta$ with the normal to the surface,

$$ d^6u = d^5u_\nu\,d\nu = d\nu\,dt\,d\sigma\cos\theta\,d\Omega\cdot 2K_\nu. \tag{9} $$

As long as we consider only isotropic bodies, $K_\nu$ is independent of the
direction, and we obtain the radiation within $d\nu$ emitted by $d\sigma$ in the
time $dt$ by integration over the hemisphere:

$$ d^4u = d^3u_\nu\,d\nu = 2\,d\nu\,dt\,d\sigma\,K_\nu\int_0^{2\pi}\!\!\int_0^{\pi/2}\cos\theta\sin\theta\,d\theta\,d\phi = 2\pi\,dt\,d\sigma\,K_\nu\,d\nu. \tag{10} $$

For the total radiation we integrate over the entire spectrum:

$$ d^3u = 2\pi\,dt\,d\sigma\int_0^\infty K_\nu\,d\nu. \tag{11} $$
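
The factor $2\pi$ in equation (10) arises from the two directions of polarization together with the integral of $\cos\theta\sin\theta$ over the hemisphere, which equals $\pi$. A minimal numerical check of that integral is sketched below; the function name and the use of the midpoint rule are illustrative assumptions.

```python
import math

def hemisphere_factor(steps=1000):
    """Midpoint rule for the integral of cos(theta) sin(theta) over the hemisphere.

    theta runs from 0 to pi/2; the integration over phi contributes 2*pi.
    """
    d_theta = (math.pi / 2.0) / steps
    inner = sum(
        math.cos((i + 0.5) * d_theta) * math.sin((i + 0.5) * d_theta)
        for i in range(steps)
    ) * d_theta
    return 2.0 * math.pi * inner

print(hemisphere_factor(), math.pi)
```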


If, now, we consider a volume element $d\tau$ within a medium which
is being permeated by radiation, this element
will be traversed by a large number of
pencils of rays originating outside it. Consider
a definite portion of the neighbourhood
of $d\tau$ bounded by a sphere of radius
$r$ whose centre is at the centre of $d\tau$ (fig. 2).
Resolve the radiation sent to $d\tau$ from a very
small element of surface $d\sigma$ of this sphere into
elementary conical pencils. The portions
of these cones within $d\tau$ may be considered
to be small cylinders. If the cross-section
of such a cylinder is $dS$, the solid angle $d\Omega$
of the small cone is $dS/r^2$, and the amount of energy radiated by $d\sigma$
into this cone per second is

$$ \frac{d^5u}{dt} = 2\,d\nu\,d\sigma\,K_\nu\,\frac{dS}{r^2}. \tag{12} $$

[Fig. 2]

This may be looked upon as the part of the energy flux within the
elementary cone originating at $d\sigma$. The density of the energy flow is
then obtained by dividing by $dS$. Now for any flow, the current density
of any quantity is equal to the product of the density of the quantity
by the velocity of flow (see, for example, p. 418). The latter is $c/n$,
where $n$ is the index of refraction of the medium. Thus the energy
density in that part of the volume element pierced by the given cone
which is due to the radiation from $d\sigma$ is given by

$$ d^3\rho = d^2\rho_\nu\,d\nu = \frac{2\,d\nu\,d\sigma\,K_\nu}{r^2\cdot c/n}. \tag{13} $$

This value holds for all parts of the element of volume $d\tau$, since the
other cones which penetrate $d\tau$ differ infinitesimally in direction;
hence $d^3\rho$ gives the energy density in $d\tau$ due to the radiation from $d\sigma$.
We then have by integration over all surface elements of the sphere

$$ d\rho = \rho_\nu\,d\nu = \frac{8\pi K_\nu}{c/n}\,d\nu, \tag{14} $$

or

$$ \rho_\nu = \frac{8\pi K_\nu}{c/n}. \tag{15} $$



3. The Connexion between Emission and Absorption in Thermo- 
dynamic Equilibrium. Kirchhoff’s Law. 

Consider a uniform medium within the enclosure. At equilibrium, 
every element of volume of this medium will radiate a certain amount 
of energy each second, i.e. will convert a given amount of heat energy 
into radiation, and will absorb an equal amount of radiation delivered 
by the environment, converting it into heat. With regard to the process
of emission and absorption, it must be remembered that this takes
place in the atoms of the substance, so that a certain volume is necessary
for the emission of a given quantity of energy. A mathematical
surface contains no atoms, and so can emit no stated amount of energy.
It is true that we often speak of the emissive power of a surface; in
such cases we mean the total energy which streams through the surface
from the interior of the body. Hence we define the emissivity
or coefficient of emission $\epsilon_\nu\,d\nu$ as the energy of linearly polarized
radiation within the frequency interval $d\nu$, referred to unit volume and
unit solid angle, emitted in unit time. Then the total amount of
unpolarized energy in the range $d\nu$ emitted by the volume element $d\tau$
in the solid angle $4\pi$ in the time $dt$ is

$$ d^5u_e = 8\pi\,d\nu\,dt\,d\tau\,\epsilon_\nu. \tag{16} $$


The absorption coefficient $\alpha_\nu$ for frequencies in the range $\nu$ to $\nu + d\nu$
is defined in the manner which is usual in optics as the fraction of the
incident radiation absorbed per second in a thin layer, referred to unit
thickness. We again surround the element of volume by a sphere of
radius $r$. The rays emanating from an element $d\sigma$ of the sphere and
penetrating the element of volume fill a solid angle which again may
be divided into elementary cones; the portions of these cones
within the volume element may be considered cylinders, as before.
If these cylinders are of altitude $dl_i$ and of cross-section $dS_i$, then the
total amount of energy absorbed in the time $dt$ from the unpolarized
pencil coming from $d\sigma$ is given by

$$ d^7u_a = 2\,dt\,d\nu\,\frac{\sum dl_i\,dS_i}{r^2}\,\alpha_\nu\,d\sigma\,K_\nu. \tag{17} $$

Now $\sum dl_i\,dS_i$ is the volume of the element $d\tau$. Integration over all
surface elements of the sphere yields the following expression for the
total amount of energy absorbed from the environment:

$$ d^5u_a = 8\pi\,dt\,d\nu\,d\tau\,\alpha_\nu K_\nu. \tag{18} $$

If we set the emitted energy equal to that absorbed, we obtain simply

$$ \frac{\epsilon_\nu}{\alpha_\nu} = K_\nu. \tag{19} $$

For any uniform medium in thermodynamic equilibrium, the ratio of
the emission coefficient to the absorption coefficient for any spectral region
is equal to the surface brightness. This statement is often referred to as
Kirchhoff's Law, but the latter refers to two other quantities, especially
in its experimental applications, as we shall see immediately.

If the enclosure is filled with a medium which is perfectly transparent
to all rays, e.g. a vacuum (or even air, as long as the system is
at temperatures available in the laboratory), then while it is true that
$K_\nu$ is defined by the energy passing through a surface, the fraction
$\epsilon_\nu/\alpha_\nu$ is of the form 0/0. But the enclosure is bounded by walls of some
kind, from which the radiation comes. We therefore investigate the
radiation exchange at the boundary. The amount of energy passing
through an area $d\sigma$ of the wall within a solid angle $d\Omega$ whose axis makes
an angle $\theta$ with the normal, in the time $dt$, is

$$ d^6u = 2\,d\nu\,dt\,d\sigma\,d\Omega\cos\theta\,K_\nu. \tag{20} $$


Since there is thermodynamic equilibrium, the same amount of energy 
must pass through this same surface element in the opposite direction. 
Of this energy flow, only a part determined by the emissive power $E_\nu$
will come from the interior of the material constituting the wall, the
rest being radiation which is reflected by the wall. The fraction of the
incident radiation which is absorbed in the substance of the wall is
called the absorptive power $A_\nu$ for the frequency range $\nu$ to $\nu + d\nu$.
The part $1 - A_\nu$ is reflected at the surface of the wall. The energy
equation at this surface is then

$$2\,d\nu\,dt\,d\sigma\,d\Omega\cos\theta\,K_\nu = (1 - A_\nu)\,2\,d\nu\,dt\,d\sigma\,d\Omega\cos\theta\,K_\nu + 2\,d\nu\,dt\,d\sigma\,d\Omega\cos\theta\,E_\nu, \tag{21}$$

whence

$$\frac{E_\nu}{A_\nu} = K_\nu. \tag{22}$$

This is Kirchhoff’s Law. The ratio of the emissive power to the absorptive
power of a body is equal to the surface brightness in the adjacent
diathermanous medium. The difference between the concepts of emissive and
absorptive power introduced here and the emission and absorption
coefficients is that the former depend upon the thickness of the
body. Even if the absorption coefficient is small, a sufficiently deep
layer can destroy practically all the incident energy. A body for
which $A_\nu = 1$ is called a perfectly black body. For example, a sphere
of gas (representing, say, a star) may be considered a black body on
account of the extremely great depth of material, although the
absorption coefficient of the gases is very small.

The fact that the emissive power of a black body is equal to the 
surface brightness in the adjacent medium makes possible the con- 
struction of a black body in a simple way. While black substances 
like platinum black absorb visible light completely, they are no longer
“ black ” for longer waves, so that the use of such substances as black 
bodies may lead to deception. According to what has been said, a 
black body free from these objections is obtained by taking a large 
enclosure in the wall of which there is a small hole whose diameter 
must be so small compared with the size of the container that the 
radiation passing outward through the hole is negligible; its surface 
brightness is that which obtains in the diathermanous medium. The 
surface of this hole then represents that of a perfect black body. The 
effectiveness of this device may be seen by cutting a small hole in the 
side of a box made of black paper. The hole appears to be much 
blacker than the paper. In fact, the emissive power of a black body 
is experimentally determined by measuring the surface brightness of 
a small opening in the side of a uniformly heated furnace. 

According to Kirchhoff’s Law the thermal emission of any other 
body is obtained, for any spectral region, by multiplying the black 
body value by the spectroscopically determined absorptive power. 
The emissivity of a non-black body must be less than that of a black 
body for all colours, and can attain the black body value only in the 
regions of strong selective absorption. In instances where the emis- 
sivity exceeds that of a black body, e.g. for a gas mantle, the excitation 
is not purely thermal. Here, for example, the gas in the region of com- 
bustion is by no means in thermal equilibrium, and the glow is due in 
part to chemiluminescence. The same is true of “ cold ” flames, e.g. 
the zone of reaction of a sodium vapour jet in chlorine, which gives 

the yellow sodium line with considerable intensity at temperatures 
as low as 600° C. 


4. Planck’s Law of Radiation. 

The relation between the emissive power of a black body and the
energy density of radiation in an enclosure having been made clear,
it remains only to calculate the energy density by means of statistics.
All the necessary computational work has already been done in
Chap. XXXVI, §3 (p. 608). Consider the enclosure to be a cube of edge l.
Since the electromagnetic waves traversing this space are pure
transverse waves, the number of possible frequencies in the range
$\nu$ to $\nu + d\nu$ is again given by

$$dZ = \frac{8\pi l^3\nu^2}{c^3}\,d\nu. \tag{23}$$
Each wave is statistically equivalent to an oscillator, and since, by
equation (9) (p. 606), the energy of a system consisting of N oscillators
of frequency $\nu$ is

$$U = \frac{Nh\nu}{e^{h\nu/kT} - 1}, \tag{24}$$

the energy density within the interval $d\nu$, per cubic centimetre, is

$$\rho_\nu\,d\nu = \frac{8\pi h\nu^3}{c^3\left(e^{h\nu/kT} - 1\right)}\,d\nu. \tag{25}$$

Hence for unpolarized radiation the emissive power of a black body,
in the range $d\nu$, becomes

$$2E_\nu\,d\nu = 2K_\nu\,d\nu = \frac{2h\nu^3}{c^2\left(e^{h\nu/kT} - 1\right)}\,d\nu. \tag{26}$$


This is the celebrated Planck Radiation Law. The curve $E_\nu(\nu)$ has
the typical form of a Maxwell velocity distribution curve (cf. fig. 1,
p. 592): first, a steep increase to a maximum, then an exponential
decline toward higher frequencies. This exponential decay is
characteristic of the Quantum Theory. The Rayleigh-Jeans Law, based on the
equipartition of energy, requires the number of waves in a given
interval to increase as $\nu^2$, so that it would yield a continued increase
of energy toward high frequencies, i.e. toward the ultra-violet, which
is contrary to experience.
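
The contrast between the two laws can be made concrete numerically. The following is a minimal sketch, not part of the original text: it evaluates the Planck expression $2h\nu^3/[c^2(e^{h\nu/kT}-1)]$ and its equipartition limit $2\nu^2kT/c^2$. The constants are modern CGS values and the temperature of 1500 deg. is an arbitrary illustrative choice; both are assumptions.

```python
import math

# Assumed modern CGS values (not taken from the text).
H = 6.626e-27   # Planck constant, erg s
K = 1.381e-16   # Boltzmann constant, erg/deg
C = 2.998e10    # velocity of light, cm/s

def planck(nu, T):
    """Unpolarized black-body emissive power 2*E_nu from Planck's law."""
    return 2.0 * H * nu**3 / (C**2 * math.expm1(H * nu / (K * T)))

def rayleigh_jeans(nu, T):
    """Equipartition (Rayleigh-Jeans) value: energy kT per wave."""
    return 2.0 * nu**2 * K * T / C**2

T = 1500.0  # illustrative temperature, assumed
for nu in (1e12, 1e13, 1e14, 1e15):
    print(f"nu = {nu:.0e}  Planck = {planck(nu, T):.3e}  "
          f"Rayleigh-Jeans = {rayleigh_jeans(nu, T):.3e}")
```

At low frequencies the two columns agree; toward the ultra-violet the Rayleigh-Jeans value keeps growing as $\nu^2$ while the Planck value is cut off exponentially.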

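If it is desired to express Planck’s Law in terms of the wave-length,
it must be remembered that for a spectral interval expressed in $d\nu$
and again in $d\lambda$,

$$E_\nu\,d\nu = E_\lambda\,d\lambda,$$

and that

$$|d\nu| = \frac{c}{\lambda^2}\,|d\lambda|.$$

The radiation law (26) then becomes

$$2E_\lambda\,d\lambda = \frac{2hc^2}{\lambda^5}\,\frac{d\lambda}{e^{hc/kT\lambda} - 1}. \tag{27}$$
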

Fig. 3 is a graphical representation of the content of this equation. 

The Law of Planck contains all other laws of radiation. The Wien 
Displacement Law, which was derived from quite different con- 
siderations by W. Wien before the Planck law was known, relates to 
the position of the maximum of the energy distribution curve. It is 
especially simple when expressed in terms of $\lambda$: differentiating the
right member of (27) and setting the result equal to zero, we have

$$-\frac{10hc^2}{\lambda^6\left(e^{hc/kT\lambda} - 1\right)} + \frac{2hc^2\,e^{hc/kT\lambda}}{\lambda^5\left(e^{hc/kT\lambda} - 1\right)^2}\cdot\frac{hc}{kT\lambda^2} = 0.$$
Eliminating the common factors and introducing the variable
$x = hc/kT\lambda$, this becomes the transcendental equation

$$\frac{xe^x}{e^x - 1} = 5.$$

We see at once that there must be a root in the neighbourhood of 5.
Applying the usual methods of approximation, the exact value is
found to be 4.965. This is the only real root, as may be seen readily
by drawing the curve. Hence we have

$$\lambda_{\max} = \frac{hc}{4.965\,kT}$$

or

$$\lambda_{\max}T = b = 0.2896\ \text{cm. deg.}$$

This is Wien’s Displacement Law, which states that as the temperature
of a black body is increased, the position of maximum emission moves
in the direction of shorter waves in such way that the product $\lambda_{\max}T$
remains constant.
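
The root quoted above is easily checked. The sketch below is an illustration only: it rewrites $xe^x/(e^x-1)=5$ as $x = 5(1-e^{-x})$ and iterates from the starting value 5. The fixed-point scheme and the modern CGS constants are assumptions of this sketch, not the method of the text.

```python
import math

# Assumed modern CGS values (not taken from the text).
H = 6.626e-27   # Planck constant, erg s
K = 1.381e-16   # Boltzmann constant, erg/deg
C = 2.998e10    # velocity of light, cm/s

def wien_root(x=5.0, tol=1e-12, max_iter=100):
    """Solve x e^x / (e^x - 1) = 5, written as x = 5 (1 - e^-x)."""
    for _ in range(max_iter):
        x_new = 5.0 * (1.0 - math.exp(-x))
        if abs(x_new - x) < tol:
            return x_new
        x = x_new
    return x

x0 = wien_root()
b = H * C / (x0 * K)   # Wien constant lambda_max * T, cm deg
print(f"root x = {x0:.4f},  b = {b:.4f} cm deg")
```

The iteration converges in a few steps because the derivative of the right-hand side, $5e^{-x}$, is only about 0.03 near the root. The resulting constant lies close to the value quoted in the text; the last digit depends on the values adopted for h, c and k.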

The total radiation is obtained by integrating over all frequencies;
with $x = h\nu/kT$,

$$2\int_0^\infty E_\nu\,d\nu = \frac{2k^4T^4}{c^2h^3}\int_0^\infty \frac{x^3\,dx}{e^x - 1}.$$

This integral has been evaluated on p. 612, where its value was found
to be $\pi^4/15$. Thus we have

$$2\int_0^\infty E_\nu\,d\nu = \frac{2\pi^4k^4}{15c^2h^3}\,T^4 = \sigma'T^4.$$

This is the Stefan-Boltzmann Law, according to which the total energy
emitted by a black body varies as the fourth power of the absolute
temperature. Usually, however, we do not give the energy radiated
normally into a unit solid angle, but rather the total energy radiated on
one side of a plane, i.e. into a solid angle $2\pi$. The constant of
proportionality $\sigma$ so defined then has the value $\pi\sigma'$, according to equation
(10) (p. 618), so that

$$\sigma = \frac{2\pi^5k^4}{15c^2h^3} = 5.68\times10^{-5}\ \frac{\text{erg}}{\text{cm.}^2\ \text{deg.}^4\ \text{sec.}}$$
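
As a check on the two numbers just quoted, the following sketch evaluates the integral $\int_0^\infty x^3\,dx/(e^x-1)$ by Simpson's rule and then forms $\sigma = 2\pi^5k^4/15c^2h^3$. The quadrature rule, the cut-off at x = 60 and the modern CGS constants are assumptions made for illustration.

```python
import math

# Assumed modern CGS values (not taken from the text).
H = 6.626e-27   # Planck constant, erg s
K = 1.381e-16   # Boltzmann constant, erg/deg
C = 2.998e10    # velocity of light, cm/s

def integrand(x):
    """x^3 / (e^x - 1), with the limiting value 0 at x = 0."""
    return x**3 / math.expm1(x) if x > 0.0 else 0.0

def simpson(f, a, b, n=20000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

integral = simpson(integrand, 0.0, 60.0)   # the tail beyond 60 is negligible
sigma = 2.0 * math.pi**5 * K**4 / (15.0 * C**2 * H**3)
print(f"integral = {integral:.6f},  pi^4/15 = {math.pi**4 / 15:.6f}")
print(f"sigma = {sigma:.3e} erg / (cm^2 deg^4 sec)")
```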


Ex. 130. Using a cyclic process, deduce the Stefan-Boltzmann Law, leaving
the factor of proportionality undetermined. The procedure resembles that by
which the vapour pressure formula was obtained (on p. 534 et seq.). Show also
that the pressure of radiation amounts to $p = \rho/3$.



CHAPTER XXXVIII 


The Bose-Einstein and the Fermi-Dirac Statistics


1. Preliminary Remarks concerning the Duality of Waves and Corpuscles.

In the preceding chapter we obtained the law of distribution of 
energy in an enclosure filled with radiation by counting the number 
of electromagnetic waves which could exist there. Now there are 
numerous phenomena where radiation is converted into energy which 
is connected with matter — kinetic energy of electrons, chemical energy, 
&c. In such instances light behaves as though it consisted of discrete 
corpuscular elements of energy, light quanta of energy hv. On the other 
hand, the corpuscular idea is not capable of including all properties 
of radiation, for an immense number of interference phenomena
demonstrate the wave-like character of light. In Part VII of this 
work we shall bring out how the wave concept and the particle con- 
cept are two aspects of the same thing, and how the facts may be 
represented only by a combination of both ideas. We shall arrive at 
the conclusion that the light quanta are guided by the wave field, 
and that it is not possible to specify the positions of the quanta inside 
a wave. In view of the analogy between an enclosure filled with radia- 
tion and one filled with gas, it is to be expected that a corpuscular 
theory will also yield the radiation law correctly. When, however, 
we calculate the energy distribution of the quanta according to the 
Boltzmann statistics, we obtain the incorrect radiation law in which 
the energy is displaced into the short wave end of the spectrum. The
underlying reason for the failure of this method is that we assume the
light quanta to be independent of one another when applying the
Boltzmann method. This independence certainly does not exist: if
two coherent waves (bundles of quanta) interfere, this implies a mutual 
influence between the quanta, leading to a decrease in intensity at 
certain points and an increase at others. At any rate, a displacement
occurs.* The correct radiation law is obtained by a new
statistical method introduced by Bose.

* It must be remembered that there is no actual “destruction” of light in interference.
Further, a number of recent experiments indicate that it is
also incorrect to assume the independence of particles even when
dealing with electrons and with entire atoms. There are guiding waves
for these particles too, and these waves exhibit the properties of
diffraction and interference. These ideas will be discussed in greater
detail later (p. 687). At this point we wish to say only
that the statistics of direct intuition, as we may call the Boltzmann
statistics, is not applicable. We may then build up a new
form of statistics either in terms of waves or in terms of corpuscles.
The wave form would have the advantage that its deductions are more
illuminating, once we assume the existence of the guiding waves.
Nevertheless the results are often needed in the corpuscular form.
It is preferable, therefore, to employ the statistics in terms of
corpuscles; this is the form deduced by Bose, by means of a happy
artifice, for light quanta. For ordinary gases the difference between this
and the classical statistics is so small that it has not yet been found 
possible to distinguish between the two experimentally. For an elec- 
tron gas, however, the situation is such that for densities of the order 
of that attributed to the electron gas inside metals the results accord- 
ing to the two views are totally different. For the case of electrons, 
however, the Bose statistics must be supplemented so as to conform 
with a certain general principle of atomic structure. In this
way we arrive at a new form of statistics due to Fermi — a form 
which is capable of solving many of the problems relating to electrons 
in metals. 


2. The Bose-Einstein Statistics. 

The essential characteristic of the statistics devised for light quanta
by Bose * and applied to actual gases by Einstein † is that no
distinction is made between individual atoms, even in the micro-states.
In the case of light quanta it seems artificial to distinguish between 
particles; on the other hand, the calculation of probability as carried 
out in classical statistics seemed reasonable and natural, because we 
are accustomed to the concept of individual atoms. The phenomena 
of the diffraction of atoms (p. 695) compel us to give up the idea of 
individually distinguishing the atoms or molecules of a given substance 
also. The way in which this alters the computation of probabilities 
will be illustrated by a very simple example. Suppose we wish to dis- 
tribute three atoms among three cells. A macro-state of classical 
statistics is determined by giving the number of atoms in the first, 
second and third cells. We may summarize the various distributions
as follows: 


* S. N. Bose, Zeitschr. f. Phys. 26, p. 178 (1924).
† A. Einstein, Berl. Ber. p. 261 (1924); ibid. p. 18 (1925).
Group                  |  I   I   I  |  II  II  II  II  II  II  | III
Cell 1                 |  3   0   0  |   2   0   1   2   1   0  |  1
Cell 2                 |  0   3   0  |   1   2   2   0   0   1  |  1
Cell 3                 |  0   0   3  |   0   1   0   1   2   2  |  1
Boltzmann probability  |  1   1   1  |   3   3   3   3   3   3  |  6


As long as we look upon the atoms as distinguishable one from 
another, the distributions of the first group can be realized in but one 
way, the second in three ways, and the third in six ways — for a dis- 
tribution in which Atom 1 is in Cell 1 and Atom 2 is in Cell 2 is taken 
to be different from a distribution in which Atom 2 is in Cell 1 and 
Atom 1 is in Cell 2. In Bose’s statistics no such distinction is made, 
so that each of these distributions has the same probability, and, 
according to Bose, a distribution of this kind represents only a
micro-state. What, then, is a macro-state in the Bose statistics? A
macro-state may be represented by various micro-states, and the number of
these gives its probability. It is natural to discard the distinction
between the cells and to characterize the macro-state by specifying
how many cells are occupied by 0, by 1, by 2, &c. atoms. Hence in
the Bose statistics the macro-states are the groups bearing Roman
numerals in the above tabulation. Thus State I has the probability 3,
State II the probability 6, State III the probability 1. It is to be
noticed, however, that we are permitted to give up the distinction 
only between cells which are physically equivalent, e.g. those of equal 
energy. For instance, if we wish to compute the velocity distribution 
of the molecules of an ideal gas according to the new method, the zone 
of the phase space — to which a given interval of molecular energy 
corresponds— already contains so many cells that we may perform 
the above probability computation within it. The general formula
for the probability of a macro-state defined in such a way that of a
total of Z cells $Z_0$ are occupied by 0 particles, $Z_1$ by 1, ..., $Z_n$ by n
particles, is obtained from the following scheme by permuting the
numbered cells instead of the numbered particles as heretofore:

Schematic Representation of a Micro-state in the Bose Statistics

Cell number:   1  2  3  4  5  6
Occupied by:   0  1  0  2  2  3  particles

Macro-state: $Z_0 = 2$, $Z_1 = 1$, $Z_2 = 2$, $Z_3 = 1$, $Z_n = 0$ for $n > 3$

Should it happen that two cells are occupied by the same number
of particles, the interchange of their numbers will yield no new
complexion, and hence the probability becomes

$$W = \frac{Z!}{Z_0!\,Z_1!\,Z_2!\cdots Z_n!}, \tag{1}$$

where $\sum_n Z_n$ is equal to Z, the total number of cells, and $\sum_n nZ_n$ is equal
to N, the number of particles.
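
The two ways of counting can be reproduced by brute force for the example of three atoms in three cells. The sketch below is illustrative only; the function name `bose_weight` and the representation of a macro-state as a sorted tuple of occupation numbers are choices made here, not notation from the text. The weight computed is $W = Z!/(Z_0!\,Z_1!\cdots)$, the number of distinct permutations of the cell occupations.

```python
from collections import Counter
from itertools import product
from math import factorial

ATOMS, CELLS = 3, 3

# Classical complexions: every assignment of numbered atoms to numbered cells.
assignments = list(product(range(CELLS), repeat=ATOMS))

# Boltzmann: number of complexions for each occupation pattern (n1, n2, n3).
boltzmann = Counter(
    tuple(a.count(cell) for cell in range(CELLS)) for a in assignments
)

# Bose: each occupation pattern is a single micro-state; the macro-state only
# records how many cells hold 0, 1, 2, ... atoms, so patterns are sorted.
bose = Counter(tuple(sorted(pattern, reverse=True)) for pattern in boltzmann)

def bose_weight(occupation):
    """W = Z! / (Z_0! Z_1! ...) for a tuple of cell occupation numbers."""
    weight = factorial(len(occupation))
    for cells_with_same_n in Counter(occupation).values():
        weight //= factorial(cells_with_same_n)
    return weight

for pattern, ways in sorted(boltzmann.items(), reverse=True):
    print("Boltzmann", pattern, ways)
for state, count in sorted(bose.items(), reverse=True):
    print("Bose", state, count, bose_weight(state))
```

The direct enumeration and the closed formula agree for Groups I, II and III, and the same function applied to the six-cell scheme with occupations 0, 1, 0, 2, 2, 3 gives 6!/(2! 1! 2! 1!) = 180.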

Let us apply the method to the calculation of the entropy and
distribution of velocities in an ideal monatomic gas. Since each
molecule has three position and three momentum co-ordinates, the size
of each cell of phase space must be taken to be $h^3$, as in classical
statistics. Since the energy does not depend on the position, we can split
up the phase space into the product of momentum space by position
space, just as we did on p. 590, i.e. we take the element of volume of
momentum space, which alone concerns us in what follows, equal to

$$\Delta\tau_p = \frac{h^3}{V}, \tag{2}$$


where V is the volume available to the molecules.* The energy of
a molecule is given by

$$u = \frac{1}{2m}\left(p_x^2 + p_y^2 + p_z^2\right) = \frac{p^2}{2m}. \tag{3}$$


Hence the value of the energy is constant over a sphere in the
momentum space. If we divide this space into spherical shells of thickness
$\Delta p$, there will be a large number of elementary cells of size $h^3/V$ in
each spherical shell; the Bose statistics can be applied to the
distribution in the several shells, since the cells of each spherical layer
possess equal energy. Distinguishing the shells by the subscript s,
the probability of a state in which $Z_{ns}$ cells of the sth shell each
contain n molecules is given by

$$W_s = \frac{Z_s!}{\prod_n Z_{ns}!}. \tag{4}$$


According to the Theory of Probability we obtain the complete prob- 
ability of any distribution by taking the product of the separate 


* Since $\Delta\tau = \Delta x\,\Delta y\,\Delta z\,\Delta p_x\,\Delta p_y\,\Delta p_z$, where $\Delta\tau$ is the volume element of phase space,
it may seem at first sight that we have to set $\Delta\tau_p = h^3/\Delta V$, but [...]
(cf. p. 690), according to [...]
we can take the entire volume as the range of the position co-ordinates.
probabilities. Assume the Boltzmann equation to hold for this
quantity; using Stirling's formula $\log Z! \approx Z\log Z - Z$,

$$
\begin{aligned}
S &= k\log W = k\log\prod_s W_s \\
  &= k\sum_s \log Z_s! - k\sum_s\sum_n \log Z_{ns}! \\
  &= k\sum_s Z_s\log Z_s - k\sum_s Z_s - k\sum_s\sum_n Z_{ns}\log Z_{ns} + k\sum_s\sum_n Z_{ns},
\end{aligned}
$$

and since $\sum_n Z_{ns} = Z_s$,

$$S = k\sum_s Z_s\log Z_s - k\sum_s\sum_n Z_{ns}\log Z_{ns}. \tag{5}$$


For the condition of equilibrium, W must again be a maximum, with
the auxiliary conditions

$$\sum_n Z_{ns} = Z_s = \text{const.}, \tag{6}$$

for the total number of cells in each spherical layer is, of course, fixed
by the geometry alone. Equation (6) then amounts to a number of
auxiliary conditions equal to the number of shells. Further, we must have

$$\sum_s\sum_n nZ_{ns} = N, \tag{7}$$

where N is the total number of molecules, and

$$\sum_s\sum_n nu_sZ_{ns} = U, \tag{8}$$


where U is the total energy and $u_s$ is the energy of a molecule whose
representative point lies in the sth shell. We again determine the
extremal value of the entropy under the given auxiliary conditions by
applying Lagrange’s method of multipliers. In taking the variation
of the number of particles in each cell it must be remembered that
the total number of cells in each layer is fixed, i.e. the numerator of
the fraction in equation (4) is a constant. Then, from (4) and (5),

$$\sum_s\sum_n \delta Z_{ns}\left(\log Z_{ns} + 1\right) = 0. \tag{9}$$

The variation of the auxiliary conditions gives

$$\sum_n \delta Z_{ns} = 0 \tag{10}$$

for each shell; moreover, for a constant total number of particles and
constant intrinsic energy,

$$\sum_s\sum_n n\,\delta Z_{ns} = 0 \tag{11}$$

and

$$\sum_s\sum_n nu_s\,\delta Z_{ns} = \delta U = 0. \tag{12}$$
We now multiply each of the equations of (10) by a coefficient which
we take to be $-(\log a_s + 1)$, and multiply (11) by $\beta$ and (12) by $\gamma$.
Adding these to the main equation (9) and setting the coefficients of
$\delta Z_{ns}$ equal to zero yields

$$\log Z_{ns} - \log a_s + n\beta + nu_s\gamma = 0,$$

or

$$Z_{ns} = a_s e^{-n(\beta + \gamma u_s)}. \tag{13}$$

The multipliers $a_s$ may be determined by introducing the number of
cells in each shell:

$$Z_s = \sum_n Z_{ns} = \sum_n a_s e^{-n(\beta + \gamma u_s)} = \frac{a_s}{1 - e^{-(\beta + \gamma u_s)}}. \tag{14}$$

The last form is the sum of the geometric progression. Then

$$a_s = Z_s\left(1 - e^{-(\beta + \gamma u_s)}\right). \tag{14'}$$

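For the number of particles in each shell, which is the goal of our
computation, we have

$$
\begin{aligned}
N_s = \sum_n nZ_{ns} &= a_s\sum_n n\,e^{-n(\beta + \gamma u_s)}
 = -a_s\frac{\partial}{\partial(\beta + \gamma u_s)}\sum_n e^{-n(\beta + \gamma u_s)} \\
 &= -a_s\frac{\partial}{\partial(\beta + \gamma u_s)}\left(1 - e^{-(\beta + \gamma u_s)}\right)^{-1}
 = \frac{a_s e^{-(\beta + \gamma u_s)}}{\left\{1 - e^{-(\beta + \gamma u_s)}\right\}^2}.
\end{aligned}
\tag{15}
$$

Inserting the value of $a_s$, this becomes

$$N_s = \frac{Z_s}{e^{\beta + \gamma u_s} - 1}. \tag{16}$$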
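
The summations leading to this result can be verified numerically. The sketch below, with arbitrary illustrative values of the multipliers, sums $Z_{ns} = a_s e^{-n(\beta+\gamma u_s)}$ term by term with $a_s = Z_s(1 - e^{-(\beta+\gamma u_s)})$ and compares the particle number with the closed form $Z_s/(e^{\beta+\gamma u_s}-1)$. The function names and the truncation of the series at 2000 terms are assumptions of the sketch.

```python
import math

def shell_sums(z_s, beta, gamma, u_s, n_max=2000):
    """Sum the occupation series term by term (truncated at n_max terms)."""
    q = math.exp(-(beta + gamma * u_s))
    a_s = z_s * (1.0 - q)                       # multiplier fixed by the cell count
    cells = sum(a_s * q**n for n in range(n_max))
    particles = sum(n * a_s * q**n for n in range(n_max))
    return cells, particles

def bose_occupation(z_s, beta, gamma, u_s):
    """Closed form N_s = Z_s / (e^(beta + gamma u_s) - 1)."""
    return z_s / math.expm1(beta + gamma * u_s)

cells, particles = shell_sums(1000.0, 0.5, 1.0, 0.3)
print(cells, particles, bose_occupation(1000.0, 0.5, 1.0, 0.3))
```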


It remains only to determine the multipliers $\beta$ and $\gamma$. Since $\gamma$ is
multiplied by the energy of the cells, the total energy is a function of $\gamma$ and,
reciprocally, $\gamma$ is a function of the total energy U. Inserting (16)
in the entropy equation and taking the derivative $(\partial S/\partial U)_V = 1/T$,
we obtain, exactly as on p. 588, the value 1/kT for $\gamma$.* In order to
determine $\beta$ we must turn to equation (7), which gives the total number
of particles. The summation or integration, however, involves
computational difficulties, so that we shall leave $\beta$ undetermined. Setting
$e^\beta = B$, we have the distribution formula

$$N_s = \frac{Z_s}{Be^{u_s/kT} - 1}. \tag{17}$$


For an ideal gas, the layers within which the energy may be
considered constant are the spherical shells included between the radii
$p_s$ and $p_s + dp_s$. The equations connecting energy and momentum are

$$u_s = \frac{1}{2m}p_s^2, \quad\text{i.e.}\quad p_s = \sqrt{2mu_s}, \quad dp_s = \sqrt{\frac{m}{2u_s}}\,du_s. \tag{18}$$

* The corresponding multiplier was there denoted by $\beta$.
Hence a shell contains

$$dZ_s = \frac{4\pi V\sqrt{2m^3u_s}\,du_s}{h^3} \tag{19}$$

cells of volume $h^3/V$, and, omitting the subscript s, the number of
particles in the separate shells becomes

$$dN = \frac{4\pi V\sqrt{2m^3u}\,du}{h^3\left(Be^{u/kT} - 1\right)}. \tag{20}$$

Finally, expressing u in terms of the velocity c by means of
$u = \tfrac{1}{2}mc^2$, we have the distribution formula

$$dN = \frac{4\pi Vm^3c^2\,dc}{h^3\left(Be^{mc^2/2kT} - 1\right)}. \tag{20'}$$

If $B \gg 1$, we obtain

$$dN = \frac{4\pi Vm^3c^2e^{-mc^2/2kT}}{h^3B}\,dc, \tag{21}$$

while the classical statistics gave (p. 592)

$$dN = \frac{4N}{\sqrt{\pi}}\left(\frac{m}{2kT}\right)^{3/2}c^2e^{-mc^2/2kT}\,dc. \tag{22}$$

Thus when B is large, the Bose-Einstein statistics also leads to the
Maxwell distribution. Equating the two expressions determines B:

$$B = \frac{V(2\pi mkT)^{3/2}}{Nh^3}. \tag{23}$$

We are justified in equating the two expressions for the reason that
integration over all velocities must yield the total number N of
molecules in each case. The preceding equation shows that B is inversely
proportional to N, so that our assumption of $B \gg 1$ has greater
validity (and hence the Bose distribution agrees more nearly with
the Maxwell form) when N is small, i.e. when the gas is at low
pressure. Actually, deviations are first to be expected at those densities
and those low temperatures at which all gases begin to exhibit strong
van der Waals departures from the ideal gas law. Up to the present
time it has not been found possible to separate the two kinds of
deviations to the extent that one can decide with certainty whether the
gas obeys the Bose statistics or that of Boltzmann.
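
To see how large B is for an ordinary gas, one may evaluate $B = V(2\pi mkT)^{3/2}/Nh^3$ directly. The sketch below does so for helium at 0° C. and one atmosphere; the choice of gas, the use of the ideal gas law for N/V and the modern CGS constants are all assumptions made for illustration.

```python
import math

# Assumed modern CGS values (not taken from the text).
H = 6.626e-27      # Planck constant, erg s
K = 1.381e-16      # Boltzmann constant, erg/deg
AMU = 1.6605e-24   # atomic mass unit, g

def degeneracy_constant(number_density, mass, temperature):
    """B = (2 pi m k T)^(3/2) / (n h^3), with n = N/V in cm^-3."""
    return (2.0 * math.pi * mass * K * temperature) ** 1.5 / (number_density * H**3)

# Illustrative case (assumed): helium at 0 deg C and one atmosphere.
T = 273.15
n_helium = 1.01325e6 / (K * T)                  # ideal-gas density p/kT, cm^-3
B_helium = degeneracy_constant(n_helium, 4.0026 * AMU, T)
print(f"n = {n_helium:.3e} cm^-3,  B = {B_helium:.2e}")
```

Under these assumptions B is of the order of $10^5$, so that the term 1 in the denominator of the distribution function is entirely negligible.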

3. The Fermi-Dirac Statistics and its Application to Electrical Conductivity.

Although the Bose statistics has not yet become of much practical
importance on account of the difficulty of testing it experimentally, it
is, nevertheless, of great service in deriving a new form of statistics given
by Fermi and independently by Dirac.* This method appears to be
the appropriate means of treating the electron gas in metals. In the 
theory of the structure of the atom it is shown that a given com- 
bination of quantum numbers can appear but once in each atom. 
This law, which is of fundamental importance, is known as Pauli’s 
Principle (cf. p. 679). Otherwise stated, this law says that a cell of 
phase space can be occupied by only one representative point. Fermi 
applied this principle to the totality of particles in a gas. Thus the 
numbers $Z_{ns}$ vanish for all values of n except 0 and 1. If the
computation is performed with this added hypothesis, all calculations for
determining the most probable state remain unaltered. One must
remember only that the summations for n are to be extended only
from 0 to 1. The actual summations are not performed until the
multipliers $a_s$ are determined. At this point the method departs from
that of the Bose statistics. We have, at first, as for the Bose statistics,


$$Z_{ns} = a_s e^{-n(\beta + \gamma u_s)}. \tag{24}$$

To determine the $a_s$ we sum over all values of n; in this case the sum
reduces to

$$Z_s = a_s\left(1 + e^{-(\beta + \gamma u_s)}\right). \tag{25}$$

The number of particles in the shell of subscript s becomes

$$N_s = 1\cdot Z_{1s} = a_s e^{-(\beta + \gamma u_s)} = \frac{Z_s}{e^{\beta + \gamma u_s} + 1}. \tag{26}$$

As in § 2 (p. 630), $\gamma$ is found to be 1/kT, so that, with $e^\beta = B$,

$$N_s = \frac{Z_s}{Be^{u_s/kT} + 1}. \tag{27}$$


This result has a remarkable similarity to the Bose distribution func- 
tion — the difference is only in the sign of the 1 in the denominator. 
This shows that the Fermi distribution, too, approaches the Maxwell- 
Boltzmann for large values of B, i.e. for low densities. Hence for 
ordinary gases we can no more expect to test the validity of the Fermi 
statistics than we could that of the Bose method. But the situation 
is entirely different for an electron gas of the type we are forced to 
assume exists within a metallic conductor (cf. p. 444 et seq.). If we
assume that the number of conduction electrons is of the order of
magnitude of the number of atoms, the density of the electron gas
becomes so large that B is very small, especially since the small mass
of the electron operates, according to equation (23), to affect the
result in the same way. At ordinary temperatures, assuming the Fermi

* E. Fermi, Atti Lincei (6), 3, p. 145 (1926); Zeitschr. f. Physik, 36, p. 902 (1926).
P. A. M. Dirac, Proc. Roy. Soc., 112A, p. 661 (1926).




statistics to be valid, the electron gas is completely degenerate, i.e. it
obeys a totally different distribution law from that for an ideal gas. 
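
The behaviour of the three distribution laws for large and for small B can be compared directly. The following sketch evaluates the mean occupation per cell, $N_s/Z_s$, as a function of $x = u/kT$ for the Maxwell, Bose and Fermi forms. The two values of B used, $10^5$ for an ordinary gas and $10^{-4}$ for a dense electron gas, are illustrative orders of magnitude assumed here, not figures from the text.

```python
import math

def maxwell(x, B):
    """Classical occupation per cell, e^-x / B, with x = u/kT."""
    return math.exp(-x) / B

def bose(x, B):
    """Bose occupation per cell, 1 / (B e^x - 1); requires B e^x > 1."""
    return 1.0 / (B * math.exp(x) - 1.0)

def fermi(x, B):
    """Fermi occupation per cell, 1 / (B e^x + 1)."""
    return 1.0 / (B * math.exp(x) + 1.0)

x = 1.0
# Large B (assumed order of magnitude for an ordinary gas): all three agree.
print(maxwell(x, 1e5), bose(x, 1e5), fermi(x, 1e5))
# Small B (assumed order of magnitude for a dense electron gas):
# the classical form would place thousands of points in one cell,
# whereas the Fermi form never exceeds one.
print(maxwell(x, 1e-4), fermi(x, 1e-4))
```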

We now wish to use the Fermi statistics to derive the most impor- 
tant property of the electron gas. First, we must double the number 
of cells belonging to a given energy range, as compared with the number 
for an arbitrary gas. The reason is this: it will be shown in Part VII 
that it is necessary to ascribe to the electron a magnetic moment of 
one Bohr magneton. According to the rules of the Quantum Theory, 
the axis of this moment can set itself either in the same or the opposite 
sense to a given field direction. For each of these positions we have the 
number deduced above for the number of cells of the translational 
momentum space. If we now pass to a very weak field, the magnetic 
contribution to the energy vanishes and the two orientations become
equal in energy, i.e. the existence of the magnetic moment of the
electron doubles the number of cells in each shell. We thus have

$$dZ_s = \frac{8\pi V\sqrt{2m^3u_s}\,du_s}{h^3}. \tag{28}$$

[Fig. 1]

Inserting this value in equation (27) and omitting the subscript s, we
obtain the number of electrons whose energy lies between u and u + du:

$$dN = \frac{8\pi V\sqrt{2m^3}}{h^3}\,\frac{u^{1/2}\,du}{Be^{u/kT} + 1}. \tag{29}$$

We must now seek to express the constant B in terms of N by
integrating over all values of the energy:

$$N = \frac{8\pi V\sqrt{2m^3}}{h^3}\int_0^\infty \frac{u^{1/2}\,du}{Be^{u/kT} + 1}. \tag{30}$$

Unfortunately the integral cannot be evaluated in closed form. We
make a somewhat crude approximation: if we
draw the curve of the integrand function it is seen to be a parabola as
long as $Be^{u/kT}$ can be neglected in comparison with 1 (cf. fig. 1), but as
soon as the exponential becomes comparable with unity, the actual curve
descends and finally approaches the axis of u asymptotically. If we
erect a vertical line through the point $u_0$, which is determined by

$$Be^{u_0/kT} = 1 \quad\text{or}\quad u_0 = -kT\log B,$$

it is evident that the two shaded areas are approximately equal. We
may therefore replace our integral approximately by the area under
the parabola between 0 and $u_0$. In order to avoid difficulties with the
sign, introduce the constant 1/A in place of B, i.e.

$$-\log B = +\log A.$$

Then the area becomes

$$\int_0^\infty \frac{u^{1/2}\,du}{\frac{1}{A}e^{u/kT} + 1} \approx \int_0^{kT\log A} u^{1/2}\,du = \tfrac{2}{3}\left(kT\log A\right)^{3/2}.$$

Inserting this in (30) and solving for log A, we obtain

$$\log A = \frac{h^2}{2mkT}\left(\frac{3N}{8\pi V}\right)^{2/3}, \tag{33}$$
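
The quality of this crude approximation can be tested numerically. In units of kT, with $x = u/kT$, the integral is $\int_0^\infty x^{1/2}\,dx/(e^{x-\log A}+1)$ and the area under the parabola is $\tfrac{2}{3}(\log A)^{3/2}$. The sketch below compares the two by Simpson's rule; the quadrature, the cut-off 60 units above log A and the sample values of log A are assumptions made for illustration.

```python
import math

def fermi_integral(log_a, n=20000):
    """Simpson estimate of the integral of sqrt(x)/(e^(x - log_a) + 1) from 0 upward."""
    upper = log_a + 60.0            # beyond this the integrand is negligible
    h = upper / n                   # n must be even for Simpson's rule

    def f(x):
        return math.sqrt(x) / (math.exp(x - log_a) + 1.0)

    total = f(0.0) + f(upper)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0

def parabola_area(log_a):
    """Area under the parabola sqrt(x) between 0 and log_a."""
    return (2.0 / 3.0) * log_a ** 1.5

for log_a in (5.0, 50.0):
    ratio = fermi_integral(log_a) / parabola_area(log_a)
    print(f"log A = {log_a:5.1f}   exact / approximate = {ratio:.5f}")
```

The ratio approaches unity as log A grows, which is the regime of strong degeneracy considered in the text; for log A of only a few units the approximation is in error by several per cent.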


and hence the distribution function becomes

$$dN = \frac{8\pi V\sqrt{2m^3}}{h^3}\,\frac{u^{1/2}\,du}{e^{u/kT - (h^2/2mkT)(3N/8\pi V)^{2/3}} + 1}. \tag{34}$$

The result becomes especially vivid if, instead of giving the number
of electrons whose energy is between u and u + du and which are
in the volume V, we specify their density in the momentum space.
Since the shells have each a volume

$$d\tau = 4\pi p^2\,dp = 4\pi\sqrt{2m^3u}\,du,$$

the density of points in the momentum space becomes

$$\rho = \frac{dN}{d\tau} = \frac{2V}{h^3}\,\frac{1}{e^{u/kT - (h^2/2mkT)(3N/8\pi V)^{2/3}} + 1}. \tag{35}$$


For energy values less than

$$\frac{h^2}{2m}\left(\frac{3N}{8\pi V}\right)^{2/3}$$

the denominator has the value 1 at T = 0; for energies greater than
this value, it is infinite. Thus at T = 0 the density of the representative
points as a function of the corresponding energy is constant up to a
certain point and then drops suddenly to zero. This result can be
understood on the basis of the Pauli Principle. Since each cell may
contain but one electron, all energy levels must be occupied, from the
lowest upward, until all electrons are provided for. Raising the
temperature then has merely the effect of rounding the edges, as one may
verify by inserting numerical values of T which are greater than zero.
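
The suggested numerical verification can be carried out as follows. The sketch computes the limiting energy $(h^2/2m)(3N/8\pi V)^{2/3}$ and the relative density of points $1/(e^{(u-u_0)/kT}+1)$ on either side of it. The electron density of $8.5\times10^{22}$ per cm³, roughly one conduction electron per atom of copper, and the modern CGS constants are assumptions introduced for illustration.

```python
import math

# Assumed modern CGS values (not taken from the text).
H = 6.626e-27          # Planck constant, erg s
K = 1.381e-16          # Boltzmann constant, erg/deg
M_E = 9.109e-28        # electron mass, g
ERG_PER_EV = 1.602e-12

def limiting_energy(n):
    """u_0 = (h^2 / 2m) (3n / 8 pi)^(2/3), with n = N/V in cm^-3."""
    return H**2 / (2.0 * M_E) * (3.0 * n / (8.0 * math.pi)) ** (2.0 / 3.0)

def occupation(u, u0, T):
    """Relative density of points, 1 / (e^((u - u0)/kT) + 1)."""
    if T == 0.0:
        return 1.0 if u < u0 else 0.0
    x = (u - u0) / (K * T)
    if x > 700.0:                   # avoid floating-point overflow in exp
        return 0.0
    return 1.0 / (math.exp(x) + 1.0)

n_cu = 8.5e22                       # assumed electron density, cm^-3
u0 = limiting_energy(n_cu)
print(f"u0 = {u0 / ERG_PER_EV:.2f} eV")
for T in (0.0, 300.0, 3000.0):
    print(T, [round(occupation(f * u0, u0, T), 4) for f in (0.9, 1.0, 1.1)])
```

With these assumed figures the limiting energy is about 7 electron-volts, so that even at 3000° the rounding of the edge affects only a narrow band of energies near $u_0$.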

In order to compute the energy of the electron gas we must evaluate
the integral

$$U = \int_0^\infty u\,dN. \tag{36}$$

In the same way as before we obtain as a first approximation

$$U = \frac{8\pi V\sqrt{2m^3}}{h^3}\int_0^\infty \frac{u^{3/2}\,du}{\frac{1}{A}e^{u/kT} + 1} \approx \frac{8\pi V\sqrt{2m^3}}{h^3}\int_0^{kT\log A} u^{3/2}\,du = \frac{16\pi V\sqrt{2m^3}}{5h^3}\left(kT\log A\right)^{5/2}. \tag{37}$$

Inserting the value of log A, we obtain, finally,

$$U = \frac{16\pi V\sqrt{2m^3}\,h^5k^{5/2}T^{5/2}}{5h^3(2m)^{5/2}k^{5/2}T^{5/2}}\left(\frac{3N}{8\pi V}\right)^{5/3} = \frac{4\pi Vh^2}{5m}\left(\frac{3N}{8\pi V}\right)^{5/3}. \tag{38}$$
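
The final expression is equivalent to three-fifths of the limiting energy $u_0 = (h^2/2m)(3N/8\pi V)^{2/3}$ per electron, as the sketch below confirms numerically. It also compares this zero-point energy with the classical value $\tfrac{3}{2}kT$ at room temperature. The electron density (about one conduction electron per copper atom) and the modern CGS constants are assumptions made for illustration.

```python
import math

# Assumed modern CGS values (not taken from the text).
H = 6.626e-27          # Planck constant, erg s
K = 1.381e-16          # Boltzmann constant, erg/deg
M_E = 9.109e-28        # electron mass, g
ERG_PER_EV = 1.602e-12

def limiting_energy(n):
    """u_0 = (h^2 / 2m) (3n / 8 pi)^(2/3), with n = N/V in cm^-3."""
    return H**2 / (2.0 * M_E) * (3.0 * n / (8.0 * math.pi)) ** (2.0 / 3.0)

def zero_point_energy_density(n):
    """U/V = (4 pi h^2 / 5m) (3n / 8 pi)^(5/3), the first approximation."""
    return 4.0 * math.pi * H**2 / (5.0 * M_E) * (3.0 * n / (8.0 * math.pi)) ** (5.0 / 3.0)

n_cu = 8.5e22                                   # assumed electron density, cm^-3
mean_ev = zero_point_energy_density(n_cu) / n_cu / ERG_PER_EV
classical_ev = 1.5 * K * 300.0 / ERG_PER_EV     # equipartition value at 300 deg
print(f"mean energy per electron = {mean_ev:.2f} eV")
print(f"classical 3kT/2 at 300 deg = {classical_ev:.4f} eV")
```

Since the first approximation contains no T at all, its derivative with respect to T, and with it the electronic specific heat, vanishes identically.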


Thus to a first approximation the energy content of the electron gas 
is independent of the temperature; i.e. the energy found above is 
already present at the absolute zero. But since the specific heat is 
obtained by differentiating the energy with respect to T , it follows 
that the specific heat of the electron gas vanishes, to a first approxi- 
mation. Thus the greatest obstacle in the electron theory of metallic 
conduction is eliminated, for the classical statistics leads to an increase 
of the specific heat of the solid metal from 6R/2 to 9R/2 per mol,
assuming the same density of electrons as here. This increase is not 
observed. On the other hand, the Fermi-Dirac statistics leads to the 
observed absence of a contribution to the specific heat. It is a matter 
for a more complete calculation to compute the electrical and thermal 
conductivities of metals based on the new distribution. This has been 
done by Sommerfeld and his co-workers in a series of papers, but the 
details cannot be given here.* A further advance was made by taking
into account the interaction with the lattice ions by the introduction
of a spatially periodic potential. This computation must be based on
quantum mechanics and so it is described later (Chap. XLI, p. 707 et seq.). 

We thus arrive at a semi-quantitative understanding of all electrical
properties of metals save one, whose explanation is lacking [...].
It consists in the fact that at a few degrees above absolute
zero the metal [...] (columns [...] IVa, Va, VIa of the periodic table,
p. 676) [...] lose their electrical resistance. The transition
[...] of the material. Connected with the
disappearance of resistivity is the disappearance of the magnetic permeability.

* Consult A. Sommerfeld, Zeitschr. f. Phys., 47, 1-2, pp. 1, 43 (1928).
4. Entropy Constant and Chemical Constant of an Ideal Monatomic Gas.

The thermodynamics of the vapour pressure curve and of chemical 
equilibrium peremptorily demands that the absolute value of the 
entropy should be determined (cf. p. 553). Since the Quantum Theory 
leads to a determination of the size of the element of phase space, it 
is to be expected that the absolute value of the entropy may be cal- 
culated readily, for the size of the element enters into this quantity 
(cf. p. 563). However, a difficulty exists in respect that the classical 
quantum statistics yields a quantum condition only for periodic 
motions. Now the translational motion of the atoms of an ideal gas 
(we restrict the present discussion to ideal monatomic gases) is entirely 
non-periodic ; if we wish to apply the Quantum Theory we must arti- 
ficially introduce a periodicity, say by treating the to and fro motion 
between the walls as a periodic motion. Moreover, in order to arrive 
at a reasonable result with such considerations, we find ourselves facing 
an inconsistency with the Boltzmann statistics; here we do not dif- 
ferentiate between two states resulting from the interchange of two 
atoms, while in Boltzmann’s method two such complexions are 
taken to constitute two different micro-states. For this reason we 
refrained from deriving the chemical constant on the basis of the 
classical quantum statistics (Boltzmann statistics augmented by the 
quantum condition), since this actually uses the principal feature of 
both the Bose and Fermi methods, viz. the absence of any distinction 
between like atoms, which obtains even in the micro-state. The ab- 
solute value of the entropy follows in a simple manner from the postu- 
lates of the new statistics. The Bose and the Fermi statistics both 
lead to the same result, since according to equation (23) (p. 631), the 
constant B for ordinary gases is large compared with unity, and so 
the “ one ” in the denominator of each distribution function may 
be dropped. This makes the two forms identical. 
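
The statement that the two forms coincide for large B can be checked numerically. The following Python sketch is only an illustration: it assumes the mean occupation per cell in the forms 1/(B e^x − 1) and 1/(B e^x + 1), with x = u/kT, and compares both with the limiting form e^(−x)/B. The value B = 10^5 and the function names are arbitrary choices made here, not quantities taken from the text.

```python
import math


def bose(x, B):
    """Mean occupation per cell, Bose form 1 / (B e^x - 1), with x = u/kT."""
    return 1.0 / (B * math.exp(x) - 1.0)


def fermi(x, B):
    """Mean occupation per cell, Fermi form 1 / (B e^x + 1), with x = u/kT."""
    return 1.0 / (B * math.exp(x) + 1.0)


def boltzmann(x, B):
    """Limiting form e^(-x) / B obtained by dropping the 'one'."""
    return math.exp(-x) / B


B = 1.0e5  # assumed illustrative value, large compared with unity
for x in (0.0, 1.0, 3.0):
    # The three columns agree to about one part in B.
    print(x, bose(x, B), fermi(x, B), boltzmann(x, B))
```

The relative difference between either quantum form and the limiting form is of order 1/B, which is why the distinction disappears for ordinary gases.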

In order to derive the entropy constant we start from the formulas 
of §2 (p. 629): 

$$S = k\sum_s Z_s \log Z_s - k\sum_s\sum_n Z_{ns}\log Z_{ns}, \qquad (5)$$

with the auxiliary conditions 

$$\sum_n Z_{ns} = Z_s, \qquad (6)$$

$$\sum_s\sum_n n Z_{ns} = N = \sum_s N_s, \qquad (7)$$

$$\sum_s\sum_n n u_s Z_{ns} = U = \sum_s N_s u_s. \qquad (8)$$

There we found the most probable distribution to be 

$$Z_{ns} = a_s e^{-n(\beta + u_s/kT)}, \qquad (13)$$

$$Z_s = a_s\left(1 - e^{-(\beta + u_s/kT)}\right)^{-1}, \qquad (14)$$

$$N_s = Z_s\left(e^{\beta + u_s/kT} - 1\right)^{-1}. \qquad (16)$$

Under the assumption that $B = e^{\beta} \gg 1$, i.e. $1/B = e^{-\beta} \ll 1$, we obtain 

$$\log Z_s = \log a_s + e^{-(\beta + u_s/kT)} = \log a_s + \frac{1}{B}e^{-u_s/kT}, \qquad (39)$$

$$N_s = Z_s e^{-(\beta + u_s/kT)}. \qquad (40)$$

Substituting the values from (39) and (13) in the logarithms of equation 
(5), we obtain 

$$S = k\sum_s Z_s\log a_s + k\sum_s Z_s e^{-\beta}e^{-u_s/kT} - k\sum_s\sum_n Z_{ns}\log a_s + k\sum_s\sum_n Z_{ns}\,n\left(\beta + \frac{u_s}{kT}\right). \qquad (41)$$

By (6) the first and third terms cancel; by (40), (7) and (8) the 
remaining terms give 

$$S = kN + k\beta N + \frac{U}{T}. \qquad (42)$$

Taking one mol, i.e. L atoms, and putting u = 3kT/2, we obtain 

$$S = R + \beta R + \tfrac{3}{2}R = \beta R + \tfrac{5}{2}R. \qquad (43)$$

Substituting the value of $\beta$ from (23) (p. 631), we obtain 

$$\frac{S}{R} = \log v + \tfrac{3}{2}\log T + \tfrac{5}{2} + \log\frac{(2\pi mk)^{3/2}}{Lh^3}. \qquad (44)$$

If S is assumed to be a function of T and p, we have, with the help of 
the equation of state pv = RT, 

$$\frac{S}{R} = \tfrac{5}{2}\log T - \log p + \tfrac{5}{2} + \log\frac{(2\pi m)^{3/2}k^{5/2}}{h^3}. \qquad (45)$$

Since $c_p = 5R/2$, the chemical constant of an ideal monatomic 
gas becomes 

$$i = \log\frac{(2\pi m)^{3/2}k^{5/2}}{h^3}. \qquad (46)$$

Inserting the numerical values of the atomic quantities and intro- 
ducing the logarithms of the factors needed for the conversion to 
practical units (p in atm.), we obtain the formula quoted on p. 538: 

$$i = -1.59 + \tfrac{3}{2}\log A. \qquad (47)$$
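
The numerical constant in (47) can be reproduced with a few lines of Python. This is a sketch under stated assumptions: it takes the chemical constant in the form log10[(2πm)^(3/2) k^(5/2) / h^3] in c.g.s. units, converts the pressure unit to atmospheres by subtracting log10 of one atmosphere in dyn/cm^2, and treats A as the atomic weight. The constants are modern values supplied here, and the function name is hypothetical.

```python
import math

# Modern values in c.g.s. units (assumptions of this sketch, not from the text).
K_BOLTZMANN = 1.380649e-16         # erg / K
H_PLANCK = 6.62607015e-27          # erg s
ATOMIC_MASS_UNIT = 1.66053907e-24  # g, mass of an atom of unit atomic weight
ONE_ATM = 1.01325e6                # dyn / cm^2


def chemical_constant(atomic_weight):
    """Chemical constant i (decadic logarithm, p in atm.) of a monatomic gas."""
    m = atomic_weight * ATOMIC_MASS_UNIT
    i_cgs = math.log10((2.0 * math.pi * m) ** 1.5 * K_BOLTZMANN ** 2.5
                       / H_PLANCK ** 3)
    # Converting p from dyn/cm^2 to atm. shifts the constant by log10(1 atm).
    return i_cgs - math.log10(ONE_ATM)


print(round(chemical_constant(1.0), 2))  # constant term of (47)
print(round(chemical_constant(4.0), 2))  # helium, A taken as 4
```

For unit atomic weight the function returns the additive constant of (47), and the dependence on A enters only through the term (3/2) log A.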

Ex. 131. Thermal Ionization. Calculate the chemical constant of the electron 
gas in the non-degenerate (rarefied) state and use the result to compute the 
thermal equilibrium between atoms, ions and electrons in a hot gas in terms of 
the work of ionization of the atom. 



PART VII 


The Structure of Atoms and Molecules 
and the Theory of Spectra 




CHAPTER XXXIX 


The Mechanics of Simple Atomic Models 

1. Investigation of the Structure of the Atom; the Scattering of 
α-Particles. 

The many phenomena in which electrons are released from neutral 
atoms or atomic aggregates — e.g. the photoelectric effect and the 
phenomena of electrical discharge in gases — suggested from the very 
first that the atom is a structure made up of electrons, the charge of 
which is compensated by an equal number of positive elementary 
charges. The question immediately arises of how the two kinds of 
charge are arranged within the atom, and where the inertial mass is 
located. This question was attacked experimentally by probing or 
sounding the atom by means of projectiles of subatomic size. These 
were sent through the atoms, and the arrangement of the charges was 
deduced from the alterations in the paths of the particles. Lenard, 
who first undertook experiments of this kind, used electrons as pro- 
jectiles. He recognized the fact that the volume occupied by an atom 
in gas-kinetic collisions or in liquids and solids is merely the effect of 
electrical fields of force due to charges within the atom, while the 
intrinsic volume of the parts of the atom is vanishingly small. Ruther- 
ford and his associates obtained particularly simple conditions by 
using α-particles expelled from radioactive substances instead of 
electrons. The former carry a positive charge of two elementary 
units and, like the electrons, are of exceedingly small size, but have a 
mass almost 7400 times as great, the same as that of a helium atom. 
For this reason, the deflection of the path of an α-particle requires a 
centre having a large inertial mass, inasmuch as the conservation of 
momentum requires that the motion of the centre of gravity of the two 
bodies be unchanged by the collision; with an electron as partner the 
centre of gravity is practically coincident with the α-particle. Hence 
the use of α-particles in this way is able to furnish information 
concerning the location of the mass of the atom, so far as it is 
connected with electric charges. C. T. R. Wilson* found that if ions 
are present in supersaturated dust-free air they act as centres of 
condensation for water droplets. This is the principle of Wilson's 
so-called “cloud chamber”. If the ions are produced by 
the passage of α-particles through the air, the path of the particle is 
outlined as a “fog track”. Photographs of such tracks show that a 
deflection through a large angle occurs suddenly, rather than by the 
cumulative effect of many smaller deflections. This further simplifies 
the theory of the effect to the extent that we may limit our calculations 
to simple scattering and omit from consideration the rare case of a 
single particle experiencing multiple deflections. 

* C. T. R. Wilson, Phil. Trans., A, 189, p. 266 (1897). 

Consider now what happens to a single particle. We shall ascribe 
to the scattering centre a mass so great that we may consider it at 
rest during the scattering process. Since the central mass can in no 
instance be greater than that of the atom, this holds only for scattering 
by atoms of heavy elements. Actually, the first experiments were 
made with such elements. Assume the scattering centre to have Z 
elementary charges. Then the force acting on the α-particle is 
$2Ze^2/r^2$. Since this law of force is the same as that of gravitation, so 
far as the dependence on r is concerned, we can apply the results of 
p. 91 at once. The path is thus a conic section with the scattering 
centre at one focus. This leaves the question of the sign of the central 
charge open. The hyperbolic orbits in either case differ only to the 
extent that the central body occupies the focus on the concave side 
when the force is attractive, and is at the focal point on the convex 
side when the force is one of repulsion. In harmony with other experi- 
ments, in which the positive charge is always associated with masses 
which are integral multiples of that of a hydrogen atom, we assume 
the central charge to be positive. Comparing the potential energy of 
gravitation $-\gamma mM/r$ with the electrostatic energy $+2Ze^2/r$, we see 
that $\gamma M$ must be replaced by $-2Ze^2/m$ in the formulae of p. 90, where 
m is the mass of the moving α-particle. In order to avoid confusing it 
with the mass of the electron, we give it the subscript α in what fol- 
lows. According to equation (21) (p. 91), the equation of the orbit 
becomes 

$$r = -\frac{2c^2 m_\alpha / Ze^2}{1 + \sqrt{1 + \dfrac{v_0^2 c^2 m_\alpha^2}{Z^2 e^4}}\,\cos\varphi}, \qquad (1)$$

where c is the areal velocity and $v_0$ the velocity of the particle at 
infinite distance. The negative sign signifies that if the 
positive value of the radical is taken, the branch of the hyperbola 
directed away from the centre is the one described by the particle. 
If, for example, we take $\varphi = 0$, the corresponding value of r in fig. 1 
is not to be drawn to the right, but to the left. At the beginning of 
the motion the particle is at an infinite distance, i.e. 1/r is zero, and 
its velocity is $v_0$. If the initial direction, which is represented by the 
asymptote of the hyperbola, is a distance p from the centre, the areal 
velocity at the beginning is $v_0 p/2$, since the curve and the asymptote 
are practically coincident at great distances from the centre. This 
areal velocity is represented by the shaded triangle in fig. 1. Because 
of the constancy of the areal velocity we may put this value in equation 
(1) in place of c, thus obtaining 

$$r = -\frac{v_0^2 p^2 m_\alpha / 2Ze^2}{1 + \sqrt{1 + \dfrac{v_0^4 p^2 m_\alpha^2}{4Z^2 e^4}}\,\cos\varphi}. \qquad (2)$$

The numerical eccentricity becomes 

$$\epsilon = \sqrt{1 + \frac{v_0^4 p^2 m_\alpha^2}{4Z^2 e^4}}. \qquad (3)$$

Denote the acute angle between the initial direction of the particle 
and the positive x-axis by $\gamma$. Then the angle of deviation $\vartheta$, i.e. 
the angle between the initial and the final direction of the particle, 
is $\vartheta = \pi - 2\gamma$. From the geometry of the hyperbola,* 

$$\cos(\pi - \gamma) = -\frac{1}{\epsilon}, \qquad \tan\frac{\vartheta}{2} = \cot\gamma = \frac{1}{\sqrt{\epsilon^2 - 1}}, \qquad (4)$$

so that half the angle of deviation is given by 

$$\tan\frac{\vartheta}{2} = \frac{1}{\sqrt{\dfrac{v_0^4 p^2 m_\alpha^2}{4Z^2e^4}}} = \frac{2Ze^2}{v_0^2\, p\, m_\alpha}. \qquad (5)$$

* Or, putting $r = \infty$, from the polar equation $r = P/(1 + \epsilon\cos\varphi)$, 
so that $\cos\varphi = -1/\epsilon$. 


This simple formula gives the angle of deviation as a function of the 
parameter p; it is impossible, however, to test the validity of the 
equation experimentally, since we cannot directly determine the dis- 
tance of the initial direction of a given particle from an atomic nucleus. 
The result of the computation can be tested only by statistical methods, 
in which the theory of probability is applied to the distribution of the 
various parameters in order to determine that of the angle of de- 
flection. In fact, this marks a weakness in logic of the entire atomic 
mechanics, as based on models: the quantities necessary to determine 
the course of a single process cannot be determined exactly;* then, 
if we pass to a statistical distribution, the quantity p, which cannot be 
observed, again drops out of the calculations. Thus the elementary 
process is used only as an intermediate concept. The direct procedure 
would be to assume that a pencil of α-rays passes into the field of 
atomic nuclei and to inquire into the distribution of the rays after 
passing through this region. This is, in fact, the way in which such 
problems are attacked by the newer quantum mechanics to be dis- 
cussed in the following chapter; there an attempt is made to operate 
only with directly observable quantities. For the moment, however, 
we shall continue to discuss the atomic models, and shall now compute 
the angular distribution of scattered particles by means of such con- 
cepts. 

* Later we shall find that these quantities are inherently indeterminable, quite 
apart from the insufficiency of our methods of attack. Cf. p. 690. 

If the layer of atoms which is penetrated has a thickness s, and if 
there are N atomic centres per unit volume, then a hollow cylinder of 
radii p and p + Δp whose axis is in the direction of the incident rays 
contains, on the average, 

$$\Delta N = 2\pi p\, s N \Delta p \qquad (6)$$

nuclei. For the small values of p which are considered, this is a very 
small number, e.g. 0.001. The introduction of an average value is 
justified by the fact that the primary pencil has a finite cross-section, 
so that the layer is penetrated at various points, which is equivalent 
to using an infinitely narrow pencil of rays and moving the bom- 
barded layer of atoms to and fro within the same limits of variability. 
Assume that each particle is scattered once; then, of n primary par- 
ticles, a number 

$$\Delta n = n \cdot 2\pi p\, s N \Delta p \qquad (7)$$

will suffer “impacts” with a nucleus lying between p and p + Δp. 
Now by (5), p is connected with the angle of scattering, $\vartheta$, by the 
formula 

$$p = \frac{2e^2 Z}{v_0^2 m_\alpha}\cot\frac{\vartheta}{2}, \qquad |\Delta p| = \frac{e^2 Z\,\Delta\vartheta}{v_0^2 m_\alpha \sin^2(\vartheta/2)}, \qquad (8)$$

so that the fractional number Δn/n of α-particles scattered between the 
angles $\vartheta$ and $\vartheta + \Delta\vartheta$ is 

$$\frac{\Delta n}{n} = \frac{4\pi s N e^4 Z^2}{v_0^4 m_\alpha^2}\,\frac{\cos(\vartheta/2)}{\sin^3(\vartheta/2)}\,\Delta\vartheta. \qquad (9)$$

Formula (9) becomes invalid when $\vartheta = 0$, for then it yields Δn/n = ∞. 
This would mean that practically all α-particles go through without 
being scattered. The discrepancy is explained by the fact that we are 
not permitted to take an infinitely large upper limit for p but must 
use the finite value given by the mean distance apart of the atoms of 
the scattering material. 
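
The relations between the parameter p, the angle of deviation and the scattered fraction can be tried out numerically. The Python sketch below is an illustration under assumptions: it uses the single-scattering relations tan(ϑ/2) = 2Ze²/(m v0² p) and Δn/n = 4π s N e⁴ Z² cos(ϑ/2) Δϑ / (v0⁴ m² sin³(ϑ/2)) in c.g.s. units, and the numerical inputs (a gold foil with Z = 79, v0 = 1.6 × 10^9 cm./sec., thickness 4 × 10^-5 cm., 5.9 × 10^22 atoms per cm^3) are example values chosen here, not data from the text. All function names are hypothetical.

```python
import math

# Modern c.g.s. values (assumptions of this sketch).
E_CHARGE = 4.803e-10  # elementary charge, e.s.u.
M_ALPHA = 6.645e-24   # mass of the alpha-particle, g


def impact_parameter(theta, Z, v0):
    """p = (2 Z e^2 / (m v0^2)) cot(theta/2)."""
    return 2.0 * Z * E_CHARGE ** 2 / (M_ALPHA * v0 ** 2) / math.tan(theta / 2.0)


def deflection_angle(p, Z, v0):
    """theta from tan(theta/2) = 2 Z e^2 / (m v0^2 p)."""
    return 2.0 * math.atan(2.0 * Z * E_CHARGE ** 2 / (M_ALPHA * v0 ** 2 * p))


def scattered_fraction_per_radian(theta, Z, v0, s, N):
    """(1/n) dn/dtheta for single scattering in a layer of thickness s."""
    half = theta / 2.0
    prefactor = 4.0 * math.pi * s * N * E_CHARGE ** 4 * Z ** 2
    return prefactor / (v0 ** 4 * M_ALPHA ** 2) * math.cos(half) / math.sin(half) ** 3


# Assumed example values: gold foil, alpha-particles from a radioactive source.
Z_GOLD, V0, THICKNESS, DENSITY = 79, 1.6e9, 4.0e-5, 5.9e22

p90 = impact_parameter(math.pi / 2.0, Z_GOLD, V0)
print("p for a 90 degree deflection (cm):", p90)
print("fraction per radian at 90 degrees:",
      scattered_fraction_per_radian(math.pi / 2.0, Z_GOLD, V0, THICKNESS, DENSITY))
```

With these inputs the parameter p belonging to a right-angle deflection comes out at a few times 10^-12 cm., far below gas-kinetic atomic radii.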


2. Conclusions from the Experiments on the Scattering of α-Particles. 

The scattering formula (9) is well confirmed for the heavier elements 
as scattering centres and for finite angles of deflection. The experi- 
mental verification of the scattering formula justifies, for one thing, 
our hypothesis of electrostatic forces between the particles and the 
nuclei and confirms the assumption of simple scattering. Further, a 
comparison of the scattering powers of various substances shows that 
N may be set equal to the number of atoms per unit volume and that 
Z coincides with the place of the element in the periodic system. We 
shall discuss this important result in greater detail below. 

Deflections of 90° are sometimes observed. If we use equation (9) 
to determine the value of p corresponding to α-particle velocities of 
the order of $10^{9}$ cm./sec. and deflections of 90°, we obtain a value of 
the order of $10^{-12}$ cm. This defines the radius of the sphere within which 
the mass of the atom is concentrated, and the Coulomb law of 
[...] be maintained by the electrons surrounding the nucleus; their elec- 
trical fields of force prevent the closer approach of other atoms. 

However, the most important result of the scattering experiments 
is the recognition of the fact that the nuclear charge Z consists of a 
number of positive elementary charges equal to the position number of 
the element in the periodic table. Formerly, the elements were arranged 
in the periodic system in the order of atomic weight. At certain places 
this sequence had to be violated in order to satisfy the sequential 
arrangement according to certain chemical properties. We now re- 
cognize that the nuclear charge is the fundamental parameter, and 
this makes the order correct at the places mentioned. Since the atom 
as a whole is normally electrically neutral, the number of electrons 
outside the nucleus must also be equal to Z. The position number Z 
of the element was called the atomic number by Moseley (cf. p. 655). 
We thus have the important relation: 


Nuclear charge = Number of electrons outside the nucleus 

= Atomic number of the element. 

Hydrogen, of atomic number 1, has a nucleus with one elementary 
charge whose mass is that of a hydrogen atom (less the mass of the 
electron). $M_H$ is $1.66 \times 10^{-24}$ gm. The hydrogen nucleus is to be 
looked upon as the atom of positive electricity, which differs from its 
counterpart — the electron — in having a much larger mass. The 
nucleus of the hydrogen atom is called a proton. Further, since helium 
has the atomic number 2, and since the α-particle has a resultant charge 
of +2, and a mass equal to that of the helium atom, we recognize the 
α-particle to be a helium nucleus. One might conclude that the nuclei 
of the heavier elements consist of protons only, but there are two 
serious objections to this idea: since the mass of each proton is that 
of a hydrogen atom (the mass of the electron may be neglected in 
comparison), the atomic number would have to be equal to the atomic 
weight, which is obviously not the case. Prior to the discovery of the 
neutron (p. 750) it was assumed that electrons were present in the 
nucleus itself, thus serving to reduce its net charge. Supporting this 
view was the fact that in many radioactive changes, which are cer- 
tainly nuclear processes, electrons are emitted in the form of β-rays. 
However, the assumption that electrons exist in the nucleus led to 
such great difficulties that it had to be abandoned, and now we explain 
the difference between atomic weight and nuclear charge in terms of 
neutrons present in the nucleus. 

The second objection — that if the elements are built up of protons 
and electrons all atomic weights, referred to hydrogen, must be whole 
numbers (which is evidently not so)— was removed by the discovery 
of isotopes. As shown especially by Aston in his analysis of canal rays 
with the mass spectrograph (p. 441), the chemical elements are largely 
mixtures of atoms of different atomic weights, but having the same 
nuclear charge and hence the same number of outer electrons. Apart 
from small deviations, the several atomic weights are integral multiples 
not of that of hydrogen itself, but of a weight about 0.8 per cent smaller, 
i.e. 1/16 the atomic weight of oxygen. The small departures from the 
integral multiples are to be interpreted on the basis of the principle of 
the inertia of energy as differences in the energy of formation of the 
nuclei. The large value of these energy changes (obtained by multi- 
plying the difference in mass by the square of the velocity of light) 
corresponds with the order of magnitude of the energy released in 
radioactive processes. 


3. The Bohr Model of the Hydrogen Atom. 

Now that we have before us a general rough picture of the atom 
in the form of a heavy nucleus about which electrons cluster, we 
naturally desire more definite information concerning the arrange- 
ment of these electrons. The first question which arises pertains to 
the nature of the force which balances the electrostatic attraction of 
the nucleus, thus preventing the electrons from falling into it. Similar 
situations in celestial mechanics suggest that the opposing force is 
centrifugal force. We are thus led to assume that the electrons execute 
rapid motions of revolution about the nucleus. This idea was followed 
out mathematically by Bohr,* first for the simplest case of the hydro- 
gen atom, and with the simplest form of electron orbit— the circle. 
There is one great difficulty. The revolving electron, together with 
the positive nucleus, represents a miniature antenna in which two 
vibrations which differ in phase by π/2 are taking place. According 
to the laws of electrodynamics, an oscillator of this kind must radiate 
energy in the form of electromagnetic waves. Such an energy loss 
can occur only at the expense of the potential energy, i.e. the orbits 
must decrease in size, so that the electrons traverse spiral orbits which 
ultimately lead into the nucleus. This cataclysm must take place 
within time intervals of the order of 1<H sec., so that stable atoms of 
this kind are out of the question. But the validity of the electro- 
dynamic laws has been demonstrated experimentally only for macro- 
scopic antennae; it may be that other laws obtain for the individually 
unobservable atomic radiation processes, which are limited only by 
the requirement that the average effect be in harmony with the macro- 
scopic experimental results. Bohr then made a postulate to the 
effect that of the continuous sequence of mechanically possible orbits 
only a discrete set is capable of existence. These orbits are assumed 
to have the property that no radiation takes place when the electron 
is in one of them. The criterion for such an orbit is taken to be the 
condition that the phase integral (p. 130) is an integral multiple of 
Planck's quantum of action: 

$$\oint p\,dq = nh. \qquad (10)$$

* N. Bohr, Phil. Mag., 26, p. 1 (1913). 

The second postulate is that the emission or absorption of radiation 
(light) takes place according as the atom passes from a higher to a 
lower energy state or vice versa. At the same time a quantitative re- 
lation found experimentally to hold for all elementary energy trans- 
formations, e.g. the photoelectric effect, must be satisfied: the energy 
difference of the two states must be equal to the quantum hv of the 
emitted or absorbed radiation. No statement is made regarding the 
actual mechanism of the process of radiation or absorption. Also, 
there is apparently no connexion between the frequency of the radia- 
tion and the frequency of revolution of the electron in its orbit; we 
shall find, however, in § 6 (p. 655) that there is an asymptotic con- 
nexion between the two. 

Let us now apply the two postulates to the hydrogen atom, whose 
electron revolves about the nucleus in a circular orbit of radius a. 
The kinetic energy T amounts to 

$$T = \tfrac{1}{2}ma^2\omega^2 = \tfrac{1}{2}ma^2\dot\varphi^2, \qquad (11)$$

so that the momentum co-ordinate is 

$$p_\varphi = ma^2\dot\varphi = ma^2\omega. \qquad (12)$$

The phase integral is to be extended over an entire circuit of the 
position co-ordinate, in this case from 0 to 2π: 

$$\oint p\,dq = \int_0^{2\pi} ma^2\omega\,d\varphi = 2\pi ma^2\omega. \qquad (13)$$

We then have the quantum condition 

$$2\pi ma^2\omega = nh. \qquad (14)$$

In addition, there is the equation of mechanical equilibrium 

$$\frac{e^2}{a^2} = ma\omega^2. \qquad (15)$$

Solving equations (14) and (15) for a and ω, we obtain 

$$a_n = \frac{n^2h^2}{4\pi^2 me^2} \qquad (16)$$

and 

$$\omega_n = \frac{8\pi^3 me^4}{n^3h^3}. \qquad (17)$$

Inserting numerical values for e, h and m, we obtain for n = 1 the 
value 

$$a_1 = 0.528 \times 10^{-8}\ \text{cm.}, \qquad (18)$$


i.e. a value which corresponds very well with the gas-kinetic atomic 
radii. The fact that a purely chance agreement between these quan- 
tities is highly improbable lends plausibility to this model of the 
atom. The values of ω correspond to the frequencies of ultra-violet 
light; although the radiated frequencies have no immediate connexion 
with ω, it seems significant that they are of the same order of mag- 
nitude. In order to compute the actual radiated frequencies on the 
basis of the second Bohr postulate, we must write down the energy in 
the nth quantum state: 


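$$E_n = T + U = \tfrac{1}{2}ma_n^2\omega_n^2 - \frac{e^2}{a_n} = \frac{2\pi^2 me^4}{n^2h^2} - \frac{4\pi^2 me^4}{n^2h^2} = -\frac{2\pi^2 me^4}{n^2h^2}. \qquad (19)$$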

The negative sign arises as a result of taking the zero point of the 
potential energy at infinity. The potential energy, which is exactly 
twice the kinetic energy, is negative in the finite region, since work is 
obtained when the distance between the nucleus and the electron 
decreases from infinity. The magnitude of the energy with a positive 
sign signifies the work which must be done in order to remove the 
electron from its orbit to infinity, i.e. to detach it from the atom. 
This work is usually expressed in “electron volts”, i.e. the difference 
of potential (in volts) through which an electron must fall in order to 
obtain this amount of kinetic energy. The ionization potential of an 
atom is given by the energy difference, expressed in electron 
volts, between the normal state and the state of complete removal. The 
former is the state of lowest energy, the one having the lowest quantum 
number; for hydrogen this is n = 1. The excitation potential of a given 
state is the difference in energy between this and the “ground” state. 

According to the second postulate, the frequencies of the spectral 
lines emitted by the hydrogen atom are given by 

$$\nu = \frac{E_{n_i} - E_{n_f}}{h} = \frac{2\pi^2 me^4}{h^3}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right). \qquad (20)$$

For $n_f = 2$, $n_i = 3, 4, 5, \ldots$ the lines lie in the visible region; they 
form the Balmer series. The lines crowd closer together as the 
“current number” $n_i$ increases, and at the series limit, given by 
$n_i = \infty$, i.e. vanishing 
variable term, there is an accumulation point of spectral lines, in the 
mathematical sense. Actually the intensity of the lines decreases as 
the series limit is approached. The series of lines belonging to 
$n_f = 1$, $n_i = 2, 3, 4, \ldots$ lies in the far ultra-violet region, and was 
not found until later by Lyman. The lines corresponding to $n_f = 3$, 
$n_i = 4, 5, \ldots$ and to $n_f = 4$, $n_i = 5, 6, 7, \ldots$ are in the infra-red, 
and are named after their discoverers the Paschen Series and the 
Brackett Series respectively. 

The origin of the individual lines is best visualized with the aid 
of an “energy level diagram” as in fig. 2, in which the various energy 
levels which can be assumed by the atom according to the first 
Bohr postulate are represented by properly spaced horizontal levels. 
The transitions are represented by arrows in the direction of tran- 
sition. The central numbers are wave-lengths in Angstroms. 

An outstanding success of the theory is to be seen in the fact that 
it not only yields the form of the law obeyed by the hydrogen lines, 
but also gives the correct numerical value of the constant of the 
formula, when the numerical values of e, m and h are inserted. It is 
true that the accuracy to which e, m and h are known is far below 
that available in spectroscopic measurements, so that once we have 
attained confidence in the theory, we shall rather use the constant to 
determine e, m and h. 

It is customary in spectroscopy to use the reciprocal of the wave- 
length, 1/λ, rather than the actual frequency, c/λ. These so-called 
wave numbers are also designated by ν. In order to avoid confusion, 
we write ν with a bar over it to denote wave numbers. If the Balmer 
formula is written in these units, 

$$\bar\nu = R\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right), \qquad (21)$$

the constant R is called the Rydberg Number. By equation (20), this 
constant is given by 

$$R = \frac{2\pi^2 e^4 m}{ch^3}\ \text{cm.}^{-1} \qquad (22)$$

and its numerical value is 

$$R = 109737\ \text{cm.}^{-1}.$$
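
The numerical statements of this section can be checked from the formulae for the circular orbits. The Python sketch below is illustrative only: it evaluates a_n = n²h²/(4π²me²), ω_n = 8π³me⁴/(n³h³), E_n = −2π²me⁴/(n²h²) and R = 2π²e⁴m/(ch³) in c.g.s. units, using modern values of e, m, h and c supplied here (they are not the values available when the text was written), and the function names are hypothetical.

```python
import math

# Modern c.g.s. values (assumptions of this sketch).
E_CHARGE = 4.803204e-10    # e.s.u.
M_ELECTRON = 9.1093837e-28  # g
H_PLANCK = 6.62607015e-27  # erg s
C_LIGHT = 2.99792458e10    # cm / s
ERG_PER_EV = 1.602177e-12


def bohr_radius(n=1):
    """Radius of the n-th circular orbit, in cm."""
    return n ** 2 * H_PLANCK ** 2 / (4.0 * math.pi ** 2 * M_ELECTRON * E_CHARGE ** 2)


def orbital_angular_frequency(n=1):
    """Angular frequency of revolution in the n-th orbit, in 1/s."""
    return 8.0 * math.pi ** 3 * M_ELECTRON * E_CHARGE ** 4 / (n ** 3 * H_PLANCK ** 3)


def energy_erg(n=1):
    """Total energy T + U of the n-th orbit, in erg (negative)."""
    return -2.0 * math.pi ** 2 * M_ELECTRON * E_CHARGE ** 4 / (n ** 2 * H_PLANCK ** 2)


def rydberg_infinite_mass():
    """Rydberg number for a nucleus at rest, in 1/cm."""
    return 2.0 * math.pi ** 2 * E_CHARGE ** 4 * M_ELECTRON / (C_LIGHT * H_PLANCK ** 3)


def wavelength_angstrom(n_f, n_i):
    """Wave-length of the line n_i -> n_f in Angstroms, nucleus taken at rest."""
    wave_number = rydberg_infinite_mass() * (1.0 / n_f ** 2 - 1.0 / n_i ** 2)
    return 1.0e8 / wave_number


print("a_1 (cm):", bohr_radius(1))
print("ionization energy (eV):", -energy_erg(1) / ERG_PER_EV)
print("R (1/cm):", rydberg_infinite_mass())
print("first Balmer lines (A):", [round(wavelength_angstrom(2, n)) for n in (3, 4, 5)])
```

The printed radius and Rydberg number agree with the values quoted above to within the accuracy of the constants used, and the first Balmer wave-lengths fall in the visible region.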


4. Consideration of the Motion of the Nucleus. The Spectrum of Hydro- 
gen and the Spectrum of Ionized Helium. 

Even if we neglect the relativistic variability of the electron mass, 
the simple formulae derived in § 3 (p. 649) are in need of refinement, 
since we assumed the nucleus to be at rest. This is true only for in- 
finitely heavy nuclei; in reality, the motion takes place about the 
common centre of gravity of nucleus and electron, and in computing 
the energy we must take into account the kinetic energy of the motion 
of the nucleus also. Denoting the mass of the nucleus by M, its dis- 
tance from the centre of gravity by A , and the distance of the electron 
from this point by a, we have 

$$MA = ma. \qquad (23)$$

The condition of mechanical equilibrium for the nucleus is 

$$\frac{e^2}{(a+A)^2} = MA\omega^2, \qquad (24)$$

and for the electron, 

$$\frac{e^2}{(a+A)^2} = ma\omega^2. \qquad (25)$$

We now express both a and A in terms of the distance a + A 
between nucleus and electron. From equation (23) it follows that 

$$a + A = a\left(1 + \frac{m}{M}\right), \quad\text{i.e.}\quad a = \frac{a+A}{1 + \dfrac{m}{M}} \quad\text{and}\quad A = \frac{a+A}{1 + \dfrac{M}{m}}.$$

We define a “reduced” mass μ by means of the equation 

$$\mu = \frac{m}{1 + \dfrac{m}{M}}, \qquad (26)$$

$$\frac{1}{\mu} = \frac{1}{m} + \frac{1}{M}. \qquad (27)$$

Then 

$$a = (a+A)\frac{\mu}{m}, \qquad (28)$$

and the condition of equilibrium for the electron becomes 

$$\frac{e^2}{(a+A)^2} = \mu(a+A)\omega^2. \qquad (29)$$

The total energy becomes 

$$T + U = \frac{\omega^2}{2}\left(MA^2 + ma^2\right) - \frac{e^2}{a+A} = \frac{\mu\omega^2}{2}(a+A)^2 - \frac{e^2}{a+A}. \qquad (30)$$


Comparing these formulae with the corresponding ones of § 3, we see 
that they become identical if we replace the electron mass m by the 
reduced mass μ and if we understand $a_n$ in § 3 to mean the distance 
between the nucleus and the electron. Then the Rydberg constant 
for hydrogen becomes 

$$R_H = \frac{2\pi^2 e^4 \mu_H}{ch^3} = 109677.69\ \text{cm.}^{-1}, \qquad \left(\frac{1}{\mu_H} = \frac{1}{m} + \frac{1}{M_H}\right). \qquad (31)$$

For an element with infinite nuclear mass we have the former value 
of this constant: 

$$R = R_\infty = \frac{2\pi^2 e^4 m}{ch^3}.$$

For a one-electron system whose nucleus has the mass of the helium 
nucleus, we have 

$$R_{He} = \frac{2\pi^2 e^4 \mu_{He}}{ch^3}.$$


The spectrum of this system is found in nature. If one of the two 
electrons of the helium atom is removed by violent excitation we have 
such a system ; electrically this structure differs from a hydrogen atom 
only in having twice the nuclear charge. This means that the factor 
2 must be supplied to $e^2$ in the equation of mechanical equilibrium. In 
general, for a nucleus having a charge Z, the factor Z must be inserted. 
The formula for the energy levels contains $e^4$, so that the factor 4 (in 
general, $Z^2$) must be supplied; the energy levels of the singly-ionized 
helium atom (He⁺) become:* 

$$E_n = \frac{4 \times 2\pi^2 e^4 \mu_{He}}{n^2h^2} = \frac{4R_{He}\,ch}{n^2}. \qquad (32)$$

Thus the wave-lengths of the lines which correspond to the hydrogen 
lines are one-fourth those of the hydrogen lines, so that the He⁺ spec- 
trum corresponding to the Lyman and Balmer series lies in the far 
ultra-violet. For $n_f = 4$, however, the lines fall in the visible region. 
Writing the formula for this series in the form 

$$\bar\nu = 4R_{He}\left(\frac{1}{4^2} - \frac{1}{n_i^2}\right) = R_{He}\left(\frac{1}{2^2} - \frac{1}{(n_i/2)^2}\right), \qquad (33)$$

we recognize that alternate lines must coincide approximately with 
hydrogen lines. This is the so-called Pickering series, which has been 
known for a long time in stellar spectroscopy. It was formerly attri- 
buted to hydrogen, inasmuch as so close a connexion between the 
spectra of two different elements was looked upon as quite improbable 
before the advent of the Bohr theory. 

* The negative sign is omitted before $E_n$, since it is customary in spectroscopy to 
reckon the term values as representing work done in removing an electron. 

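The difference between the H and He⁺ wave-lengths is obtained 
by replacing $M_{He}$ approximately by $4M_H$: 

$$\frac{R_H}{R_{He}} = \frac{\mu_H}{\mu_{He}} = \frac{M_H(m + M_{He})}{M_{He}(m + M_H)} = \frac{1 + \dfrac{m}{4M_H}}{1 + \dfrac{m}{M_H}} \approx 1 - \frac{3}{4}\frac{m}{M_H}. \qquad (34)$$
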
This shows that the wave-lengths of the hydrogen lines are 0.04 per 
cent greater than those of the neighbouring He⁺ lines. Reversing the 
procedure, we can use the very accurately measurable difference in 
wave-length to determine e/m spectroscopically, for we have 

$$\frac{m}{M_H} = \frac{R_{He} - R_H}{R_H - \tfrac{1}{4}R_{He}}, \quad\text{i.e.}\quad \frac{e}{m} = \frac{e}{M_H}\,\frac{R_H - \tfrac{1}{4}R_{He}}{R_{He} - R_H}. \qquad (35)$$


In the exact computation we must introduce, besides the precise 
atomic weight ratios, a series of corrections; e.g. the difference be- 
tween atomic mass and nuclear mass must be taken into consideration. 
Equation (31) is of importance in connexion with the discovery of 
deuterium, the hydrogen isotope of mass 2, whose nucleus is called a 
deuteron. Setting the doubled value in place of $M_H$ in this equation 
yields the positions of the deuterium spectrum lines in excellent agree- 
ment with experiment. 
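
The effect of the nuclear motion on the Rydberg number is easily evaluated. The following Python sketch assumes R = R∞/(1 + m/M), which follows from replacing m by the reduced mass, and uses modern values of R∞ and of the nuclear masses in units of the electron mass; these numbers and all names in the code are supplied here for illustration and do not come from the text.

```python
# Modern values (assumptions of this sketch).
R_INFINITY = 109737.316      # 1/cm, Rydberg number for a nucleus at rest
PROTON_MASS = 1836.15267     # nuclear masses in units of the electron mass
DEUTERON_MASS = 3670.48297
HELIUM_NUCLEUS_MASS = 7294.29954


def rydberg(nuclear_mass):
    """Rydberg number for a nucleus of the given mass (electron masses)."""
    return R_INFINITY / (1.0 + 1.0 / nuclear_mass)


R_H = rydberg(PROTON_MASS)
R_D = rydberg(DEUTERON_MASS)
R_HE = rydberg(HELIUM_NUCLEUS_MASS)


def balmer_wave_number(n_i):
    """Hydrogen line n_i -> 2, in 1/cm."""
    return R_H * (1.0 / 4.0 - 1.0 / n_i ** 2)


def pickering_wave_number(n_i):
    """He+ line n_i -> 4, in 1/cm; even n_i fall close to Balmer lines."""
    return 4.0 * R_HE * (1.0 / 16.0 - 1.0 / n_i ** 2)


print("R_H, R_D, R_He (1/cm):", R_H, R_D, R_HE)
# Relative excess of a hydrogen wave-length over the neighbouring He+ line.
print("H vs He+ wave-length difference:", pickering_wave_number(6) / balmer_wave_number(3) - 1.0)
# Relative shift between corresponding hydrogen and deuterium lines.
print("H vs D wave-length difference:", R_D / R_H - 1.0)
```

The hydrogen value comes out close to the figure quoted in (31), the H and He⁺ wave-lengths differ by about 0.04 per cent, and the deuterium lines are displaced by somewhat less than 0.03 per cent.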

5. X-ray Spectra. Moseley’s Law. 

An atom of atomic number Z which has lost all but one electron 
will emit radiations given by 

$$\bar\nu = Z^2 R\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right) \qquad (36)$$

if we neglect the motion of the nucleus, which, moreover, is of less im- 
portance as the atomic number increases. On account of the factor $Z^2$, 
the lines of the heavier atoms lie in the X-ray region. Highly ionized 
atoms of this kind are hardly obtainable 
experimentally; nevertheless, there are X-ray lines which, in part, 
conform to the above law. In order to understand their origin we must 
examine in more detail our ideas concerning the structure of the heavier 
atoms. We have every reason to believe that even when such an 
atom is in its normal state all electrons are not in the lowest quantum 
state, corresponding to n = 1. Rather, there are groups of electrons 
having the same quantum number n which we may provisionally 
imagine to be arranged in various shells of radii $a_1, a_2, \ldots, a_n$. These 
shells have come to be denoted by the letters K, L, M, N, ..., corre- 
sponding to the quantum numbers 1, 2, 3, 4, .... Now the X-ray spectra 

differ experimentally from the visible and ultra-violet spectra in that 
absorption lines do not occur. We observe merely continuous regions 
of absorption having abrupt terminal edges at the long wave-length 
end. This phenomenon can be explained as follows: in the case of 
visible spectra, the outermost electron is brought to a higher orbit 
in the process of absorption. If, however, one of the inner electrons — 
say a K electron — is to be raised to a higher level, we find that all 
the places in the L, M, ... shells of a normal atom are occupied, so 
that this electron can find room only at the periphery of the atom. 
But the work necessary to bring it to that point is practically the 
same as that required for complete detachment, assuming the electron 
to possess no kinetic energy after removal. The long wave absorption 
limit, then, corresponds to the work done in detaching the inner elec- 
tron, and the continuous spectrum which follows it corresponds to 
transitions in which the electron possesses varying amounts of kinetic 
energy after leaving the atom. Since the energy of translation of a 
free electron is not of a periodic nature it is not subject to a quantum 
condition of any kind, and so the energy levels form a continuous 
sequence. A similar continuous spectrum is found at the series limit 
of visible spectra, both in absorption and in emission. These continua 
originate in a similar way. 

If, now, an inner electron is detached from a heavy atom by radia- 
tion or by electron impact, its place can be filled by an electron from 
another shell. This results in radiation. Neglecting all interactions 
between the electrons, we would find that the X-ray emission lines 
originating in this way follow the hydrogen formula. In any actual 
case it must be remembered that according to the laws of electro- 
statics the electrons in the inner shells screen the nuclear charge from 
the surroundings, so that we must introduce a screening constant s_n 
for each shell in such a way that the energy of an n-quantum electron 
is given by 

$$E_n = -(Z - s_n)^2\,\frac{Rhc}{n^2}. \qquad (37)$$

The fact that s_n is in large measure independent of Z is a justification 
of this somewhat crudely constructed theory. We can set s = 1 for 
the K and L shells; then for the line corresponding to a transition 
from the L to the K shell we have 

$$\nu = (Z-1)^2 R\left(\frac{1}{1^2} - \frac{1}{2^2}\right) = \tfrac{3}{4}\,(Z-1)^2 R. \qquad (38)$$

This is the celebrated Moseley Law for the K_α line.* This line 
is found in the X-ray spectra of all elements from Z = 4 to Z = 92. 
The atomic number of any element can be determined at once with 
the help of Moseley’s Law. However, the detailed theory of the X-ray 
terms is very complicated, and by no means yet complete. We have 
given here only the broadest outlines of the theory. 
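
As a rough numerical illustration of Moseley's Law, the following sketch evaluates the K_α wavenumber as (3/4) R (Z − 1)² and inverts the relation to recover Z from a measured line. It assumes the screening constant s = 1 used above and a rounded modern value of the Rydberg constant; the function names are illustrative and the results are only as good as this crude screening model.

```python
# Moseley's law for the K-alpha line: wavenumber = (3/4) * R * (Z - 1)^2.
# Assumptions: screening constant s = 1; R is the Rydberg constant for
# infinite nuclear mass (rounded modern value, not taken from the text).
import math

R_INF = 109737.316   # cm^-1


def k_alpha_wavenumber(Z):
    """K-alpha wavenumber in cm^-1 from the screened hydrogen formula."""
    return 0.75 * R_INF * (Z - 1) ** 2


def k_alpha_wavelength_angstrom(Z):
    """K-alpha wavelength in angstroms."""
    return 1.0e8 / k_alpha_wavenumber(Z)


def atomic_number_from_wavenumber(wavenumber):
    """Invert Moseley's law: Z = 1 + sqrt(wavenumber / (3R/4))."""
    return 1.0 + math.sqrt(wavenumber / (0.75 * R_INF))


for name, Z in [("Cu", 29), ("Mo", 42), ("Ag", 47)]:
    lam = k_alpha_wavelength_angstrom(Z)
    print(f"{name} (Z = {Z}): K-alpha wavelength about {lam:.3f} A")
```

For copper the estimate comes out within about one per cent of the measured K_α wave-length, which is as much as can be expected of a single screening constant.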

6. The Correspondence Principle. 

Equation (17) (p. 648) states that the orbital frequency of the 
electron in a hydrogen atom is given by 

$$\frac{\omega}{2\pi} = \frac{4\pi^2 m e^4}{h^3 n^3}. \qquad (17)$$

According to the classical connexion between radiation and electrical 
oscillations this is the same as the radiated frequency. On the other 
hand, the Quantum Theory gives the expression 

$$\nu_{qu} = \frac{2\pi^2 e^4 m}{h^3}\,\frac{n_i^2 - n_f^2}{n_i^2\,n_f^2} \qquad (20)$$

for the frequency of the radiation connected with the transition 
n_i → n_f. If n_i − n_f is small compared with n_i we can write instead 

$$\nu_{qu} = \frac{4\pi^2 e^4 m}{h^3 n^3}\,(n_i - n_f). \qquad (20')$$

We see that in the limiting case of large quantum numbers the 
transition Δn = 1 yields a frequency identical with the classical frequency, 
the transition Δn = 2 gives the first harmonic 2ν_cl, etc. Thus there 
is an asymptotic connexion between the frequencies given by the 
Quantum Theory and the classical frequencies corresponding to the 
mechanical motions of the system. For this reason a quantum theory 
computation of the radiation from an antenna would lead to the same 
result as that already obtained. The extension of this relationship, 
which is called the Principle of Correspondence, makes it possible to 
fill, in part, a gap in the previous theory. Thus, while the classical 
theory permits the calculation not only of 
the frequency but also of the intensity, polarization and coherence of 
the emitted radiation from a knowledge of what is occurring in the 
system (cf. p. 335 et seq.), the Bohr theory gives only the frequency. 
In the language of the Bohr theory, the intensity corresponds to the 
frequency of occurrence of a quantum transition. The two fundamental 
postulates have nothing to say concerning this quantity. 

* It is only for the middle elements that equation (38) is obeyed exactly. 
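
The asymptotic agreement of the frequencies is easily checked numerically. The sketch below assumes the Bohr expressions for hydrogen and measures all frequencies in units of 4π²me⁴/h³, so that the classical orbital frequency in the n-th orbit is 1/n³ and the quantum frequency of the transition n_i → n_f is (1/2)(1/n_f² − 1/n_i²). The function names are illustrative.

```python
# Correspondence Principle for hydrogen: ratio of the quantum frequency
# to the classical orbital frequency, in units of 4*pi^2*m*e^4/h^3.

def nu_classical(n):
    """Classical orbital frequency in the n-th Bohr orbit."""
    return 1.0 / n**3


def nu_quantum(n_i, n_f):
    """Quantum frequency of the transition n_i -> n_f."""
    return 0.5 * (1.0 / n_f**2 - 1.0 / n_i**2)


for n in (2, 10, 100, 1000):
    r1 = nu_quantum(n, n - 1) / nu_classical(n)   # transition with delta n = 1
    r2 = nu_quantum(n, n - 2) / nu_classical(n) if n > 2 else float("nan")
    print(f"n = {n:5d}: delta n = 1 -> {r1:.4f}, delta n = 2 -> {r2:.4f}")
```

For small n the ratios differ widely from 1 and 2; as n grows they approach the fundamental and the first harmonic of the classical motion.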

Bohr then assumes (and his assumption is justified by its success) 
that an asymptotic connexion obtains for intensity and polarization 
also, in such a way that the frequency of the transitions producing 
light of a given state of polarization is determined by the amplitude 
of the radiation as calculated on classical principles. For small numbers 
it is not determined whether the initial or the final orbit is to be taken 
— it is possible only to make an estimate. The results become unique, 
however, if a given harmonic component is absent from both the 
initial and the final states. In that case the corresponding quantum 
transition certainly does not occur. In this way we arrive at principles 
of selection which in many cases limit to a great extent the number 
of possibilities of combination of the various energy levels. 

The Correspondence Principle also points the way to an extension 
of the quantum hypotheses to systems with several degrees of freedom. 
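According to the classical theory, the frequencies of a system are given 
by (cf. p. 130) 

$$\nu_i = \frac{\partial E}{\partial J_i}; \qquad (39)$$

hence any combination frequency is given by 

$$\nu_{cl} = s_1\frac{\partial E}{\partial J_1} + s_2\frac{\partial E}{\partial J_2} + \dots + s_f\frac{\partial E}{\partial J_f}, \qquad (40)$$

where the s_i are any positive or negative integers. Again, for a 
relatively small energy change, the energy difference is given as a function 
of the changes of the phase integrals. Neglecting higher terms in the 
Taylor expansion, 

$$\Delta E = h\nu_{qu} = \frac{\partial E}{\partial J_1}\,\Delta J_1 + \frac{\partial E}{\partial J_2}\,\Delta J_2 + \dots + \frac{\partial E}{\partial J_f}\,\Delta J_f. \qquad (41)$$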


If the asymptotic relation between the classical and the quantum 
frequency is to be retained, the phase integrals for the case of several 
degrees of freedom must change by integral multiples of h. This con- 
dition will certainly obtain if we impose the requirement that not only 
their changes but the quantities themselves are to be integral multiples 
of h. But it follows also from the asymptotic connexion that for 
degenerate systems there are only as many quantum conditions as the 
system has periods. The Kepler ellipse in space is a doubly-degenerate 
problem, for corresponding to the three co-ordinates of the electron 
there is but a single frequency, that of the motion in the orbit. This 
means that the energy states of the Kepler ellipse depend upon only 
one of the action variables, viz. J_1 (cf. p. 135). This determines only 
the major axis. Hence all ellipses having the same major axis have the 
same energy. The circular orbit also belongs to this set; therefore the 
formulae of § 3 and § 4 are valid also for elliptic orbits, provided we 
understand a to mean the semi-major axis. The introduction of elliptic 
orbits is necessary, since with only circular orbits the Correspondence 
Principle would yield the result that on account of the absence of 
overtones in the initial and final states only transitions for which 
Δn = 1 would be allowed. This means that only the first member of 
each series would appear. For elliptic orbits, on the other hand, all 
higher harmonics are present. 


7. The Atomic Spectra of the Alkalis, the Alkaline Earths and similar 
Systems. 

All calculations of the energy levels of more complicated atoms 
start from the hydrogen terms, and the modifications of the term 
values are approximated to by calculating the perturbations due to 
the other electrons. In order to be able to perform this calculation, 
we require a far-reaching schematization of the atom, whose justifi- 
cation is, moreover, given by the newer Quantum Theory. We consider 
first a system which we have reason to believe, especially from 
chemistry, consists of a loosely-bound electron (the valence electron) and a 
core consisting of the remaining (Z − 1) electrons and a nucleus with 
Z charges. The normal atoms of the first column of the periodic table 
(the alkalis and the series Cu, Ag, Au) are represented by such a 
system. Because of the screening effect of the core electrons, the 
valence electron is under the influence of an effective charge of one 
unit. If we neglect entirely the perturbations caused by the core, we 
obtain the hydrogen terms. Now it has been known for a long time 
that the alkali spectra have considerable similarity to the hydrogen 
spectrum. It is to be expected, then, that the deviations can be 
explained as disturbances caused by the electric field of the core. We 
seek to take this field into account by adding to the Coulomb potential 
of the effective unit nuclear charge a spherically symmetric perturbing 
potential in the form of a power series 

$$U_1 = \frac{c_2}{r^2} + \frac{c_3}{r^3} + \frac{c_4}{r^4} + \dots \qquad (42)$$

Since this field also represents a central force directed toward 
the nucleus, the orbit must remain plane, according to p. 89. Thus 
its orientation in space remains fixed, and so the angle variable w_3, 
a Delaunay element (p. 136), remains constant. However, the 
complete computation of this mechanical problem shows that the 
perturbing potential causes an advance of the perihelion, i.e. the angle 
variable w_2 becomes a true variable. As a result, the energy no longer 
depends on J_1 alone, but also on J_2 = J_θ + J_φ, and 
we have two independent fundamental frequencies: that of the motion 
in the ellipse and that of the advance of perihelion. On the basis of the 
Quantum Theory we are thus required to introduce, in addition to the 
principal quantum number n connected with the description of the 
ellipse, a secondary quantum number k in such a way that J_2 = kh. 
Now it can be shown that J_2/J_1 = k/n gives the ratio of the minor to 
the major axis of the ellipse, so that we must have 1 ≤ k ≤ n. The 
precessing ellipse traces a rosette orbit whose energy is found as follows. 
We compare this energy with that of the ellipse which osculates the 
true orbit at aphelion; this is the path which the electron would pursue 
if the perturbing field were absent. Since the potential of the core, 
and still more the field of the core, decrease with a comparatively 
high power of the distance, the potential energies of the two orbits 
may be taken to be approximately equal at aphelion. But the kinetic 
energies are also the same, for since the centripetal force is the same, 
the centrifugal force mv²/ρ is also the same in both paths; moreover, 
the radius of curvature ρ is the same for the two osculating paths, so 
that the kinetic energies are identical. In general, the osculating ellipse 
will not be exactly a quantum orbit. We assign to it a non-integral 
effective quantum number n* by means of the definition of 
hydrogen-like terms: 

$$E = -\frac{Rhc}{n^{*2}}. \qquad (43)$$

Since, by the Law of Areas, the part of the orbit lying close to the 
core is traversed very rapidly, while the perturbations due to the core 
electrons are appreciable only along this part, we can — to a first 
approximation — set the rotational frequencies on the two paths equal 
to each other. For the ellipse, for which there is but one quantum 
number n*, we have 

$$\nu = \frac{dE(n^*)}{h\,dn^*}, \qquad (44)$$

while for the rosette we must write 

$$\nu = \frac{\partial E(n, k)}{h\,\partial n}, \qquad (45)$$

since E depends also on k. From the equality of the two frequencies 
it follows that the difference (n − n*) cannot depend upon n itself, 
but only upon k:† 

$$n^* = n + \phi(k). \qquad (46)$$

We thus arrive at the “Rydberg Term” 

$$E = -\frac{Rhc}{[n + \phi(k)]^2}. \qquad (47)$$

† For ∂E/∂n = (dE/dn*)(∂n*/∂n) = dE/dn*, and hence ∂n*/∂n = 1. 


We obtain various term sequences according to the value of k. That 
corresponding to k = 1 is called the S sequence; k = 2 yields the P 
sequence; k = 3, the D sequence; k = 4, the F sequence. The origin 
of this notation will be explained below. 

A further improvement is obtained if, instead of setting the periods 
of ellipse and rosette equal, we assume that their difference depends 
only upon the secondary quantum number k. This assumption is 
justified by the fact that the perturbation takes place mainly on the 
inner portion of the orbit, and this is determined primarily by the 
eccentricity, i.e. by the secondary quantum number k. Thus is obtained 
the “Ritz Term” 

$$E = -\frac{Rhc}{\left[n + \phi(k) + \dfrac{\psi(k)}{n^2}\right]^2}, \qquad (48)$$

which represents each term sequence k = const. by means of two 
constants φ(k) and ψ(k). This formula yields an excellent 
representation of the observed data. 
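
The statement that n* − n depends only on the term sequence can be tried on actual data. The sketch below computes the effective quantum number from the hydrogen-like definition E = −Rhc/n*² for the three lowest S terms of sodium. The binding energies are approximate modern spectroscopic values and are an assumption of this example, not figures from the text; the levels are labelled here by the modern principal quantum numbers 3, 4, 5 rather than by the current numbers of the older notation.

```python
# Effective quantum numbers and Rydberg corrections for the S terms of sodium.
# Assumptions: binding energies (eV) are approximate modern values;
# n is the modern principal quantum number of the valence electron.
import math

RYDBERG_EV = 13.6057   # R*h*c expressed in electron volts (rounded)

# (n, binding energy of the valence electron in eV) for the 3s, 4s, 5s levels
SODIUM_S_TERMS = [(3, 5.139), (4, 1.948), (5, 1.023)]


def effective_quantum_number(binding_energy_ev):
    """n* from the hydrogen-like term E = -Rhc / n*^2."""
    return math.sqrt(RYDBERG_EV / binding_energy_ev)


def rydberg_correction(n, binding_energy_ev):
    """phi = n* - n for a level labelled by the integer n."""
    return effective_quantum_number(binding_energy_ev) - n


corrections = [rydberg_correction(n, e) for n, e in SODIUM_S_TERMS]
for (n, e), phi in zip(SODIUM_S_TERMS, corrections):
    print(f"n = {n}: n* = {effective_quantum_number(e):.4f}, phi = {phi:+.4f}")
```

The correction is nearly the same for all three terms; the small residual drift with n is what the second constant of the Ritz Term is meant to absorb.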

The possibilities of transition between the energy levels corre- 
sponding to various values of k are drastically limited by the Corre- 
spondence Principle. It is true that the Fourier analysis of the motion 
of a rotating Kepler ellipse yields all harmonics of the orbital 
frequency ν_1, but on account of the uniform advance of perihelion no 
harmonics of the precessional frequency result; ν_2 occurs only in 
the combination sν_1 ± ν_2. In terms of the Correspondence Principle 
this means that a change of the principal quantum number is always 
connected with a change of the secondary quantum number by ±1. 
This signifies that an S term can combine only with a P term, a P term 
only with an S or a D term, and so on. In this way we interpret the 
empirically found series, for which the following notation is usual:* 

s = φ(1), p = φ(2), d = φ(3), f = φ(4); 
σ = ψ(1), π = ψ(2), δ = ψ(3), φ = ψ(4); 


$$nS = \frac{R}{(n+s)^2}, \quad\text{or, more precisely,}\quad \frac{R}{\left[n + s + \dfrac{\sigma}{n^2}\right]^2};$$

$$nP = \frac{R}{(n+p)^2}, \quad\text{or, more precisely,}\quad \frac{R}{\left[n + p + \dfrac{\pi}{n^2}\right]^2}.$$

The series are then written simply: 

Principal Series:    ν = 1S − nP,   n = 2, 3, 4, ... 
Sharp Series:        ν = 2P − nS,   n = 2, 3, 4, ... 
Diffuse Series:      ν = 2P − nD,   n = 3, 4, 5, ... 
Fundamental Series:  ν = 3D − nF,   n = 4, 5, 6, ... 

This terminology has the following origin. The Principal Series 
contains those lines which the normal atom can absorb, since the fixed 
term with which the P sequence combines is the ground term. These 
lines are also the easiest to excite in emission. The names of the Sharp 
and Diffuse series arise from insignificant differences in the sharpness 
of the lines. In the German literature these series are called the “ Second 
Subordinate Series ” and “ First Subordinate Series ” respectively. 
It is to be noticed that they have the same series limit. The Funda- 
mental Series is so called because it was considered of particular sig- 
nificance on account of the fact that its terms differ but little from 
those of hydrogen. German writers refer to this series as the “ Berg- 
mann Series ”, after its discoverer. All possible combinations are not 
covered by these series, however. Apart from the lines fitting into the 
historic series, a large number of “combination lines”, e.g. 3D − 3P, 
are known. 

Besides the spectra of neutral atoms of the first column of the 
periodic table, the spectra of singly-ionized alkaline earth metals and 
of doubly-ionized atoms of the third column are also of the alkali type. 
We note that in all these cases only one valence electron is present. 
The factor Z² which must be supplied in the term formula is equal to 
4 for the singly-ionized alkaline earths, and 9 for the doubly-ionized 
elements of the third column. But we can distinguish the same series 
also for the elements with two valence electrons, i.e. the neutral atoms 
of the second column.† We must imagine these series to originate when 
one of the two valence electrons occupies the orbits corresponding to 
the several energy levels, the other valence electron being counted as 
part of the core in computing the terms; i.e. for a spherically sym- 
metric perturbing potential we again imagine its charge spread over 
a sphere. The fact that this approximation is better than might be 
expected is explained by the Wave Mechanics. 

Besides the lines fitting into this system, it has been known for a 
long time that there are lines which originate by combination with a 
new sequence of energy levels empirically called the P' term sequence. 
The remarkable feature of these terms is that they become negative 
for higher values of the current number and converge to a negative 
limit. This evidently has the following meaning. If one of the valence 
electrons is raised to higher and higher P' levels and finally freed 
from the atom, the amount of work required exceeds the amount 
needed to remove it by way of the normal energy levels by the 
magnitude of the energy of the negative limit. But this work can be done 
only on the remainder of the atom, i.e. the limit of the P' terms 
corresponds to an ion with an excited electron. Thus the accented terms 
belong to atomic states in which both valence electrons are raised 
simultaneously. Then the difference ∞P − ∞P' must correspond to 
the difference between two terms of the ionized atom. As a matter of 
fact, for calcium, for example, this difference is equal to that of the 
1S and 3D states of the ion. 

* These current numbers are not the actual principal quantum numbers, but are 
the nearest integers to the effective quantum numbers. 

† The difference between systems with one and with two valence electrons is 
manifested by the “fine structure” of the lines. This is discussed below. 

The distribution of the energy of excitation among several elec- 
trons makes the spectra of elements with a number of equivalent 
electrons very complicated. The development of series becomes less 
and less pronounced, since the energy of excitation is employed to lift 
several electrons, instead of raising a single electron to a high orbit. 
However, the abundance of lines in such a spectrum is due in part also 
to a further difference which we shall examine in the following sections. 

Before we take up these questions we shall attempt to give a more 
complete physical foundation for the potential of the core. If we 
compare the spectroscopically determined values of the functions φ 
and ψ in the Ritz formula with those which would result 
mathematically from given coefficients c_2, c_3, c_4, ..., we find that for the alkalis, 
from k = 3 onward, the coefficient c_4 of 1/r⁴ predominates. This may 
be interpreted as follows. Under the influence of the valence electron, 
the electron shell will be displaced slightly in a direction away from 
the valence electron. As a result, the centre of gravity of the positive 
charge will no longer coincide with that of the negative charge, and a 
dipole is formed. If the polarizability of the ion is α, the dipole 
moment will be |m| = αe/r²; but its potential at the place occupied 
by the electron is * 




[...] α [...] from the Rydberg and Ritz corrections 
[...] refraction data on the ion corresponding to 
[...]. For small values of k, i.e. for S and P terms, the 
[...] to the region of the core electrons, which we imagine 
[...] spherical [...] derivation of the Rydberg [...] 

8. Perturbation of Electron Orbits by External Forces. Stark Effect 
and Zeeman Effect. 


The methods of Celestial Mechanics, whose fundamentals were 
derived in Chapter VI, Part II of this book (p. 108), together with the 
quantum conditions, permit us to calculate the perturbations of the 
term values caused by external electric or magnetic fields. The calcu- 
lation becomes particularly simple when based upon two theorems 
which must be given here without proof. 

The first theorem, the so-called Adiabatic Principle, states that if 
perturbations are produced in a system by slowly altering the external 
conditions, the energy is in general changed, but the phase integrals 
retain their former values. If, then, a state is once characterized as 
a quantum state by values of the phase integrals which are integral 
multiples of h, it remains so when the system undergoes an adiabatic 
change. The meaning of the theorem may be illustrated by means of 
the so-called Einstein pendulum. Let a pendulum bob suspended by 
a thread swing with an amplitude corresponding to a quantum state 
nhν of an oscillator. Let the frequency be raised to ν' by slowly 
shortening the thread at the point of support. As may be verified 
readily in this case, the pendulum takes up enough energy to give it 
an amplitude corresponding to nhν'. 
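
The adiabatic invariance of the ratio of energy to frequency can be made plausible by a short numerical experiment. The sketch below does not model the pendulum and thread themselves; it assumes, as a simplification, a linear oscillator of unit mass whose angular frequency is raised slowly and uniformly from 1 to 2, and integrates the motion with a velocity Verlet scheme. All names and parameter values are illustrative.

```python
# Adiabatic invariance of E/omega for a slowly varying harmonic oscillator.
# Assumption: x'' = -omega(t)^2 * x with omega rising linearly in time;
# this is a stand-in for the Einstein pendulum, not a model of the thread.

def adiabatic_ratio(omega_start=1.0, omega_end=2.0, duration=2000.0, dt=0.01):
    """Return (E/omega at the start, E/omega at the end) for unit mass."""

    def omega(t):
        return omega_start + (omega_end - omega_start) * t / duration

    def energy(x, v, w):
        return 0.5 * v * v + 0.5 * w * w * x * x

    x, v, t = 1.0, 0.0, 0.0
    initial = energy(x, v, omega_start) / omega_start
    for _ in range(int(duration / dt)):
        v -= 0.5 * dt * omega(t) ** 2 * x   # half kick with the old frequency
        x += dt * v                          # drift
        t += dt
        v -= 0.5 * dt * omega(t) ** 2 * x   # half kick with the new frequency
    final = energy(x, v, omega(t)) / omega(t)
    return initial, final


initial, final = adiabatic_ratio()
print(f"E/omega before: {initial:.4f}, after: {final:.4f}")
```

Although the energy itself is roughly doubled, the ratio E/ω is reproduced to a small fraction of one per cent, which is the content of the Adiabatic Principle for this system.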

The second theorem, which becomes evident immediately, states 
that if there is a perturbing function in the expression for the energy, 
the value of the additional energy due to this cause is, to a first approxi- 
mation, equal to the mean value of the perturbing energy taken over 
the unperturbed orbit. For example, if a hydrogen atom is brought 
into a homogeneous electric field F whose direction is that of the 
z-axis, a perturbation term −eFz is to be added to the Coulomb term 
−e²/r in the expression for the energy, and the average change in 
energy is −eF z̄, where z̄ is the mean value of z over the unperturbed 
orbit. Naturally, this depends on the eccentricity and 
orientation of the orbit, i.e. the orbits having the same major axis — 
which have the same energy in the unperturbed atom — assume dif- 
ferent energy values when the field is applied; in other words, the 
energy levels are subdivided. This subdivision is very complicated in 
the case of hydrogen, and it must be looked upon as one of the finest 
accomplishments of the theory that it makes possible the compu- 
tation of these patterns. This phenomenon is the Stark Effect , dis- 
covered independently by Stark and by Lo Surdo. The theory shows 
at once that the subdivision of the levels occurs in this form only for 
one-electron systems. For multiple-electron systems a splitting into 
orbits having various eccentricities but equal major axes is already 
caused by the field of the core. This is merely the decomposition into 
S, P, D term sequences. The external field can exert but a small 
additional influence which manifests itself in a displacement increasing 
with the square of the field strength and, occasionally, by splitting. 
On the other hand, the field has a great influence on the intensity of 
the lines of these atoms; the field gives rise to harmonics which, accord- 
ing to the Correspondence Principle, make possible transitions which 
cannot occur in the absence of the field. Since very intense local fields 
occur in condensed spark discharges, we see why many “ forbidden ” 
lines appear under such conditions. 

The perturbations caused by a magnetic field are of even greater 
importance in the investigation of the structure of the atom, for a 
field of this kind causes the spectral lines of the atoms to subdivide in 
very characteristic ways. This is the Zeeman Effect. Application of 
the second perturbation theorem leads into difficulty, since the force 
−e[vH]/c exerted by a magnetic field on an electron has no potential. 
One might even believe, at first, that a magnetic field could cause no 
change in energy at all, since the force is always normal to the path, 
and so can do no work on the electron. This is, of course, true for the 
field itself, but it must be remembered that while the magnetic field 
is building up there exists a time derivative ∂H/∂t, and hence, by the 
second Maxwell equation, an electric vorticity (cf. Ex. 89, p. 316). 
This curl determines an energy change, but at the same time the orbit 
remains a quantum orbit on account of the Adiabatic Principle. For 
simplicity, we shall assume the field to increase at a uniform rate, 
attaining its full value after N electronic rotations. Then we may 
replace ∂H/∂t in the Maxwell equation by H/NT. The work done by 
the electric field during the time of one revolution of the electron is 

$$\frac{\Delta E}{N} = -e\oint \mathbf{E}\cdot d\mathbf{s} = -e\,(\operatorname{curl}\mathbf{E})\cdot\mathbf{S} = \frac{e}{c}\,\frac{\mathbf{H}}{NT}\cdot\mathbf{S},$$

where S denotes the area and orientation of the orbit. The total work 
done after N revolutions is then 

$$\Delta E = \frac{e}{c}\,\mathbf{H}\cdot\frac{\mathbf{S}}{T}.$$

On account of the Law of Areas we can replace S/T by the areal 
velocity dS/dt and express this in turn by the mechanical moment of 
momentum d. Then we have 

$$\Delta E = \frac{e}{2mc}\,\mathbf{H}\cdot\mathbf{d}.$$

But according to p. 463, the magnetic moment of an electron orbit 
whose moment of momentum is d is given by 

$$\mathbf{p} = -\frac{e}{2mc}\,\mathbf{d}, \qquad (52)$$

so that 

$$\Delta E = -\mathbf{p}\cdot\mathbf{H}. \qquad (53)$$

The work done in increasing the field from zero to its final value H is 
equal to the energy of the magnetic dipole of strength p in the field H. 

In what form is this energy present in the atom? Since there is 
no such quantity as potential energy of an electron in a magnetic field, 
this energy can be present only as kinetic energy. To a first 
approximation the change of motion can be given very simply. We prove 
a theorem given by Larmor, to the effect that to a first 
approximation the only effect of the magnetic field is to cause a uniform rotation 
of the entire electron system about the field as an axis, the direction 
and magnitude of the rotation being given by 

$$\boldsymbol{\omega} = \frac{e}{2mc}\,\mathbf{H}. \qquad (54)$$


If the co-ordinate system to which the electron motions are referred 
is in rotation with respect to an inertial system, the following inertial 
forces arise (cf. p. 233): the Coriolis force 

$$\mathbf{F}_C = -2m\,[\boldsymbol{\omega}\,\mathbf{v}_{rel}] \qquad (55)$$

and the centrifugal force 

$$\mathbf{F}_Z = -m\,[\boldsymbol{\omega}[\boldsymbol{\omega}\,\mathbf{r}]].$$

If the product [ωr] is small compared with the velocity in the moving 
system (which is, of course, the case here) then the centrifugal force 
can be neglected in comparison with the Coriolis force. The force 
exerted by the magnetic field, on the other hand, is 

$$\mathbf{F}_m = -\frac{e}{c}\,[\mathbf{v}\,\mathbf{H}], \qquad (56)$$

where, strictly, v should be replaced by the absolute velocity 
v_rel + [ωr], but may be replaced approximately by v_rel. The two 
forces neutralize each other, i.e. determine the same electron orbits in 
the moving system as those which would be present without the field 
in the system at rest, if 

$$\mathbf{F}_C + \mathbf{F}_m = -\frac{e}{c}\,[\mathbf{v}_{rel}\,\mathbf{H}] + 2m\,[\mathbf{v}_{rel}\,\boldsymbol{\omega}] = 0, \quad\text{or}\quad \boldsymbol{\omega} = \frac{e}{2mc}\,\mathbf{H}. \qquad (57)$$

This is known as the Larmor precession. The corresponding change 
in kinetic energy — which may be positive or negative according to 
the relative positions of co and the orbital angular momentum — is the 
work done by the field. In what follows we shall use the convenient 
formula (53) to determine the change of the term values. 

As long as any arbitrary orientation of p (or, what amounts to 
the same thing, of the angular momentum vector d) with respect to 
the field H is permissible, we obtain a continuous manifold of energy 
changes. This means that a quantized energy value would become 
diffuse in the field, so that no splitting of lines but only a broadening 
could take place in a magnetic field. However, it must be remembered 
that the Larmor precession of a Kepler ellipse determines a uniform 
rotation of the line of nodes, and the third Delaunay variable w_3 
becomes a true variable (cf. p. 136). Then, according to § 6 (p. 656), 
the associated action variable J_3 must become an integral multiple 
of h. This quantity represents 2π times the z-component of the moment 
of momentum. Since 2π times the total angular momentum is itself 
an action variable J_2, which is an integral multiple of h, it follows 
that if we set J_3 = mh the inclination of the orbit is given by 

$$\cos\vartheta = \frac{J_3}{J_2} = \frac{m}{k}. \qquad (58)$$

We thus arrive at a directional quantization. For k = 1 only the 
values m = +1, 0, −1, i.e. the angles 0, π/2, π are possible. It is to 
be noticed that the quantum numbers n and k are essentially positive 
quantities, but the “magnetic” quantum number m, on the other 
hand, can assume both positive and negative values, since an external 
field determines a definite sense as well as direction. 
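
The permitted inclinations follow at once from cos ϑ = m/k. The following sketch simply tabulates them for a given secondary quantum number k, taking m to run over the integers from +k to −k as in the example k = 1 above; the function name is illustrative.

```python
# Directional quantization: allowed angles between the angular momentum
# and the field, from cos(theta) = m / k with m = k, k-1, ..., -k.
import math


def allowed_inclinations(k):
    """Return a list of (m, angle in degrees) for secondary quantum number k."""
    return [(m, math.degrees(math.acos(m / k))) for m in range(k, -k - 1, -1)]


for k in (1, 2, 3):
    angles = ", ".join(f"m={m:+d}: {a:.1f}" for m, a in allowed_inclinations(k))
    print(f"k = {k}: {angles}")
```

For k = 1 this reproduces the three orientations parallel, perpendicular and antiparallel to the field; larger k gives 2k + 1 discrete orientations in place of the classical continuum.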

This directional (or spatial) quantization, which is of especial 
importance in the Theory of Paramagnetism, was directly verified by 
the experiments of Stern and Gerlach.* In these experiments a fine 
pencil of atoms of silver or some other metal was produced by 
evaporating the metal in a high vacuum. The beam was sent through 
a non-homogeneous magnetic field whose gradient grad |H| had 
practically the same direction as H. Such a field exists, for example, 
between two pole pieces, one of which is a flat plate, the other a sharp 
edge.† A magnetic dipole experiences a translational force in such a 
field (cf. p. 286); this force is in the direction of H and is 
given by 

$$|\mathbf{F}| = |\mathbf{p}|\,|\operatorname{grad} H|\,\cos(\mathbf{p}, \mathbf{H}). \qquad (59)$$

If, as would be expected in the classical statistics, all orientations 
of p occurred as a result of the thermal motion, the beam would 
experience a continuous broadening in the magnetic field. In reality, 
however, the beam of silver atoms was found to break up into two 
parts, corresponding to the orientations 0 and π. There is additional 
[...] Since the moment of momentum is an integral 
multiple of h/2π (this holds for all atoms, not only for hydrogen), it 
follows from the magneto-mechanical parallelism (p. 463) that the 
magnetic moment must also be an integral multiple of the quantity 

$$\mu_B = \frac{eh}{4\pi mc} = 9.27 \times 10^{-21}\ \text{e.m.u.} \qquad (60)$$

* Ann. der Physik, 74, p. 673 (1924). 

† This type of [...]. In carrying out the experiment it is even more 
closely approximated by [...] of rectangular cross-section that is cut 
in the plate. 


The atomic ray experiments actually gave a deviation for silver which 
yielded |p| = 1 μ_B, but one cannot expect the susceptibility 
measurements evaluated according to p. 458 to give integral numbers of 
magnetons, since the directional quantization is not assumed in this 
evaluation (cf. p. 672). 
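
The numerical value of the magneton can be checked from eh/4πmc in Gaussian units. The constants below are rounded modern values and are assumptions of this sketch, not figures from the text.

```python
# Bohr magneton in Gaussian (c.g.s.) units: mu_B = e*h / (4*pi*m*c).
# The constants are rounded modern values, not taken from the text.
import math

E_ESU = 4.80320e-10    # elementary charge, e.s.u.
H_PLANCK = 6.62607e-27 # Planck's constant, erg s
M_E = 9.10938e-28      # electron mass, g
C_LIGHT = 2.99792e10   # velocity of light, cm/s


def bohr_magneton_cgs():
    """Bohr magneton in erg per gauss."""
    return E_ESU * H_PLANCK / (4.0 * math.pi * M_E * C_LIGHT)


mu_B = bohr_magneton_cgs()
print(f"mu_B = {mu_B:.4e} erg/gauss")
```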

With the aid of equation (53) (p. 663) and equation (60), we can 
write at once the energy changes experienced in a magnetic field by 
an atom with secondary quantum number k, i.e. with moment of 
momentum kh/2π: 

$$\Delta E = -\mathbf{p}\cdot\mathbf{H} = +\mu_B\,kH\cos\vartheta = +\mu_B\,Hm = \pm\frac{eh}{4\pi m_{el}c}\,|m|\,H. \qquad (61)$$


In order to find how the spectral lines are split up, we must derive
selection principles for the spatial quantum number m. We again
turn to the Correspondence Principle and use it to determine the
nature of the light observed in various directions when the radiation
takes place according to the classical theory. We can determine the
time rates of change of the components of the dipole moment from
the motions of the electrons. By p. 335 et seq. these quantities
determine the radiation of the “antenna system”. At the same time,
we must bear in mind the fact that a Hertzian oscillator emits no
radiation in the direction of its axis. Hence, if we observe in the
direction of the magnetic field (“longitudinal effect”) we do not perceive
the z-component of the motion of the electrons, which is not affected
by the field. The x- and y-components, which execute the Larmor
precession, are observed completely. The Fourier analysis yields
combination frequencies of the form $s\nu_1 \pm \nu_2 \pm \nu_3$. In accordance
with the Correspondence Principle, this means that only the quantum
transition $\Delta m = \pm 1$ occurs for radiation in the direction of the field,
and that since the classical radiation is circularly polarized in the
initial and final orbits, the emitted light is circularly polarized. In
observing perpendicular to the field (“transverse effect”), the
uninfluenced component is observed completely. Its Fourier analysis does
not contain $\nu_3$, but only frequencies of the form $s\nu_1 \pm \nu_2$, so that we
have quantum transitions $\Delta m = 0$ connected with radiation normal
to the field. The light vibrations are then parallel to the field. Of
the transverse components of motion, i.e. the x- and y-components,
the one in the line of sight will not be observed; the light sent out by
the other is linearly polarized in a direction normal to the field. Since the
combinations $s\nu_1 \pm \nu_2 \pm \nu_3$ again occur in the Fourier analysis,
the transitions $\Delta m = \pm 1$ correspond to them. Then, on account
of the fact that $m_i - m_f$ is limited to $\pm 1$ or 0, the lines are divided in
the following way:



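$$h\,\Delta\nu = \frac{e h H}{4\pi m_{el} c}\,(m_i - m_f), \qquad \Delta\nu = 0 \quad\text{or}\quad \pm\frac{e H}{4\pi m_{el} c} . \qquad (62)$$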



In the longitudinal effect the value 0 is excluded and the light of both 
components is either right-handed or left-handed circularly polarized,* 
while in the transverse effect the undisplaced line is polarized in a 
direction parallel to the field, the side components in a direction normal 
to the field. This pattern of lines is called the normal Lorentz Triplet. 
The theory of this pattern was derived by Lorentz using the simple 
linear oscillator as a model of the radiating atom.† The Quantum
Theory goes no farther; this is strange, since the normal Zeeman 
effect is shown by relatively few lines, and most lines exhibit much 
more complicated patterns containing large numbers of components — 
the so-called anomalous Zeeman effect. Nevertheless, regardless of 
how complicated the atom may be, the total angular momentum being 
constant in time is an action variable, and hence subject to the quan- 
tum condition; thus our formula is generally valid as long as the 
normal connexion between moment of momentum and magnetic 
moment obtains. 
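
The magnitude of the displacement in the normal Lorentz triplet may be estimated numerically. The following Python sketch evaluates the shift of the side components, $eH/4\pi m_{el}c$ in the Gaussian units used here, written in its SI form $eB/4\pi m_e$. The function names, the field of 1 tesla and the reference wavelength are illustrative assumptions and are not taken from the text.

```python
import math

E_CHARGE = 1.602176634e-19      # elementary charge in coulombs (SI)
M_ELECTRON = 9.1093837015e-31   # electron rest mass in kilograms
C_LIGHT = 2.99792458e8          # speed of light in metres per second


def lorentz_shift_hz(b_tesla):
    """Frequency displacement of the side components of the normal triplet.

    SI form of e H / (4 pi m_el c): e B / (4 pi m_e).
    """
    return E_CHARGE * b_tesla / (4.0 * math.pi * M_ELECTRON)


def triplet_frequencies(nu0_hz, b_tesla):
    """Return the three line frequencies (lower, undisplaced, upper)."""
    shift = lorentz_shift_hz(b_tesla)
    return (nu0_hz - shift, nu0_hz, nu0_hz + shift)


if __name__ == "__main__":
    b = 1.0                       # assumed field strength in tesla
    nu0 = C_LIGHT / 589.0e-9      # a yellow line near 5890 A.U., for scale only
    shift = lorentz_shift_hz(b)
    print(f"shift          = {shift:.4e} Hz")
    print(f"relative shift = {shift / nu0:.2e}")
```

For a field of 1 tesla the displacement is about 1.4 × 10^10 cycles per second, i.e. only a few parts in 10^5 of the optical frequency, which is why considerable resolving power is needed to observe the effect.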


9. Difficulties arising in the Atomic Theory of Magnetism. Explanation 
in Terms of the Spinning Electron. 

The anomalous Zeeman effect is not the only difficulty offered by 
magnetism in the mechanics of the atom. The fine structure of many 
lines, e.g. the division of the familiar yellow line of sodium 1S − 2P
into two components λ5890 and λ5896 A.U. even in the absence of a
field, is still unexplained. The components of this multiplet are really 
connected, and are not formed merely by lines which happen to lie 
near each other: if these lines are examined in a magnetic field of 
increasing strength, the pattern changes when the separation of the 
Zeeman components becomes comparable with that of the original 
doublet, and passes over into a single normal Lorentz triplet for very strong
fields. This is known as the Paschen-Back Effect. One would conclude,
then, that the doublet is caused by an internal magnetic field, and
the location of this field would be assumed to be the core. But this
cannot be the case, for the alkali ions corresponding to the core are known to
be diamagnetic, and so are incapable of generating a field.

† H. A. Lorentz, The Theory of Electrons. London, Macmillan.

Atoms with several valence electrons exhibit multiplets of much
greater complexity. According to our previous computations these 
components should all come together to form a single line. For
simplicity we consider not the splitting of the lines, which are always
given by the difference of two terms, but rather that of the terms
themselves. The S term is always single. The multiplicity increases
with k up to a maximum value and remains constant thereafter; the
multiplicity does not depend on the principal quantum number, and 
so is a characteristic quantity for each term sequence. The term 
system is designated as singlet, doublet, triplet, . . . according to the 
highest multiplicity. This multiplicity is written as a superscript to 
the left of the term symbol; thus $^3S$ signifies the “triplet S state” of
the atom. An atom with several valence electrons has several term 
systems with various multiplicities. For example, the alkaline earths 
possess a singlet system and a triplet system. As we proceed from 
column to column of the periodic table the even and odd multiplicities 
alternate. The multiplicities of a singly-ionized atom correspond to 
those of the preceding neutral atom. The highest multiplicity in the 
term system of an atom is greater by one than the number of valence 
electrons, i.e. one larger than the number of the column of the periodic 
table in which the atom belongs. Thus, for an element of column VII 
— e.g. manganese — octuple terms can occur. Were it not for selection 
rules, 64 lines could result from combinations between two such octet 
levels. Despite the fact that this number is reduced by the existence 
of selection rules, the tremendous richness of the spectra of atoms 
with many equivalently bound electrons is easy to understand. An 
analysis, however, yields a representation of the spectrum as
combinations of a moderate number of multiplet terms. The lines
resulting from the division of the terms may be represented formally
by giving each sub-level a new quantum number j, introduced by
Sommerfeld as the “inner” quantum number. The selection rule for
j is

$\Delta j = 0$ or $\pm 1$, with the transition $0 \to 0$ forbidden.


Concerning the physical significance of j, Sommerfeld surmised at the
very first that $jh/2\pi$ represents the total angular momentum of the
atom; this may now be considered a proven fact. For an alkali atom,
whose core is not magnetic and which therefore has no angular
momentum, j would then appear to be identical with k, the angular momentum
of the orbital motion of the valence electron. But this is evidently 
not the case, for there are two different values of j corresponding 
to the same k in the P term; hence there must be something 
else in the atom which is magnetic and which possesses angular 
momentum. 

A further remarkable fact is the following: The spectra of the
one-electron systems H and He+ also exhibit “fine structure” which,
however, is quantitatively explained by a theory taking into account the
relativistic variability of mass of the electron. Even in the absence 
of perturbations by the core, the orbits of various eccentricities have 
different energies on this account. For the alkalis, the splitting due to 
the different eccentricities is greatly increased by the core potential 
and the orbits of different eccentricity are separated into S, P, D terms, 
so that the relativistic effect plays a comparatively insignificant part. 
Nevertheless, the alkali doublets follow the same law as the relativistic 
hydrogen doublets as far as the dependence on nuclear charge is con- 
cerned. This might lead one to abandon the magnetic interpretation 
of the alkali doublets, yet there is still the possibility that both types 
of division follow the same law as a matter of chance. 

Finally, we refer in this connexion to the magneto-mechanical 
anomaly found in the Einstein-de Haas Effect (p. 463), in which values 
of the magnetism of ferromagnetic substances result which are twice 
as great as those to be expected according to the magneto-mechanical 
parallelism. 

All of these difficulties were removed at one stroke by Goudsmit 
and Uhlenbeck by their bold assumption that the electron itself possesses 
a magnetic moment and a mechanical angular momentum.* The
magnetic moment is assumed to be one Bohr magneton, while the moment
of momentum is taken to be only $\tfrac{1}{2}(h/2\pi)$. These assumptions are
justified by their results. The magnetic moment may be pictured as 
originating from rotation of the electronic charge about an axis, and 
it is not difficult to find the distribution of charge which gives the 
correct ratio of magnetic moment to angular momentum. However, 
no deeper meaning is to be attached to such a detailed picture of the 
structure of the electron, since it cannot be verified in any way. Never- 
theless, this concept led to the term spinning electron. We refer to the 
inherent angular momentum as the spin of the electron. It is now 
understood that the magneto-mechanical anomaly for ferromagnetic
substances, which are without exception metals, is due to the
conduction electrons, while for salts, where the magnetism is the resultant
of the orbital and spin moments, the factor lies between 1 and 2.


10. The Theory of Multiplets and of their Zeeman Effects. Quantum
Theory of Paramagnetic Susceptibility.

With the aid of the concept of electron spin one can readily bring
order out of the chaos of multiplets. We limit considerations to
… in which the interaction of the electrons may be represented by the
following rules:

1. All spin vectors $s_i$ may be combined algebraically into a resultant
spin s which is an integer or a half-integer (measured in units of $h/2\pi$):

$$s = \sum_i s_i = \sum_i \left(\pm\tfrac{1}{2}\right).$$

Thus for two electrons the resultant spin is either 0 or 1.

* S. Goudsmit and G. Uhlenbeck, Zeitschr. f. Phys., 35, 8-9, p. 618 (1926).

2. All orbital angular momenta are to be combined vectorially to
yield an integral resultant. Here we must introduce a modification
which shows, for the first time, the insufficiency of our mechanical 
models: In order to keep the results in harmony with experience we 
find it necessary to decrease the orbital angular momenta by one 
unit. If these were the actual momenta, it would mean that the S 
orbits possess zero angular momentum, and would thus be straight 
lines through the nucleus. Only the newer quantum mechanics, which 
abstains from all special models, is capable of leading compulsorily to 
this numeration. One less than the secondary quantum number for
each electron is called $l_i$. Thus $l_i = k_i - 1$.

The integral resultant for several electrons is 
called the group quantum number l. The 
symbols S, P, D will be applied to the group 
quantum numbers, while the corresponding 
small letters will be used for the individual 
electrons. 

3. The resultants s and l are to be
combined vectorially into a resultant j which is
integral (including zero) or half-integral,
according as s is one or the other. The various
integral or half-integral resultants are found by 
means of the following construction. Assume 
l > s. Describe a circle of radius s about the 
terminus of the vector l (fig. 3). The terminus 
of the resultant must lie on this circle. Its in- 
tegral or half-integral values are constructed by drawing circles about 
the initial point of l, using the integers and half-integers as radii. The 
intersections of these circles with that of radius s mark the possible 
end points of the resultant vector j. It is seen that for l > s the number
of possibilities, for integral or half-integral spin resultant, is 2s + 1.
This is then the multiplicity r of a term system corresponding to the
spin resultant s. Since the latter is at most N/2 when there are N
valence electrons, the maximum multiplicity is N + 1. The other
empirical rules given on p. 668 are obtained with equal ease from this
construction. We attach the quantum number j to the term symbol
as a subscript, e.g. $2\,{}^2P_{1/2}$.

4. In a weak external magnetic field (additional magnetic energy 
small compared with the natural multiplet separation) the vector j 
sets itself with respect to the field in such a way that its projection on 
the field direction is a positive or negative integer or half-integer
according as j itself is one or the other. This gives (2j + 1) positions,
since the projection 0 is allowable only when j is integral.

5. In a strong magnetic field, on the other hand, the coupling 
between the orbital and spin vectors is dissolved; the resultant orbital 
angular momentum orients itself in such manner that its projection in
the direction of the field is a whole number and, similarly, the resultant
spin sets itself so that its projection on the field is a half-integer. 

These five rules not only embody a correct representation of
multiplets, but also lead to a formula for their Zeeman effect which is almost
exactly correct. 
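
The construction of rule 3 amounts to letting j run in unit steps from |l − s| to l + s. A minimal Python sketch of this counting follows; the function names are hypothetical, and exact fractions are used so that half-integral values are represented without rounding.

```python
from fractions import Fraction


def j_values(l, s):
    """All resultants j of l and s: |l - s|, |l - s| + 1, ..., l + s."""
    l, s = Fraction(l), Fraction(s)
    j = abs(l - s)
    values = []
    while j <= l + s:
        values.append(j)
        j += 1
    return values


def max_multiplicity(n_valence):
    """Largest multiplicity 2s + 1, reached when s = N/2 for N valence electrons."""
    return 2 * Fraction(n_valence, 2) + 1


if __name__ == "__main__":
    half = Fraction(1, 2)
    print("doublet P :", [str(j) for j in j_values(1, half)])
    print("triplet D :", [str(j) for j in j_values(2, 1)])
    print("doublet S :", [str(j) for j in j_values(0, half)])
    print("N = 7     :", max_multiplicity(7))
```

For l = 1, s = 1/2 the function returns j = 1/2 and 3/2, i.e. the two levels of a doublet P term, while for l = 0 only one value results, in agreement with the statement that the S term is always single. With N = 7 valence electrons the maximum multiplicity is 8.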

If we now compute the additional energy due to the magnetic
field by means of the expression $-\mathbf{p}\cdot\mathbf{H}$, we are not permitted to assume
the normal connexion between the total angular momentum $jh/2\pi$
and the magnetic moment, for j is a combination of the normal orbital
angular momentum with the anomalous spin. The true relationship is
obtained from the following considerations: the invariable axis of the
atom is given by the direction of the total angular momentum
vector j. The vector of the resultant orbital angular momentum l
precesses about this line. Since j is by construction the resultant of l
and s, this means that the entire triangle OLS turns about its longest
side. On the average, then, only the components of the vectors l and
s in the direction of j contribute to the magnetic moment. On account
of the anomaly of the spin, the component of s must be included
twice, so that the magnetic moment becomes (cf. fig. 4)

$$|\mathbf{p}| = \frac{e h}{4\pi m_{el} c}\,\bigl[\,j + s\cos(s,j)\,\bigr], \qquad (63)$$

and since

$$\cos(s,j) = \frac{s^2 + j^2 - l^2}{2sj}, \qquad (64)$$

we have

$$|\mathbf{p}| = \frac{e h j}{4\pi m_{el} c}\left(1 + \frac{s^2 + j^2 - l^2}{2j^2}\right). \qquad (65)$$

The factor in brackets, g, measures the ratio of the actual
magnetic moment to the one to be expected in the normal case.
The above value, however, does not correspond exactly with
experience and must be replaced by

$$g = 1 + \frac{j(j+1) + s(s+1) - l(l+1)}{2j(j+1)} . \qquad (66)$$

This again demonstrates that the relations derived from models
reproduce the facts approximately, but not absolutely correctly. It is
only the new Quantum Theory which is capable of justifying formula
(66).
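
To see how closely the model expression approaches the quantum-mechanical one, both may be evaluated side by side. The sketch below assumes the vector-model factor 1 + (s² + j² − l²)/2j² and the factor 1 + [j(j+1) + s(s+1) − l(l+1)]/2j(j+1); the function names are hypothetical.

```python
from fractions import Fraction


def g_model(l, s, j):
    """Vector-model value: 1 + (s^2 + j^2 - l^2) / (2 j^2)."""
    l, s, j = Fraction(l), Fraction(s), Fraction(j)
    return 1 + (s * s + j * j - l * l) / (2 * j * j)


def g_quantum(l, s, j):
    """Quantum value: 1 + [j(j+1) + s(s+1) - l(l+1)] / [2 j(j+1)]."""
    l, s, j = Fraction(l), Fraction(s), Fraction(j)
    return 1 + (j * (j + 1) + s * (s + 1) - l * (l + 1)) / (2 * j * (j + 1))


if __name__ == "__main__":
    half = Fraction(1, 2)
    # (label, l, s, j) for the three doublet terms of an alkali atom
    terms = [("2S1/2", 0, half, half),
             ("2P1/2", 1, half, half),
             ("2P3/2", 1, half, 3 * half)]
    for label, l, s, j in terms:
        print(label, "model:", g_model(l, s, j), "quantum:", g_quantum(l, s, j))
```

For the doublet terms the two expressions agree for the S term with j = 1/2 (g = 2) and for the P term with j = 3/2 (g = 4/3), but for the P term with j = 1/2 the model gives 0 whereas the quantum formula gives 2/3. For s = 0 both reduce to g = 1, the normal case.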


The fact that g has different values in the initial and final states
gives rise to the large number of components in the multiplet pattern.
It follows from the energy increments

$$\Delta E_i = \mu_B g_i m_i H \quad\text{and}\quad \Delta E_f = \mu_B g_f m_f H \qquad (67)$$

that the π components of the transverse effect (polarized with the
electric vector parallel to H), for which $m_i = m_f = \pm m$, are given by

$$h\,\Delta\nu_\pi = \pm\mu_B H m\,(g_i - g_f), \qquad (68)$$

and the σ components of the transverse effect (polarized with the
electric vector perpendicular to H), for which $m_i = m_f \pm 1$, $m_f = \pm m$,
are given by

$$h\,\Delta\nu_\sigma = \pm\mu_B H\,\bigl[m(g_i - g_f) \pm g_i\bigr] . \qquad (69)$$

As we see, the normal pattern is obtained for $g_i = g_f = 1$.

If, however, the magnetic field is very strong, the resultant orbital
and spin angular momenta orient themselves independently, and we
have the energy increment given by

$$\Delta E = \mu_B m_l H + 2\mu_B m_s H, \qquad (70)$$

where $m_l$ and $m_s$ are the projections of l and s on the direction of H;
and since $m_s$ can have only the values +1/2 or −1/2, the energy
increment is that found in the normal effect.
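
The weak-field pattern may be enumerated directly from the energy increments of the initial and final states. The following Python sketch lists the displacements $g_i m_i - g_f m_f$, in units of the normal displacement $\mu_B H/h$, for the transitions with $m_i = m_f$ (π) and $m_i = m_f \pm 1$ (σ). The function names are hypothetical, and the g values used in the example (2, 2/3 and 4/3 for the doublet S and P terms) are the standard Landé values, supplied here as inputs.

```python
from fractions import Fraction


def m_values(j):
    """The 2j + 1 projections m = -j, -j + 1, ..., +j."""
    j = Fraction(j)
    return [-j + k for k in range(int(2 * j) + 1)]


def zeeman_components(j_i, g_i, j_f, g_f):
    """Displacements g_i*m_i - g_f*m_f in units of mu_B*H/h.

    Returns two sorted lists: the pi components (m_i = m_f) and the
    sigma components (m_i = m_f +/- 1).
    """
    pi, sigma = set(), set()
    for m_i in m_values(j_i):
        for m_f in m_values(j_f):
            shift = Fraction(g_i) * m_i - Fraction(g_f) * m_f
            if m_i == m_f:
                pi.add(shift)
            elif abs(m_i - m_f) == 1:
                sigma.add(shift)
    return sorted(pi), sorted(sigma)


if __name__ == "__main__":
    half = Fraction(1, 2)
    for label, j_i, g_i in [("P(1/2) -> S(1/2)", half, Fraction(2, 3)),
                            ("P(3/2) -> S(1/2)", 3 * half, Fraction(4, 3))]:
        pi, sigma = zeeman_components(j_i, g_i, half, 2)
        print(label, "pi:", [str(x) for x in pi], "sigma:", [str(x) for x in sigma])
```

With the initial state P, j = 1/2 (g = 2/3) and the final state S, j = 1/2 (g = 2) this gives π components at ±2/3 and σ components at ±4/3 of the normal displacement; for the initial state P, j = 3/2 (g = 4/3) one obtains π components at ±1/3 and σ components at ±1 and ±5/3. Setting both g values equal to 1 reproduces the normal triplet.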

On the basis of these considerations the calculation of magnetic
susceptibility performed on p. 458 must be revised also. There we
found that the magnetization of unit volume was given, according to
the classical statistics, by

$$M = \frac{N p^2 H}{kT}\,\overline{\cos^2\theta} .$$

Referring all quantities to one mol, the susceptibility then becomes

$$\chi = \frac{L^2 p^2}{RT}\,\overline{\cos^2\theta} .$$

Since $\overline{\cos^2\theta} = 1/3$ in the classical statistics, we have a unique
connexion between the susceptibility and the atomic moment p.
According to quantum principles,

$$p = g\,j\,\mu_B , \qquad (72)$$

and since only 2j + 1 orientations (viz. those for which the magnetic
quantum number m is a whole number) are allowable, the mean value
$\overline{\cos^2\theta}$ becomes

$$\overline{\cos^2\theta} = \frac{1}{2j+1}\sum_{m=-j}^{+j}\frac{m^2}{j^2} = \frac{2}{(2j+1)\,j^2}\cdot\frac{(2j+1)\,j\,(j+1)}{6} = \frac{j+1}{3j} . \qquad (73)$$

This formula is valid for integral as well as half-integral values of j,
as one may verify readily. Then we have

$$\chi = \frac{L^2 \mu_B^2\, g^2\, j(j+1)}{3RT} . \qquad (74)$$

On account of (72), the Curie constant, which was $L^2 p^2/3R$ in the
classical case, is now

$$C = \frac{L^2 \mu_B^2\, g^2\, j(j+1)}{3R} = \frac{L^2 p^2}{3R}\,\frac{j+1}{j} . \qquad (75)$$
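
The averaging over the 2j + 1 allowed orientations is easily checked by direct summation. The sketch below, with a hypothetical function name, computes the mean of (m/j)² over m = −j, ..., +j in exact fractions and compares it with the closed form (j + 1)/3j, for integral and half-integral j alike.

```python
from fractions import Fraction


def mean_cos2(j):
    """Average of (m/j)^2 over the 2j + 1 projections m = -j, ..., +j."""
    j = Fraction(j)
    ms = [-j + k for k in range(int(2 * j) + 1)]
    return sum((m / j) ** 2 for m in ms) / (2 * j + 1)


if __name__ == "__main__":
    for j in [Fraction(1, 2), Fraction(1), Fraction(3, 2), Fraction(2), Fraction(50)]:
        closed_form = (j + 1) / (3 * j)
        print(f"j = {str(j):>4}:  sum = {mean_cos2(j)},  (j+1)/3j = {closed_form}")
```

For large j the average approaches the classical value 1/3, as it should when the discrete orientations merge into a continuum.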


In order to calculate the elementary magnetic moment p from the 
observed value of C it would be necessary to know the value of j cor- 
responding to the atomic state in question. One must therefore pro- 
ceed in the opposite way by calculating g and then C from the values 
of the quantum numbers l, s and j determined in some other way
(spectroscopically or purely theoretically) and then compare the
results with observation. A test of (75) which is free from objection 
must be undertaken by means of susceptibility measurements on para- 
magnetic, monatomic gases, since it is only here that disturbing in- 
fluences are excluded. Unfortunately, the experimental obstacles have 
not yet been overcome, so that one must turn to experiments on 
solutions containing paramagnetic ions. Among the ions of the rare 
earths we find very good agreement between theory and experiment; 
this is understandable because, unlike the ions of the iron group, the 
electron shell responsible for paramagnetism is not at the surface and 
so is not subject to disturbances from neighbouring molecules. These 
disturbances prevent application of (75) in the case of the iron group
but at the same time they make conditions simpler. To a first
approximation take all the terms of the ions of the iron group to be
S terms. In this instance equation (75) becomes

$$C = \frac{L^2 \mu_B^2}{3R}\,4s(s+1), \qquad (75')$$

i.e. the effective number of Bohr magnetons becomes

$$\mu_{\mathrm{eff}} = \sqrt{4s(s+1)} . \qquad (75'')$$
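
The effective number of Bohr magnetons implied by the Curie constant, $g\sqrt{j(j+1)}$ in general and $\sqrt{4s(s+1)}$ for S terms, can be tabulated with a few lines of Python. This is a sketch only: the function names are hypothetical, and the quantum numbers assigned to the example ions (an S term with s = j = 7/2, an F term with s = 1/2, j = 5/2, and an S term with s = 5/2) are standard spectroscopic assignments assumed here, not values given in the text.

```python
import math
from fractions import Fraction


def lande_g(l, s, j):
    """g = 1 + [j(j+1) + s(s+1) - l(l+1)] / [2 j(j+1)]."""
    l, s, j = Fraction(l), Fraction(s), Fraction(j)
    return 1 + (j * (j + 1) + s * (s + 1) - l * (l + 1)) / (2 * j * (j + 1))


def mu_eff(l, s, j):
    """Effective magneton number g * sqrt(j(j+1))."""
    j = Fraction(j)
    return float(lande_g(l, s, j)) * math.sqrt(float(j * (j + 1)))


def mu_eff_spin_only(s):
    """Effective magneton number for S terms: sqrt(4 s(s+1))."""
    s = Fraction(s)
    return math.sqrt(float(4 * s * (s + 1)))


if __name__ == "__main__":
    half = Fraction(1, 2)
    # Assumed ground terms, for illustration only.
    print("Gd+++ (S term, s = j = 7/2):", round(mu_eff(0, 7 * half, 7 * half), 3))
    print("Ce+++ (F term, j = 5/2)    :", round(mu_eff(3, half, 5 * half), 3))
    print("Fe+++ (spin only, s = 5/2) :", round(mu_eff_spin_only(5 * half), 3))
```

For an S term the two functions necessarily coincide, since then l = 0, j = s and g = 2.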


… always developed to the exclusion of … analogue of an incipient Paschen-Back
effect … first time the general principle that the orientation of the spin vectors conforms
to the magnitude of the energy of interaction. In a weak field the coupling
between l and s overshadows that between the magnets and the field, so that
l and s are first combined to produce j, which then orients itself in the field. But
in a strong field both the coupling between l and the field, and that between s
and the field, exceed that between l and s. By the same principle the LS coupling
is effective only when the energy of coupling between the orbit and spin of an
electron is small compared with the magnitudes of these energies among the several
electrons. The spin-orbit coupling energy increases rapidly with the effective
nuclear charge, so that the conditions no longer apply to heavy atoms,
particularly those with many valence electrons. The other extreme case, called jj
coupling, is encountered, for example, in the spectra of multiply ionized lead. In this
case the spin and orbital momentum of an individual electron combine into a
vector $j_i = l_i \pm \tfrac{1}{2}$. The total angular momentum is the integral or half-integral
vector sum of the $j_i$, but there is no resultant l or s, so that the nomenclature
based on LS coupling cannot be applied at all. If this is occasionally done in
connexion with, say, Pb terms, it is by analogy with the terms of homologous
systems having smaller nuclear charge. X-ray terms, too, are of the jj type. Of
course the g formula becomes quite different for jj coupling. A different g value
must be substituted in the first part of (75), but the second part remains
unchanged because it contains only the magnetic moment and the quantum number
j, which is valid in both cases.
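
Although the two coupling schemes label the levels differently, they must lead to the same set of total angular momenta. The following Python sketch, with hypothetical function names, enumerates the values of the total j for two electrons under both schemes. It assumes non-equivalent electrons, so that no combinations have to be struck out on account of the Pauli principle.

```python
from collections import Counter
from fractions import Fraction

HALF = Fraction(1, 2)


def couple(a, b):
    """Resultants of two angular momenta: |a - b|, ..., a + b in unit steps."""
    a, b = Fraction(a), Fraction(b)
    lo, hi = abs(a - b), a + b
    return [lo + k for k in range(int(hi - lo) + 1)]


def ls_levels(l1, l2):
    """Total j values when l1, l2 are first combined to l and the spins to s."""
    return Counter(j
                   for l in couple(l1, l2)
                   for s in couple(HALF, HALF)
                   for j in couple(l, s))


def jj_levels(l1, l2):
    """Total j values when each l_i is first combined with its own spin."""
    return Counter(j
                   for j1 in couple(l1, HALF)
                   for j2 in couple(l2, HALF)
                   for j in couple(j1, j2))


if __name__ == "__main__":
    # A d electron and a p electron (l1 = 2, l2 = 1), chosen for illustration.
    ls, jj = ls_levels(2, 1), jj_levels(2, 1)
    print("LS:", {str(k): v for k, v in sorted(ls.items())})
    print("jj:", {str(k): v for k, v in sorted(jj.items())})
    print("same set of j values:", ls == jj)
```

For a d and a p electron both schemes yield twelve levels, with j = 0, 1, 2, 3, 4 occurring 1, 3, 4, 3, 1 times respectively; only the grouping and the relative positions of the levels differ.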


Ex. 132. Sketch the relative positions of the terms arising in the case of two
electrons with $l_1 = 1$ and $l_2 = 0$ for (a) LS coupling, (b) jj coupling.

Ex. 133. The Faraday Effect is the rotation of the plane of polarization of
a beam of light when passed through a dense transparent medium situated in
a magnetic field, the ray being directed along the field. Develop an
approximate theory of this effect by combining the results obtained for the Zeeman
Effect with the dispersion formula of p. 453.


11. The Structure of the Periodic System of the Elements. Pauli’s 
Principle. 

After arriving at the concept that the electrons of the more com- 
plicated atoms are arranged in shells having various quantum numbers, 
it was natural to assume that a new shell is begun with each period, i.e. 
each time we come to an alkali metal. The principal quantum number 
of the valence electron would therefore have the values Li 2, Na 3, 
K 4, Rb 5, Cs 6. Hence the lowest term of Na, which corresponds to 
the normal state, must be given the principal quantum number 3. 
But there is evidence against this, for the effective quantum number
n* obtained from the spectroscopic term value lies in the
neighbourhood of 1.5. There must therefore be a very large Rydberg correction.
That this is really the case is shown by the following considerations:
Let us examine the similar systems Mg+, Al++, Si+3, P+4, S+5,* in
which there is likewise one valence electron and a core consisting of
ten electrons. It is to be expected that as the effective nuclear charge,
which amounts to 2 for Mg+, 3 for Al++, &c., increases, the
perturbations caused by the core will decrease. If we plot the values of
the 1S, 2P and 3D terms as functions of the atomic number, at the
same time dividing each term by $Z^2$ in order to bring them to the same
scale, we see from fig. 5 that both the S and the P terms converge
toward the value of the three-quantum hydrogen level. Thus for the
sodium atom the perturbation (caused by penetration of the orbit into the
core) is really as great as indicated, and the above assignment of principal
quantum numbers for the valence electron of the alkalis is justified.

* Fowler introduced another notation for the spectra of atoms in various stages
of ionization: the “arc”, “spark” and “super-spark” spectra of an element E are
referred to as E I, E II, E III, &c., in this order.
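
The effective quantum number is obtained from a term value T by writing T = R Z²/n*², where Z is the effective nuclear charge seen by the valence electron. The Python sketch below applies this to the ground terms of Na and of the first few ions of the sequence. The binding energies in the table are approximate values supplied for illustration only; they are not taken from the text, and the function name is hypothetical.

```python
import math

RYDBERG_EV = 13.6057  # Rydberg energy in electron volts

# (system, effective nuclear charge, approximate binding energy of the
# valence electron in the ground state, in eV). Illustrative values only.
GROUND_TERMS = [
    ("Na", 1, 5.139),
    ("Mg+", 2, 15.035),
    ("Al++", 3, 28.448),
    ("Si+3", 4, 45.142),
]


def effective_quantum_number(z_eff, binding_ev):
    """n* defined by T = R * Z^2 / n*^2, with T and R in the same units."""
    return z_eff * math.sqrt(RYDBERG_EV / binding_ev)


if __name__ == "__main__":
    for name, z_eff, binding in GROUND_TERMS:
        n_star = effective_quantum_number(z_eff, binding)
        print(f"{name:5s}  Z = {z_eff}   n* = {n_star:.3f}")
```

With these inputs n* rises steadily along the sequence from about 1.6 for Na, moving toward the value 3 of the three-quantum hydrogen level as the nuclear charge increases.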

[Fig. 5: term values for the sequence Na, Mg+, Al++, Si+3]

We follow Bohr in assuming an atom to be built up by adding
electrons successively to a bare nucleus. It is then natural to assume
that the quantum numbers of the orbits of the first, second,
third, &c., electrons are independent of Z, so that a
system containing N electrons would be a model of the atom
of atomic number N.* This rule, suggestive of the Fundamental
Law of Biogenetics, is in general obeyed, but is violated
at certain places in the periodic table, i.e. for certain values of
Z, for which a reconstruction takes place. That is, if we consider
the spectroscopically determined quantum numbers
corresponding to the lowest terms (the normal states) of
the atom, we find that the
electrons in the first row are in s orbits. There is indeed no other
possibility when n = 1, since $k \le n$ or, otherwise expressed, $l \le n - 1$.
In the second row we find, besides s orbits, p orbits corresponding
to the principal quantum number 2. The ground states of the
atoms of the third row, the elements of which correspond both
chemically and spectroscopically to those immediately above them,
contain only s and p orbits, although d orbits are also to be
expected for n = 3. In Row 4 there are certainly 4s orbits for K and
Ca, but the following elements, Sc to Cu, exhibit spectroscopic and
chemical behaviour quite different from that of the elements …
column number) are not observed.

* The model would, of course, be … because of the greater nuclear
charge.

The elements Sc to Ni are
characterized by the occurrence of many valences and by the formation of
complex ions which are highly coloured and are paramagnetic, while 
the ions of all previous elements absorb light only in the far ultra- 
violet and are diamagnetic. For this reason it has long been held that 
the 3d orbits, which do not appear up to this point, are inserted here.
Bohr drew this conclusion from an extrapolation which has since been
confirmed by analysis of the Sc++ spectrum: if the values of the lowest
S, P and D terms for K and Ca+ are plotted as functions of the atomic
number, the extrapolation to Sc++ shows that the largest term
corresponding to the lowest energy state is a D term. Thus from Z = 21
onward, the nineteenth electron is no longer placed in a 4s orbit, but
in a 3d orbit. The unusual properties of the elements in the sequence 
Sc to Ni are attributable to the fact that all their electrons over 18 
belong to an unclosed group. It is only when we get to Cu that we 
again find the essential spectroscopic features of an atom having one 
valence electron, although the occurrence of additional lines and the 
chemical bivalence indicate that the underlying shell is still
vulnerable to outside disturbances. Since Cu has 29 electrons and has one
4s electron, as has K (Z = 19), a total of ten 3d electrons have been
added in the series Sc to Ni. This subsequent filling out of a
provisionally completed shell repeats in the following periods. The group
of four-quantum orbits is completed with Kr, but in analogy with the
other rare gases, this element has only 4s and 4p orbits. Nevertheless,
the construction of the five-quantum electron group already begins
at Rb; the filling in of this group is interrupted at Y, i.e. at the place
corresponding to Sc. Again it is d orbits which are supplied to the
four-quantum group in the series Y to Pd. It must be remembered,
however, that the 4f orbits, which are also possible, have not yet
appeared. Ag, which is definitely univalent, possesses a 5s electron,
as does Rb. The 5s and 5p orbits are again closed at the element Xe
and the valence electron of Cs travels in a 6s orbit. The filling in of
the U group is resumed with La, the third element of the series. At 
this point there begins a series of elements which are chemically very
similar and which have no counterparts in the elements above them.
These are the rare earths. There is every reason to assume that the
insertion of the 4f group, which is not yet present, begins here. The
number of 4f electrons to be supplied can be guessed by extrapolation:
According to what we have seen, the relation between the quantum 
number and the number of electrons in a completed shell is given by 


Principal quantum number n . .    1                   2                   3
Number of electrons  . . . . .    $2 = 2\cdot 1^2$    $8 = 2\cdot 2^2$    $18 = 2\cdot 3^2$

It is to be expected, then, that there will be in all $2\cdot 4^2 = 32$
four-quantum electrons, of which 18 have already been placed, leaving 14
rare earths. This result, found by means of what is in a sense numerical
mysticism, has proved to be of great practical importance: it led to the
discovery of element 72, which was named Hafnium, after Copenhagen,
where Bohr resides. While this element was formerly looked
for among the rare earths, this suggests that it is homologous with Zr,
and, as a matter of fact, the zirconium minerals contain hafnium in
considerable quantity. As in the preceding periods, the filling in of
the 5d group is resumed with Hf and completed at Pt. Au, like Cs, has
again a 6s electron, and in the following elements as far as Rn (radium
emanation) the rare gas configuration, consisting of s and p electrons,
is again built up, so that the seven-quantum electrons begin with the
following row. The subsequent insertion of the 6d electrons is resumed
with Ac. The construction of the five- and six-quantum electron shells is
not completed, since the periodic table stops at element 92. The reason
for this termination is undoubtedly that the heavy nuclei are unstable,
since all elements from 84 onward are radioactive. The “transuranic”
elements 93 to 98 have recently been produced artificially in the United
States. As to the electron shells of these elements, the 5f shell appears to
begin with Ac, so that the atoms from this point onward form a new
“rare earth” group that has been named the actinide series.

ELECTRON CONFIGURATIONS OF THE ELEMENTS

  Z  El.    1s
  1  H        1
  2  He       2

  Z  El.    2s   2p
  3  Li       1
  4  Be       2
  5  B        2    1
  6  C        2    2
  7  N        2    3
  8  O        2    4
  9  F        2    5
 10  Ne       2    6

  Z  El.    3s   3p   3d   4s
 11  Na       1
 12  Mg       2
 13  Al       2    1
 14  Si       2    2
 15  P        2    3
 16  S        2    4
 17  Cl       2    5
 18  A        2    6
 19  K        2    6         1
 20  Ca       2    6         2
 21  Sc       2    6    1    2
 22  Ti       2    6    2    2
 23  V        2    6    3    2
 24  Cr       2    6    5    1
 25  Mn       2    6    5    2
 26  Fe       2    6    6    2
 27  Co       2    6    7    2
 28  Ni       2    6    8    2
 29  Cu       2    6   10    1

  Z  El.    4s   4p   4d   5s
 30  Zn       2
 31  Ga       2    1
 32  Ge       2    2
 33  As       2    3
 34  Se       2    4
 35  Br       2    5
 36  Kr       2    6
 37  Rb       2    6         1
 38  Sr       2    6         2
 39  Y        2    6    1    2
 40  Zr       2    6    2    2
 41  Nb       2    6    4    1
 42  Mo       2    6    5    1
 43  Tc       2    6  (5)  (2)
 44  Ru       2    6    7    1
 45  Rh       2    6    8    1
 46  Pd       2    6   10
 47  Ag       2    6   10    1
 48  Cd       2    6   10    2

  Z  El.    4s   4p   4d   4f   5s   5p   5d   6s
 49  In       2    6   10         2    1
 50  Sn       2    6   10         2    2
 51  Sb       2    6   10         2    3
 52  Te       2    6   10         2    4
 53  I        2    6   10         2    5
 54  Xe       2    6   10         2    6
 55  Cs       2    6   10         2    6         1
 56  Ba       2    6   10         2    6         2
 57  La       2    6   10         2    6    1    2
 58  Ce       2    6   10    2    2    6    1    2
 59  Pr       2    6   10    3    2    6  (1)  (2)
 60  Nd       2    6   10    4    2    6    0    2
 61  Pm       2    6   10    5    2    6  (0)  (2)
 62  Sm       2    6   10    6    2    6    0    2
 63  Eu       2    6   10    7    2    6    0    2
 64  Gd       2    6   10    7    2    6    1    2
 65  Tb       2    6   10    9    2    6    1    2
 66  Dy       2    6   10   10    2    6  (1)  (2)
 67  Ho       2    6   10   11    2    6  (1)  (2)
 68  Er       2    6   10   12    2    6    1    2
 69  Tm       2    6   10   13    2    6    0    2
 70  Yb       2    6   10   14    2    6    0    2
 71  Lu       2    6   10   14    2    6    1    2

  Z  El.    5s   5p   5d   5f   6s   6p   6d   7s
 72  Hf       2    6    2         2
 73  Ta       2    6    3         2
 74  W        2    6    4         2
 75  Re       2    6    5         2
 76  Os       2    6    6         2
 77  Ir       2    6    7         2
 78  Pt       2    6    9         1
 79  Au       2    6   10         1
 80  Hg       2    6   10         2
 81  Tl       2    6   10         2    1
 82  Pb       2    6   10         2    2
 83  Bi       2    6   10         2    3
 84  Po       2    6   10         2    4
 85  At       2    6   10         2    5
 86  Rn       2    6   10         2    6
 87  Fr       2    6   10         2    6         1
 88  Ra       2    6   10         2    6         2
 89  Ac       2    6   10         2    6    1    2
 90  Th       2    6   10         2    6    2    2
 91  Pa       2    6   10  (2)    2    6    1  (2)
 92  U        2    6   10  (3)    2    6    1    2
 93  Np       2    6   10  (4)    2    6  (1)  (2)
 94  Pu       2    6   10  (5)    2    6  (1)  (2)
 95  Am       2    6   10  (6)    2    6  (1)  (2)
 96  Cm       2    6   10  (7)    2    6  (1)  (2)
 97  Bk       2    6   10  (8)    2    6  (1)  (2)
 98  Cf       2    6   10  (9)    2    6  (1)  (2)
 99  E        2    6   10 (10)    2    6  (1)  (2)
100  Fm       2    6   10 (11)    2    6  (1)  (2)
101  Mv       2    6   10 (12)    2    6  (1)  (2)
102

Symbols of the newer elements: Tc Technetium, Pm Promethium, At Astatine,
Fr Francium, Np Neptunium, Pu Plutonium, Am Americium, Cm Curium,
Bk Berkelium, Cf Californium, E Einsteinium, Fm Fermium, Mv Mendelevium.

Closed shells are not repeated in the succeeding parts of the Table. Numbers in
parentheses are conjectural.

The number of electrons of the same quantum number $n$ contained
in a completed shell follows from a simple but far-reaching rule
discovered by Pauli.* If a certain direction is marked by means of an
external magnetic field, the state of a spinning electron is determined
by four quantities: major axis, eccentricity, and orientation of the
plane of the orbit and orientation of the axis of spin. These are
characterized by the quantum numbers $n$, $l = k - 1$, and the projections
$m_l$ and $m_s$. Now the Pauli principle states that each combination of
the four quantum numbers can occur but once in any atom, i.e. in
no atom can there be two electron orbits for which all four quantities
agree. If we remember that $m_s$ can have the values +1/2 and −1/2,
we can dispense with the spin number by saying that in any atom there
are at most two electrons for which $n$, $l$ and $m_l$ agree. For a quantum
number $l = 1$ the projections are $m_l = 1, 0, -1$; for $l = 2$, the possible
projections are 2, 1, 0, −1, −2; in general, there are $2l + 1$ different
values of $m_l$. On account of the fact that the number of
possibilities is doubled by including the spin, there are 2 s electrons, 6
p electrons, 10 d electrons, &c. Since $l$ assumes all values from 0 to
$n - 1$, the number of n-quantum electrons obtained by summation
over all possible values of $l$ is given by

$$N_n = 2 \sum_{l=0}^{n-1} (2l + 1) = 2n^2, \tag{76}$$

i.e. exactly the relation given by what we referred to above as numerical
mysticism!

* W. Pauli, Zeitschr. f. Phys., 31, p. 765 (1925).
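The counting argument that leads to (76) can be checked by direct enumeration. The following Python sketch is offered only as an illustration; the function names `shell_states` and `subshell_capacity` are hypothetical and do not occur in the text. It lists every admissible combination of the four quantum numbers for a given n and compares the count with 2n².

```python
def shell_states(n):
    """Return all (n, l, m_l, m_s) combinations allowed for principal number n."""
    states = []
    for l in range(n):                    # l runs from 0 to n - 1
        for m_l in range(-l, l + 1):      # 2l + 1 orbital projections
            for m_s in (+0.5, -0.5):      # two spin orientations
                states.append((n, l, m_l, m_s))
    return states


def subshell_capacity(l):
    """Number of electrons sharing the same n and l: 2(2l + 1)."""
    return 2 * (2 * l + 1)


if __name__ == "__main__":
    for n in range(1, 5):
        count = len(shell_states(n))
        print(f"n = {n}: {count} states, 2n^2 = {2 * n * n}")
```

Calling `subshell_capacity` for l = 0, 1, 2, 3 reproduces the capacities of the s, p, d and f groups used in the discussion of the periodic system.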

Despite the fact that the Pauli principle offers a simple formula 
for the number of electrons in a completed shell, the problem of the 
periodic system cannot be considered fully solved until it can be 
shown why the construction of the three-quantum orbits, for example, 
is not completed in one step but is resumed at Z = 21. Attempts to 
solve this as a problem in atomic mechanics would lead to the hopeless
task of computing the energy of each configuration. Fermi made
a start toward a solution by treating the electrons in the shells as an
electron gas. They are then subject to the Fermi-Dirac statistics 
(p. 631), and in this way it was found possible to give a very satis- 
factory answer to the above question. 


12. Elements of the Theory of Band Spectra. 

Spectroscopists have distinguished for a long time between line 
spectra , which are due to free atoms, and band spectra , in which a mul- 
titude of lines — sometimes unresolved — present the appearance of a 
grooved or fluted surface. Even before the development of the theory 
it was surmised that band spectra are to be attributed to molecules. 
The theory of these extremely diversified spectra is exceptionally 
complicated and extensive, but offers an unexpected insight into the 
structure of molecules. However, only an outline of the theory can be 
reproduced here. 

The complex nature of band spectra is due to the fact that a mole- 
cule can contain and lose or gain energy not only in the form of elec- 
tronic energy but also as energy of rotation and internal vibration. We 
confine our attention to the theory of spectra of diatomic molecules; 
among emission spectra these occur most frequently. Let us represent 
a diatomic molecule by a dumb-bell model. In addition to the energy 
of the electrons, the energy of this system can consist of rotation 
about an axis normal to the figure axis and oscillation of both nuclei
along the figure axis.

Let us first consider pure rotation spectra, i.e. spectra resulting
when only the state of rotation of the molecule changes. The quantum
states of a rotator of moment of inertia $I$ are obtained as follows.
Since there is but one moment of inertia $I$ normal to the figure axis,

$$E = E_{\mathrm{kin}} = \tfrac{1}{2} I \dot{\varphi}^2, \tag{77}$$

$$p_\varphi = I \dot{\varphi}. \tag{78}$$

If $\dot{\varphi}$ is constant, the quantum condition is

$$\oint p_\varphi \, d\varphi = 2\pi I \dot{\varphi} = mh. \tag{79}$$

Hence the energy is

$$E_m = \frac{m^2 h^2}{8\pi^2 I}. \tag{80}$$



On the classical theory, the rotation of a molecule can result in radia- 
tion only if the molecule is polar, i.e. when the centres of gravity of 
the positive and negative charges do not coincide, in which case the 
molecule has a positive and a negative pole. In this case the system 
represents a combination of two Hertzian oscillators, for which the 
results of p. 341 hold. This is the case, for example, for the halogen 
hydrides, but not for the symmetric molecules N$_2$, O$_2$, &c. According
to the Correspondence Principle, then, pure rotation band spectra
are possible only with polar molecules. It follows, further, from the
uniform simple rotation (absence of overtones) that $m$ can change only
by one unit, so that the transition $m + 1 \to m$ takes place in emission.
For the emitted frequency this gives

$$\nu = \frac{h}{8\pi^2 I}\left[(m + 1)^2 - m^2\right] = \frac{h}{4\pi^2 I}\left(m + \tfrac{1}{2}\right), \tag{81}$$

i.e. there is a sequence of lines which are equidistant on the frequency
scale,* and whose separation is

$$\Delta\nu = \frac{h}{4\pi^2 I}. \tag{82}$$

Substituting this value in (81), we obtain a relation between the
frequencies and their differences which contains only observable
quantities:

$$\nu = \Delta\nu \left(m + \tfrac{1}{2}\right). \tag{83}$$



Remarkably enough, experience shows that instead of this relation,
we must write

$$\nu = \Delta\nu \cdot m. \tag{83'}$$

This form would be obtained by giving $m$ half-integral values, but no
reason for doing so is indicated. The same result is obtained by
replacing $m^2$ by $m(m + 1)$ in the energy formula, for then we have

$$\nu = \frac{h}{8\pi^2 I}\left[(m + 1)(m + 2) - m(m + 1)\right] = \frac{h}{4\pi^2 I}(m + 1), \tag{83''}$$

which means simply an increase of the current number by one, and
thus agrees with equation (83'). If we again reduce the current number
by one and once more call it $m$, we get the final form

$$\nu = \frac{hm}{4\pi^2 I}. \tag{83'''}$$



The newer quantum theory yields precisely this expression for the 
energy of a rotator. Thus we see once more that the theory based on 
models agrees with observation only after introducing a modification. 
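The equidistance of the pure rotation lines may be illustrated numerically. The Python sketch below is merely illustrative: the function names are hypothetical, and the moment of inertia is an assumed value of the order of magnitude found for a hydrogen halide, not a figure taken from the text. It forms the emitted frequencies as differences of the energy levels with the factor m(m + 1) and compares them with whole multiples of the separation h/4π²I.

```python
import math

H = 6.626e-27  # Planck's constant in erg s (CGS units, as used in the text)


def rotational_energy(m, inertia):
    """Energy of the rotator with the m(m + 1) modification, in erg."""
    return H**2 * m * (m + 1) / (8 * math.pi**2 * inertia)


def emission_frequency(m, inertia):
    """Frequency emitted in the transition m -> m - 1, in 1/s."""
    upper = rotational_energy(m, inertia)
    lower = rotational_energy(m - 1, inertia)
    return (upper - lower) / H


def line_spacing(inertia):
    """Constant separation of neighbouring lines, h / (4 pi^2 I)."""
    return H / (4 * math.pi**2 * inertia)


if __name__ == "__main__":
    inertia = 2.7e-40  # g cm^2, assumed illustrative value
    for m in range(1, 6):
        ratio = emission_frequency(m, inertia) / line_spacing(inertia)
        print(f"m = {m}: frequency / spacing = {ratio:.6f}")
```

The printed ratios are the whole numbers 1, 2, 3, ..., which is the content of the empirical relation (83').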

We now go a step further and assume that the state of oscillation 
also experiences a change. The resulting bands are called
rotation-vibration bands. While the pure rotation bands lie in the far infra-red,
in the neighbourhood of 30 μ, the rotation-vibration bands fall in
the near infra-red, in the vicinity of 5 μ. Both types of band are
observed in absorption, but it is more convenient to develop the theory
for emission. The absorption lines coincide with the emission lines,
for on account of the thermal motion the initial levels of a large
number of molecules already contain several rotational
quanta, in contrast with the atoms, where the thermal energy is not
sufficient to provide an appreciable number of excited atoms. The
absorption lines are obtained by interchanging the initial and final
states.

By p. 94, the quantum states of a harmonic oscillator in which
a mass $M$ vibrates about a position of equilibrium are given by

$$E_n = n h \nu_0. \tag{84}$$

In our model the nuclei vibrate in opposite phase, so that the centre
of gravity remains at rest. The calculation of the characteristic
vibration of this system is a problem quite similar to that on p. 651, where
the motion of the atomic nucleus was taken into account. Here, too,
we find (cf. Ex. 43, p. 110) that it is only necessary to replace the mass
of the particle by the reduced mass $\mu$, where

$$\frac{1}{\mu} = \frac{1}{M_1} + \frac{1}{M_2}, \tag{85}$$

$M_1$ and $M_2$ being the masses of the two nuclei.
Hence the energy states of a molecule which are due to oscillation and
rotation are

$$E_{n,m} = n h \nu_0 + \frac{h^2 m(m + 1)}{8\pi^2 I}. \tag{86}$$



As long as the binding force on the nuclei is strictly harmonic, the
Correspondence Principle again gives $\Delta n = \pm 1$ as the selection rule
for $n$. Nevertheless, on account of the fact that the restoring force is
not perfectly harmonic, higher quantum transitions also occur; these
are referred to as overtone bands. The frequencies of the lines of the
fundamental band are, according to (83'''),

$$\nu = \nu_0 \pm \frac{hm}{4\pi^2 I}. \tag{87}$$

Here the sign of the rotational part may be either positive or negative.
Since the oscillational contribution is by far the greater (it displaces
the system toward the near infra-red), it may happen, in emission,
that a transition $n + 1 \to n$ occurs together with a rotational change
$m - 1 \to m$ without any change in sign of the transformed energy.
The lines thus appear on both sides of the null line $\nu_0$, which is itself
absent. The quantum mechanical treatment of the oscillator also leads
to half quantum numbers $n$, but this does not alter (87).

If, in addition to the energy transformations already considered, 
the state of the electron shells of the molecule also changes, the emitted 
radiation falls in the visible region or in the ultra-violet. This repre- 
sents the usual band spectra which have the appearance of illuminated 
columns. The essentially new feature contributed by the electron
transition is a change in the moment of inertia and in the interatomic
bond by penetration of an electron into the electron shell which is
responsible for the binding force between the atoms. We must therefore
distinguish between the moment of inertia in the initial state and
that in the final state, and also between frequencies of oscillation in
the two states. Denoting the electronic energy by $E_{el}$, the energies
of these two states are

$$\begin{aligned}
E_i &= E_{el}^{\,i} + n_i h \nu_i + \frac{h^2 m_i (m_i + 1)}{8\pi^2 I_i}, \\
E_f &= E_{el}^{\,f} + n_f h \nu_f + \frac{h^2 m_f (m_f + 1)}{8\pi^2 I_f}.
\end{aligned} \tag{88}$$

These constitute the quantum-mechanical expressions for the energy
states, in so far as the electron shells possess no resultant spin. If the
latter does not vanish, the total spin $j$ of the molecule is an integral
multiple of $h/2\pi$ and, according to the kind of coupling of the
individual spin vectors, there will be many small differences which cannot
be entered into here. In order to keep the formulas manageable in
such cases, we make use of the fact, mentioned above, that except for
a small change in the constant term we get the same result by replacing
the product $x(x + 1)$ by the pure square $(x + \tfrac{1}{2})^2$. Accordingly
we write

$$\begin{aligned}
E_i &= E_{el}^{\,i} + n_i h \nu_i + \frac{h^2}{8\pi^2 I_i}\left(j_i + \tfrac{1}{2}\right)^2, \\
E_f &= E_{el}^{\,f} + n_f h \nu_f + \frac{h^2}{8\pi^2 I_f}\left(j_f + \tfrac{1}{2}\right)^2.
\end{aligned} \tag{88'}$$

The Correspondence Principle states that transitions to $j_f = j$ originate
from $j_i = j + 1$ and $j_i = j - 1$. However, in case of a non-vanishing
electron spin the transition $j \to j$ is also permitted. If we include
this case, the following formulae result for the three branches of
a band:

$$\begin{aligned}
\text{R- (positive) Branch:}\quad \nu &= A + 2B(j + 1) + C(j + 1)^2, \\
\text{P- (negative) Branch:}\quad \nu &= A - 2Bj + Cj^2, \\
\text{Q- (zero) Branch:}\quad \nu &= A + Cj + Cj^2,
\end{aligned} \tag{89}$$

where the abbreviations A, B and C are defined as follows:

$$A = \nu_{el} + n_i \nu_i - n_f \nu_f + \frac{C}{4}, \qquad
2B = \frac{h}{8\pi^2}\left(\frac{1}{I_i} + \frac{1}{I_f}\right), \qquad
C = \frac{h}{8\pi^2}\left(\frac{1}{I_i} - \frac{1}{I_f}\right). \tag{90}$$





As mentioned, the Q-Branch is absent if the spin of the electron shells 
is zero. The designations R , P and Q are arbitrary and have an historic 
origin. 

The positions of individual lines are represented very vividly by 
a diagram due to Fortrat (fig. 6). In a rectangular system of co- 
ordinates take v as abscissa and m as ordinate. For the present assume 
m to be continuously variable. Then we obtain three parabolic arcs 
for the three branches. But only the points whose ordinates are whole 
numbers correspond to spectral lines. We must therefore cut these 
curves by the lines m = 1, 2, 3, . . . and project the points of inter- 
section on to the v-axis. It is then easy to see the way the head of the 
band originates; it is a point toward which the lines crowd together, 
but is not a mathematical accumulation point like a series limit. This 
representation also suggests why a band has the superficial appearance 
of being a spatial configuration. 
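The construction of the Fortrat diagram lends itself to a short numerical illustration. The Python sketch below is only a hedged example: the function names are hypothetical, and the numerical constants used in the demonstration are arbitrary assumed values in arbitrary frequency units. It evaluates the three branch formulae, compares them with the frequencies obtained directly as differences of energy levels with the factor (j + 1/2)², and locates the vertex of the parabola near which the lines crowd together to form the head of the band.

```python
import math


def band_constants(nu0, a_i, a_f):
    """Return (A, B, C), where nu0 = nu_el + n_i nu_i - n_f nu_f and
    a_i = h / (8 pi^2 I_i), a_f = h / (8 pi^2 I_f)."""
    c = a_i - a_f
    b = 0.5 * (a_i + a_f)
    a = nu0 + 0.25 * c
    return a, b, c


def line_from_levels(nu0, a_i, a_f, j_i, j_f):
    """Frequency of the transition j_i -> j_f taken directly from the levels."""
    return nu0 + a_i * (j_i + 0.5) ** 2 - a_f * (j_f + 0.5) ** 2


def r_branch(a, b, c, j):
    return a + 2 * b * (j + 1) + c * (j + 1) ** 2


def p_branch(a, b, c, j):
    return a - 2 * b * j + c * j ** 2


def q_branch(a, b, c, j):
    return a + c * j + c * j ** 2


def head_position(b, c):
    """Branch that turns back, and the ordinate of the vertex of its parabola.

    For c < 0 the R branch turns at j + 1 = -b / c; for c > 0 the P branch
    turns at j = b / c. The head lies at the whole numbers nearest the vertex.
    """
    if c < 0:
        return "R", -b / c
    return "P", b / c


if __name__ == "__main__":
    nu0, a_i, a_f = 1000.0, 1.8, 2.0   # assumed illustrative values
    a, b, c = band_constants(nu0, a_i, a_f)
    print("head:", head_position(b, c))
    for j in range(0, 14):
        print(j, round(r_branch(a, b, c, j), 3), round(p_branch(a, b, c, j), 3))
```

With the assumed constants the R branch advances toward higher frequencies, reaches its greatest value near the vertex and then turns back, which is the behaviour described for the head of the band.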

If we now consider the bands which belong to various values of
$n_i$ and $n_f$, we must remember that the harmonic binding is merely
a first approximation to the power series development of the actual
law of force. As a result, the selection rule for $n$ is not obeyed. The
change in the energy levels caused by the higher powers in the law
of force will not be taken into account here. The contribution of the
change in the state of oscillation then becomes

$$\nu_{osc} = n_i \nu_i - n_f \nu_f = (n_i - n_f)\,\nu_i + n_f(\nu_i - \nu_f). \tag{91}$$

The second form yields a subdivision of the doubly infinite manifold if
the oscillation frequency is not changed very much by the electron
transition. In that case the second term is small compared with the
first and we can collect all partial bands corresponding to the same
change of $n$ into one band group, while the entire band system is
obtained by admitting all possible values of $(n_i - n_f)$. All lines belonging
to the same band system correspond to the same electron transition.
Thus the event which gives rise to a single line in atomic spectra is
the cause of a band system consisting of many thousands of lines.
However, in this connexion it must be remembered that only one
of the many lines is emitted or absorbed as a quantum $h\nu$ in each
elementary process. If the electronic states of molecules were as
numerous as those of atoms, we would obtain a maze of many band
systems. Fortunately the number of possible energy states of the
electron shells of molecules is, evidently, considerably smaller (except
for H$_2$ and He$_2$), so that a molecule generally possesses only a few
band systems. The reason is that more violent disturbances of the
electron shell which effects the bond lead to the decomposition of the
molecule.

Fig. 6



CHAPTER XL 


Critical Atomic Theory: The New Quantum Mechanics 


1. The Wave and Particle Aspects of Light. Compton Effect and Raman 
Effect. 

Although the phenomena of interference appear to prove that 
light is of the nature of waves, we have always treated emission and 
absorption as though the light consisted of individual energy centres 
(quanta) of amount hv which could not be subdivided. This quantum- 
like energy transfer has been rigorously demonstrated by an enormous 
number of experiments. Attempts to introduce this process into the 
wave picture by assuming that the atom could store up the electro- 
magnetic energy incident on it until the quantity hv is reached, 
lead to quite impossible accumulation times for X-rays, so that 
this procedure was soon abandoned. Attempts to give the quanta 
themselves such properties that interference phenomena would result 
without waves proved equally unfruitful. There remains no other 
course but to look upon waves and quanta as two observable aspects 
of a single phenomenon whose true nature cannot be described in 
terms of any mechanical model. A crude representation of this dual 
property of light by assuming certain points on the wave front to be 
the seat of the energy also proved impossible. In no way can we 
spatially localize the quanta in a light wave. The true source of these 
difficulties will become apparent in the following section. We shall 
begin with the discussion of a phenomenon which more than any
other brings into prominence the corpuscular nature of light, and
which was therefore considered for some time to be the experimentum
crucis for the quantum nature of light. This is the Compton
effect.*

* A. H. Compton, Phys. Rev., 21, p. 483 (1923); Phil. Mag., 46, p. 897 (1923). The
theory was given almost at the same time by P. Debye, Physikal. Zeitschr., 24, p. 161
(1923).

If monochromatic X-radiation is allowed to fall on a scattering
substance (e.g. a block of paraffin) and if the laterally scattered
radiation is examined spectroscopically, the original line is found to
have a companion on its long-wave side, the separation of the two
depending upon the angle between the primary ray and the direction
of scattering (direction of observation). The position of the secondary
line can be computed in a frankly naive way by considering the
scattering process to be the collision of a light quantum (or photon)
with an electron. The energy delivered to the electron according to
the elementary laws of impact must result in a decrease of energy,
and hence a decrease in frequency, of the primary ray. We may look
upon the electron as being free if it is one of the outer electrons of an
atom, for in the X-ray region the quanta have energies of several
tens of thousands of electron volts, while the work of removal of an
outer electron is of the order of but a few electron volts. This, in fact,
is the reason that the phenomenon is observed only in the
X-ray region. We now write the equations of conservation
of energy and conservation of momentum for this collision
process. The latter we shall decompose into its component
equations for the direction of the primary ray
and the direction normal to it. Denote by $\nu'$ the frequency
of the scattered light, by $m$ the mass of the electron, by $\varphi$ the
angle between the primary ray and the scattered ray, and by $\vartheta$ the
angle between the primary ray and the velocity vector of the recoil
electron (cf. fig. 1). By the principle of the inertia of energy
(p. 258), the “mass” of the photon is

$$m_Q = \frac{h\nu}{c^2}, \tag{1}$$

so that its momentum is

$$m_Q c = \frac{h\nu}{c}. \tag{2}$$

We then have the energy equation

$$h\nu = h\nu' + \tfrac{1}{2} m v^2, \tag{3}$$

and the two component momentum equations

$$\begin{aligned}
\frac{h\nu}{c} &= \frac{h\nu'}{c}\cos\varphi + mv\cos\vartheta, \\
0 &= \frac{h\nu'}{c}\sin\varphi - mv\sin\vartheta.
\end{aligned} \tag{4}$$

Since we are interested only in the frequency $\nu'$ of the scattered light
as a function of the angle of scattering $\varphi$, we eliminate the recoil
angle $\vartheta$ from the two momentum equations and obtain

$$m^2 v^2 = \frac{h^2}{c^2}\left(\nu^2 + \nu'^2 - 2\nu\nu'\cos\varphi\right). \tag{5}$$


For light of not too short wave-length we have $\nu' \approx \nu$ (this
approximation is not justified for γ-rays), so that we may replace $\nu'$
by $\nu$ in the term $mv^2/2$ of the energy equation. From (5) we then have

$$h\nu = h\nu' + \frac{h^2\nu^2}{mc^2}(1 - \cos\varphi), \tag{6}$$

whence

$$\Delta\nu = \nu - \nu' = \frac{2h\nu^2}{mc^2}\sin^2\frac{\varphi}{2}. \tag{7}$$

This equation may be put in more elegant form by introducing the
change in wave-length

$$\Delta\lambda = \frac{c\,\Delta\nu}{\nu^2}$$

in place of the frequency change. Then there results

$$\Delta\lambda = \frac{2h}{mc}\sin^2\frac{\varphi}{2} = 2\lambda_0 \sin^2\frac{\varphi}{2}. \tag{8}$$

The wave-length $\lambda_0 = h/mc$ introduced in this way is the wave-length of
radiation whose photon has an inertial mass equal to that of the
electron, for then we have

$$m_Q = \frac{hc}{\lambda_0 c^2} = \frac{hmc}{ch} = m.$$

The numerical value of $\lambda_0$ is $24 \times 10^{-11}$ cm., so that this wave-length
falls in the γ-ray region. Despite the fact that we neglected the
relativistic variability of mass in deducing equation (7), it is strictly
correct, for exactly the same final result is obtained when this effect is
taken into consideration.
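Both the numerical value of the characteristic wave-length and the statement that the wave-length formula remains valid when the relativistic variability of mass is taken into account can be checked by a short computation. The Python sketch below is illustrative only; the function names are hypothetical and the constants are rounded modern CGS values, which are assumptions and not figures from the text. The second function inserts the scattered wave-length given by the formula into the relativistic energy balance, using the recoil momentum that follows from conservation of momentum, and returns the relative discrepancy.

```python
import math

H = 6.626e-27      # Planck's constant, erg s (assumed rounded value)
M_E = 9.109e-28    # rest mass of the electron, g (assumed rounded value)
C = 2.998e10       # velocity of light, cm/s (assumed rounded value)

LAMBDA_0 = H / (M_E * C)   # characteristic wave-length h / (m c), in cm


def compton_shift(phi):
    """Change in wave-length, in cm, for the scattering angle phi (radians)."""
    return 2.0 * LAMBDA_0 * math.sin(phi / 2.0) ** 2


def relativistic_energy_residual(wavelength, phi):
    """Relative violation of the relativistic energy balance when the
    scattered wave-length is taken as wavelength + compton_shift(phi)."""
    scattered = wavelength + compton_shift(phi)
    p_in, p_out = H / wavelength, H / scattered     # photon momenta
    # square of the recoil momentum from conservation of momentum
    p_e_sq = p_in**2 + p_out**2 - 2.0 * p_in * p_out * math.cos(phi)
    e_before = p_in * C + M_E * C**2
    e_after = p_out * C + math.sqrt(p_e_sq * C**2 + (M_E * C**2) ** 2)
    return (e_before - e_after) / e_before


if __name__ == "__main__":
    print(f"lambda_0 = {LAMBDA_0:.4e} cm")
    for degrees in (45, 90, 135, 180):
        phi = math.radians(degrees)
        print(degrees, f"{compton_shift(phi):.4e} cm",
              f"{relativistic_energy_residual(7.1e-9, phi):.1e}")
```

The residual vanishes to within rounding error for every angle and every primary wave-length, X-rays and γ-rays alike, in agreement with the remark that the final result is strictly correct.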

In examining the scattering of visible light by liquids or gases,
Raman* discovered an effect which is superficially similar to the
Compton effect, but is fundamentally different from the theoretical
standpoint. Here again displaced lines are found near the primary
line, but their displacement does not depend on the direction of
scattering, as it does in the Compton effect. This phenomenon,
too, may be interpreted, in its larger features, as a division of
the photon into smaller quanta in the scattering process. Part of
the energy of the primary quantum is given to the scattering molecule
as energy of vibration or of rotation, or else some of the molecular
energy is added to the primary photon. The equation of energy is

$$h\nu = h\nu' + n h \nu_0, \tag{9}$$

where $\nu_0$ must be one of the characteristic frequencies of the molecule,
as determined from its band spectra. The details of this phenomenon
require a more exact theory, the basic features of which will be given
in Chap. XLI, § 3 (p. 716).

* C. V. Raman, Indian Journ. Phys., 2, p. 387 (1928); Nature, 121, p. 501 (1928).

2. The Inexactness of Atomic Observations. 


Heisenberg was the first to recognize the fact that the motions of 
electrons in the atom are, as a matter of essential principle, incapable 
of being verified experimentally. In celestial mechanics we can check 
our calculations by computing the future configuration of the heavenly 
bodies from their instantaneous positions and velocities and then 
comparing the results with observation. For the electron system of 
an atom, however, the rigorous determinism of classical mechanics is 
lacking — we cannot ascertain the necessary initial conditions. This is 
inherently impossible, quite apart from technical difficulties, the 
reason being that any means of observation by which atomic pro- 
cesses are conveyed to our perceptions causes a disturbance of the 
electron system of the atom. 

If we allow our imagination free rein and disregard all technical
difficulties, the first thing we might try is to determine the
instantaneous positions of the electrons with the aid of a microscope. The
limit of resolution of this instrument is given by $\lambda/2A$, where $A$ is the
numerical aperture, which we take equal to unity, i.e. we take the
angular opening of the incident pencil to be $\pi/2$. If we now demand
that the position of the electron is to be determined to within at least
1 per cent of the radius of the atom ($10^{-8}$ cm.), we must employ
light whose wave-length is of the order of $10^{-11}$ cm., i.e. 1 X.U. The
relative change in wave-length caused by the Compton effect is
extremely large for such short waves, as one sees from the fact that for
the characteristic wave-length 24 X.U. and a scattering angle of $\pi/2$
we already have $\Delta\lambda = \lambda$. From (5) the momentum of the recoil
electron is approximately

$$mv = \frac{2h\nu}{c}\sin\frac{\varphi}{2}. \tag{10}$$

This is the change in momentum of the electron resulting from its
observation by means of γ-rays. Since we assume a range in angle
of from 0 to $\pi/2$, corresponding to the aperture of the microscope, the
momentum is determined only to the quantity

$$\Delta p = \Delta(mv) \approx \frac{h\nu}{c} = \frac{h}{\lambda}, \tag{11}$$

putting $\nu = c/\lambda$,
since all the light contributing to the image must be scattered by the
object and so must participate in the Compton effect. We may measure
the velocity by analysing the scattered light spectrally, determining
the velocity of the scattering particle from the change in wave-length
caused by the Doppler effect. But we do not know how to take into
account the velocity change due to the Compton effect. The original
velocity might have been changed by the act of observation, in which
case the error of observation lies in the range $\Delta p$. If we use longer
waves in order to reduce the Compton effect we make the determination
of position uncertain. On account of diffraction, the error $\Delta q$ in the
position determination is of the order of magnitude of $\lambda$, hence the
product of the error in the position co-ordinate by that in the
momentum co-ordinate is

$$\Delta p \, \Delta q \approx h. \tag{12}$$


This equation represents one of the most important principles in 
theoretical physics; it shows us the limits of applicability of mechani- 
cal causality. It has pointed out new paths in atomic physics: no 
longer is it a task of the theory to disclose subatomic processes; rather, 
the theory must — without considering phenomena which are inacces- 
sible to observation — create a mechanics which reproduces the 
observable quantities — the spectral lines and the energy levels — from 
a model, of which we are required to know only the component parts 
and the acting forces but not the mechanical constants of integration. 
From this point of view we can understand, for example, why attempts
to explain chemical bonds by introducing phase relationships between
the electron orbits of the constituents were unsuccessful.

The indeterminacy relation, which holds for any pair of canonically
conjugate variables, becomes of especial importance when applied
to energy and time:

$$\Delta E \cdot \Delta t \approx h. \tag{13}$$

Since the quantum energy states are usually thought of as being
absolutely sharp, this implies that we can say nothing regarding the
time an atom remains in such a state. Actually, it is found that after
eliminating all broadening influences like the Doppler effect, a spectral
line still has a finite width, so that the energy levels must be considered
to have a finite, although very small, breadth. From this we deduce
a $\Delta t$ of the order of $10^{-8}$ sec., which is interpreted as the
time the state persists.

These considerations throw new light on the failure of all
attempts to treat theoretically the process of emission itself on the
basis of the concepts developed in the preceding chapter. Our ordinary
ideas would lead us to require of such a theory that it should furnish
an equation which gives the amount of energy radiated at each instant,
i.e. $E = f(t)$. However, such a relation could not be tested
experimentally. If, for example, we divide the time of emission, which is
empirically known to be of the order of $10^{-8}$ sec., into $10^7$ intervals
(a continuous function must permit of any arbitrarily small division
of the variables) and if we require that it be possible to give the
energy content of the atom for each of these intervals, we find that
we are asking the impossible, for the indeterminacy relation gives for
the error in the energy

$$\Delta E = \frac{6.6 \times 10^{-27}}{10^{-15}} = 6.6 \times 10^{-12} \text{ erg}.$$


But the work of excitation of, say, the line λ2537 is about 5 electron
volts, or

$$\frac{5 \times 4.8 \times 10^{-10}}{300} = 8 \times 10^{-12} \text{ erg}.$$


Comparison of the results shows that the error in the energy is about 
as large as the entire amount of energy available. Parenthetically, it 
might be remarked that for the radiation from a macroscopic antenna, 
on the other hand, neither the error in the time nor that in the energy 
is important, on account of the magnitude of these quantities, so that 
the description of the process of emission by means of a Hertzian 
oscillator adequately represents the facts. 
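The arithmetic of this comparison is easily repeated. The short Python sketch below is an illustration only; the function names are hypothetical, while the numerical constants are the rounded values quoted in the text (h = 6.6 × 10⁻²⁷ erg sec., electronic charge 4.8 × 10⁻¹⁰ e.s.u., and 300 volts to one electrostatic unit of potential).

```python
H = 6.6e-27                 # Planck's constant in erg s, value used in the text
E_CHARGE_ESU = 4.8e-10      # electronic charge in e.s.u., value used in the text
VOLTS_PER_ESU = 300.0       # one electrostatic unit of potential in volts


def energy_uncertainty(delta_t):
    """Error in the energy from the indeterminacy relation, h / delta_t, in erg."""
    return H / delta_t


def electron_volts_to_erg(volts):
    """Energy in erg gained by an electron falling through `volts`."""
    return volts * E_CHARGE_ESU / VOLTS_PER_ESU


if __name__ == "__main__":
    delta_t = 1e-8 / 1e7    # emission time divided into 10^7 intervals
    print(f"error in the energy : {energy_uncertainty(delta_t):.2e} erg")
    print(f"work of excitation  : {electron_volts_to_erg(5.0):.2e} erg")
```

The two printed quantities are of the same order of magnitude, which is the point of the argument: the energy content of the atom cannot be followed through intervals so short.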

If we extend the indeterminacy relation, which was originally
proved for electrons, to all the elementary particles (electrons, nuclei,
photons) for which analogous hypothetical experiments can be devised,
we see at once why it is experimentally impossible to localize the
quantum in a monochromatic wave train of frequency $\nu$. In an
infinitely long wave train (in this case only is the wave
monochromatic) the momentum $h\nu/c$ of the photons can be given exactly
by means of spectroscopic determination of the wave-length, but this
makes the position co-ordinates completely indeterminate. On the
other hand, if the time at which a photon passes a given point in space
is confined between certain limits by using a wave train of finite length,
the light is then no longer monochromatic, for it can be represented
by a band of infinitely long trains of Fourier waves (cf. p. 57). In
the curve which gives the intensity as a function of the frequency,
the distance between the two frequencies for which the intensity has
half its maximum value is given by

$$\Delta\nu \approx \frac{1}{\Delta t}. \tag{14}$$

If we now use the $h\nu$ law to connect the frequency with the energy
$E$ of the photons we obtain the indeterminacy relation (13) from the
equation (14), which is derived solely from considerations of wave
kinematics. Thus the light waves accomplish for the photons exactly
what we need to confirm the Heisenberg indeterminacy. It then
becomes quite natural to try to find a similar correlation of waves
with the motion of electrons. The idea of “matter waves” was first
propounded by L. de Broglie. This will be considered in detail in the
following sections.


3. Matter Waves. 

For light, the paths of the photons — the rays — are the orthogonal 
trajectories of the wave surfaces. As long as all apertures and screens 
are large compared with the wave-length of the light we can neglect 
diffraction phenomena and can compute the paths of the rays by 
means of Fermat’s Principle

$$\delta \int_{P_1}^{P_2} n \, ds = \delta \int_{P_1}^{P_2} \frac{c}{u}\, ds = 0 \tag{15}$$

($u$ is the phase velocity) if we have given the index of refraction as a
function of position. But as soon as we pass to objects whose size is 
of the order of magnitude of a light wave, simple geometrical optics is 
no longer applicable and the phenomena connected with diffraction 
can no longer be described in terms of light rays. We must in fact 
consider the wave disturbance which exists at a given surface and 
compute the subsequent spread of this disturbance (cf. p. 376). 

On the other hand, a similar extremal principle — that of Hamilton 
(p. 123) — holds for the paths of particles which obey the classical 
mechanics. The analogy with optics becomes particularly evident if 
we restrict the comparison orbits to those of the same total energy. 
This is equivalent to passing from Hamilton’s Principle to that of 
Maupertuis. By Hamilton’s Principle we have for a particle,

$$0 = \delta \int_{t_1}^{t_2} (2T - E)\, dt = \delta \int_{t_1}^{t_2} 2T \, dt = \delta \int_{t_1}^{t_2} m v^2 \, dt.$$

We can now introduce the length of path as the variable of integration
in place of the time:

$$0 = \delta \int_{t_1}^{t_2} m v \frac{ds}{dt}\, dt = \delta \int_{P_1}^{P_2} m v \, ds,$$

or since $m$ is constant,

$$\delta \int_{P_1}^{P_2} v \, ds = 0. \tag{16}$$

Comparing (15) and (16), we see that if we wish to associate waves
with particles moving according to the laws of mechanics in the same
way as light waves are associated with light rays, we must set the phase
velocity $u$ proportional to $1/v$ in order that Fermat’s Principle shall
be valid for the matter waves. Again, if we denote the potential energy
by $U$, we have

$$v = \sqrt{\frac{2}{m}(E - U)}, \tag{17}$$

i.e.

$$u = \frac{C}{\sqrt{E - U}}. \tag{18}$$



If we speak of a wave function we must introduce a frequency. It is
natural to do this by means of the old $h\nu$ relation:

$$E = h\nu; \qquad u = \frac{C}{\sqrt{h\nu - U}}. \tag{19}$$

This means that the phase velocity depends upon $\nu$; hence there
exists dispersion of our tentatively introduced waves.

Now in a dispersive medium it is necessary to distinguish between 
the phase velocity u and the group velocity u g . The latter is the 
velocity which we can measure directly by means of a recording 
mechanism, while the individual extrema in a wave train of constant 
amplitude are not distinguishable. For a material particle, on the 
other hand, we can, of course, measure the actual velocity. Hence 
we demand that the group velocity of the waves associated with a 
particle shall coincide with the velocity of the particle. Using the
relation between group velocity and phase velocity given on p. 66,
and making $u_g = v$, we have

$$\frac{1}{u_g} = \frac{d}{d\nu}\left(\frac{\nu}{u}\right) = \frac{1}{v} = \sqrt{\frac{m}{2(E - U)}}.$$

Employing the $h\nu$ law and integrating,

$$\frac{\nu}{u} = \frac{1}{h}\sqrt{2m(h\nu - U)}. \tag{20}$$

We can therefore state a dispersion law which fulfils the requirements
$1/u \propto v$ and $u_g = v$ if we replace the constant $C$ in equation (18) by
$h\nu/\sqrt{2m}$.* We then have

$$u = \frac{h\nu}{\sqrt{2m(h\nu - U)}}. \tag{21}$$

* In equation (18), $C$ can still contain $E$ and hence $h\nu$, since all comparison paths
are assumed to have the same energy.

The wave-length of the matter waves is given by

$$\lambda = \frac{u}{\nu} = \frac{h}{\sqrt{2m(h\nu - U)}} = \frac{h}{\sqrt{2m(E - U)}}. \tag{22}$$

In a region free of force,

$$\lambda_0 = \frac{h}{\sqrt{2mE}} = \frac{h}{mv}. \tag{22'}$$

Passing from a region where $U = 0$ to one where the potential $U$
exists corresponds to entering a region whose index of refraction is

$$n = \frac{\lambda_0}{\lambda} = \sqrt{\frac{E - U}{E}}. \tag{23}$$


If U is a function of position, we have a medium of variable index of 
refraction in which the orthogonal trajectories of the wave surfaces — 
i.e. the orbits — are curved. 

If, for example, we take U to be the potential of gravity and con- 
struct, according to geometrical optics, the rays of equal E passing 
through a fixed point, we obtain all parabolic trajectories having the 
same initial velocity. 
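That the dispersion law for the matter waves does make the group velocity equal to the velocity of the particle can be verified numerically. The Python sketch below is a hedged illustration: the function names, the rounded constants, and the particular energy and potential chosen in the demonstration are all assumptions. It forms ν/u from the phase velocity u = hν/√(2m(hν − U)), differentiates it with respect to ν by a central difference, and compares the reciprocal with v = √(2(hν − U)/m).

```python
import math

H = 6.626e-27    # Planck's constant, erg s (assumed rounded value)
M_E = 9.109e-28  # mass of the electron, g (assumed rounded value)


def phase_velocity(nu, potential, mass=M_E):
    """Phase velocity u = h nu / sqrt(2 m (h nu - U))."""
    return H * nu / math.sqrt(2.0 * mass * (H * nu - potential))


def particle_velocity(nu, potential, mass=M_E):
    """Velocity of the particle, v = sqrt(2 (E - U) / m) with E = h nu."""
    return math.sqrt(2.0 * (H * nu - potential) / mass)


def group_velocity(nu, potential, mass=M_E, rel_step=1e-6):
    """Group velocity from 1/u_g = d(nu/u)/d(nu), by a central difference."""
    d_nu = nu * rel_step

    def reciprocal_wavelength(x):
        return x / phase_velocity(x, potential, mass)

    slope = (reciprocal_wavelength(nu + d_nu)
             - reciprocal_wavelength(nu - d_nu)) / (2.0 * d_nu)
    return 1.0 / slope


if __name__ == "__main__":
    energy = 1.6e-10            # erg, roughly 100 electron volts (assumed)
    potential = 0.3 * energy    # assumed potential energy
    nu = energy / H
    print("particle velocity:", particle_velocity(nu, potential))
    print("group velocity   :", group_velocity(nu, potential))
    print("phase velocity   :", phase_velocity(nu, potential))
```

The phase velocity differs from the velocity of the particle, but the product of the two is hν/m whatever the value of U, and the group velocity agrees with the velocity of the particle to within the accuracy of the numerical differentiation.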

Up to this point the analogy between the mechanics of a particle 
and geometrical optics may seem to be mere idle speculation. If it 
is to be more than that, it must be possible to detect phenomena 
which justify the introduction of associated wave fields — i.e. inter- 
ference and diffraction phenomena — for particles, as was done in 
optics to prove the wave nature of light. If the "de Broglie
wave-length" of an electron which has traversed a difference of potential
$V$ is calculated by (22'), we obtain the simple relation

$$\lambda = \sqrt{\frac{150}{V}}, \qquad (24)$$

where $V$ is measured in volts and $\lambda$ is given in Å.U. This shows that
electrons whose velocity is several hundred or several thousand volts 
are associated with wave-lengths corresponding to those of X-rays. 
Hence we may expect the same diffraction and interference pheno- 
mena as for X-rays; in fact, Davisson, Kunsman and Germer had 
previously found selective maxima when a beam of electrons was 
reflected from the surface of a nickel crystal at given glancing angles 
corresponding to those of X-rays of the same wave-length.* The
detection of the "matter waves" is even simpler by the Debye-Scherrer
method. It is necessary only to send a narrow ray of electrons
of uniform velocity through a thin metal foil; diffraction by the
variously oriented crystals gives a ring diagram which is as perfect as
any obtained by means of X-rays. At those places where the wave
computation indicates destructive interference no electrons are found,
while at maxima we observe a large number of electrons.

* C. Davisson and L. Germer, Nature, 119, p. 558 (1927); Phys. Rev., 30, p. 705
(1927).
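The order of magnitude of these wave-lengths is easily checked. The sketch below (function names are illustrative; modern values of the constants are assumed, and relativistic corrections are neglected) evaluates $h/\sqrt{2m_0 eV}$ for an electron accelerated through $V$ volts and compares it with the rule of thumb $\sqrt{150/V}$ in ångström units.

```python
import math

H = 6.62607015e-34          # Planck constant, J s
M_E = 9.1093837015e-31      # electron mass, kg
E_CHARGE = 1.602176634e-19  # elementary charge, C

def electron_wavelength_angstrom(volts):
    """De Broglie wave-length h / sqrt(2 m e V), non-relativistic, in angstroms."""
    momentum = math.sqrt(2.0 * M_E * E_CHARGE * volts)
    return H / momentum * 1e10

def rule_of_thumb_angstrom(volts):
    """Approximate relation sqrt(150 / V), V in volts."""
    return math.sqrt(150.0 / volts)

for volts in (150.0, 1000.0, 10000.0):
    print(volts, electron_wavelength_angstrom(volts), rule_of_thumb_angstrom(volts))
```

For a few hundred to a few thousand volts the result is a fraction of an ångström to about one ångström, i.e. the range of X-ray wave-lengths.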

Stern has even observed matter wave interference by reflecting a 
beam of helium atoms from the surface of a crystal.* According to 
equation (22'), the greater mass in this case yields a wave-length 7300 
times as short, for the same velocity. But this is compensated by the 
fact that the velocity (of thermal agitation) is much smaller in this 
instance, so that the de Broglie wave-lengths again fall in the region 
which is readily measured by means of crystals. 

If we wish to account for the more detailed phenomena of dif- 
fraction in optics, we must turn to the wave differential equation. 
Similarly, in atomic mechanics, where we also deal with quantities
comparable with the wave-length, we must use an analogous wave equation

$$\Delta\Psi = \frac{1}{u^2}\frac{\partial^2\Psi}{\partial t^2}, \qquad (25)$$

where

$$u = \frac{h\nu}{\sqrt{2m(h\nu - U)}} = \frac{E}{\sqrt{2m(E - U)}}. \qquad (26)$$

The meaning of the quantity $\Psi$ which satisfies the wave equation is
still undetermined. We require of $\Psi$ only that its square (or, if it is
complex, its norm $\Psi\bar{\Psi}$) be a measure of the intensity of, for example,
an electron beam, just as in any optical theory the square of the
quantity which satisfies the wave equation is a measure of the light
intensity, without necessarily requiring that it be a component of the
electric vector.

As is always done in the wave theory, we take a simple periodic
function to represent the dependence on time:

$$\Psi = \psi e^{2\pi i\nu t} = \psi e^{2\pi i(E/h)t}, \qquad (27)$$

and obtain for the variation of $\psi$ with position the fundamental
Schrödinger equation †

$$\Delta\psi + \frac{8\pi^2 m}{h^2}(E - U)\psi = 0. \qquad (28)$$

This equation has the same significance in atomic mechanics as has
the Hamilton-Jacobi equation

$$H\left(q, \frac{\partial S}{\partial q}\right) = E \qquad (29)$$

in celestial mechanics.

* See F. Knauer and O. Stern, Zeitschr. f. Phys., 53, 11-12, p. 779 (1929).
† E. Schrödinger, Phys. Rev., 28, p. 1049 (1926); Collected Papers on Wave Mechanics,
pp. 2, 77.
4. The Proper Values of the Wave Equation.

The wave function $\Psi$ governs the motion of the electrons in the
same way as light waves determine the motion of the photons:
wherever $\Psi$ vanishes there are no electrons. If, then, we seek a
solution which is to represent the motion of an electron in an atom, $\Psi$
must certainly vanish at infinity, for the bound electrons are all
confined to the finite region. In addition, we must demand that any
physically useful solution is to be a continuous and a single-valued 
function of position in the finite region. Simple as these conditions 
seem, they are, nevertheless, sufficient to determine the characteristic 
values which in the old quantum mechanics were singled out from 
among the many mechanically possible solutions only by imposing 
an auxiliary condition. It may be shown that the condition of single- 
valuedness is identical with the old quantum conditions for a suffi- 
ciently large periodic orbit. To demonstrate this we shall confine our 
attention to systems with one degree of freedom. As a matter of fact, 
the extension to more degrees of freedom — and hence, in general, to 
multiply-periodic systems — does not present the slightest difficulty. 
Let us consider a closed orbit to be the orthogonal trajectory of wave 
surfaces. Then if we move along an orbit, $\Psi$ must return to its former
value when we again reach the starting-point, in harmony with our
requirement that $\Psi$ is to be single valued. This means that the phase
of the wave motion must have increased by an integral multiple of $2\pi$,
and this in turn implies that a whole number of wave-lengths must be
contained in the path. In general, the wave-length may vary from
place to place if the index of refraction is variable. We may therefore
express the condition of single-valuedness as follows:

$$\oint \frac{dq}{\lambda} = \frac{1}{h}\oint \sqrt{2m(E - U)}\,dq = n. \qquad (30)$$



But according to the Hamilton-Jacobi equation (p. 127), we have

$$\frac{1}{2m}\left(\frac{dS}{dq}\right)^2 + U = E, \quad \text{i.e.} \quad \frac{dS}{dq} = p = \sqrt{2m(E - U)}. \qquad (31)$$

We thus obtain the old quantum condition

$$\oint p\,dq = nh. \qquad (32)$$


It might appear that this merely secures a more reasonable
foundation for the quantum conditions without altering the result
in the least. At the same time it must be remembered that the
condition of single-valuedness is given a significant meaning by equation (30)
only if the variation in wave-length is so gradual that the whole orbit
can be divided into intervals $dq$, within which several equal
wave-lengths lie. Since this condition is by no means always satisfied, it is
quite possible that a strict calculation based upon the wave equation
may lead to somewhat different results.
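As an illustration of the phase integral in (30) and (32), the closed-orbit integral of $p\,dq$ can be evaluated numerically. The example below is a sketch under stated assumptions: a harmonic potential $U = \tfrac{1}{2}m\omega^2q^2$ (not treated in the text at this point), arbitrary units, and illustrative function names. For this potential the integral is known to equal $2\pi E/\omega = E/\nu$, so that the condition (32) singles out energies proportional to $nh\nu$.

```python
import math

def action_integral(energy, mass, omega, steps=20_000):
    """Closed-orbit integral of p dq for U = mass * omega**2 * q**2 / 2 (midpoint rule)."""
    q_max = math.sqrt(2.0 * energy / (mass * omega**2))   # classical turning point
    dq = 2.0 * q_max / steps
    half_orbit = 0.0
    for i in range(steps):
        q = -q_max + (i + 0.5) * dq
        p_squared = 2.0 * mass * (energy - 0.5 * mass * omega**2 * q * q)
        half_orbit += math.sqrt(max(p_squared, 0.0)) * dq
    return 2.0 * half_orbit                                # out and back

# Illustrative values in arbitrary units (assumptions, not from the text).
energy, mass, omega = 3.0, 1.0, 2.0
print(action_integral(energy, mass, omega), 2.0 * math.pi * energy / omega)
```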

The theory of linear differential equations shows that the
condition of single-valuedness, or the boundary conditions, can be
satisfied only for certain values of the parameter $E$. These are called the
proper values, or characteristic values,* of the differential equation
(cf. the simple cases discussed on p. 186 et seq.). Our chief task,
therefore, is the determination of the proper values for a given
mechanical problem; these are, by their very nature, the selected
energy levels. The corresponding characteristic functions, or proper
functions, on the other hand, are only of secondary importance. We
shall illustrate the process by two examples — the hydrogen atom and 
rotational band spectra. 

5. The Hydrogen Atom. 

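Since the potential energy

$$U = -\frac{e^2}{r} \qquad (33)$$

depends only upon $r$, the co-ordinates appropriate to the
problem are spatial polar co-ordinates. Denoting the electron mass
by $m_0$, the Schrödinger equation (28) (p. 696) becomes (cf. p. 268)

$$\frac{\partial^2\psi}{\partial r^2} + \frac{2}{r}\frac{\partial\psi}{\partial r} + \frac{1}{r^2}\left\{\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\psi}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2\psi}{\partial\phi^2}\right\} + \frac{8\pi^2 m_0}{h^2}\left(E + \frac{e^2}{r}\right)\psi = 0. \qquad (34)$$

We try to solve this equation by a value of $\psi$ which is the product of
separate functions of $r$, $\theta$ and $\phi$:

$$\psi = R(r)\,\Theta(\theta)\,\Phi(\phi). \qquad (35)$$

So far as $\phi$ is concerned, we can have only the function

$$\Phi = e^{\pm im\phi}, \qquad (36)$$

or a linear function of $\cos m\phi$ and $\sin m\phi$, with integral values of $m$ on
account of the single-value condition. Whatever function we take,
we have always

$$\frac{\partial^2\Phi}{\partial\phi^2} = -m^2\Phi. \qquad (37)$$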

* Ger. Eigenwerte, Eigenfunktionen; Fr. constantes fondamentales, fonctions
fondamentales.



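Eliminating $\Phi$, we obtain an equation depending only
upon $r$ and $\theta$:

$$\frac{r^2}{R}\left\{\frac{d^2R}{dr^2} + \frac{2}{r}\frac{dR}{dr} + \frac{8\pi^2 m_0}{h^2}\left(E + \frac{e^2}{r}\right)R\right\} = -\frac{1}{\Theta}\left\{\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{d\Theta}{d\theta}\right) - \frac{m^2}{\sin^2\theta}\,\Theta\right\}, \qquad (38)$$

which we have written in such a way that the left member contains
only functions of $r$, the right member only functions of $\theta$, so that each
member must be a constant, say $a$, so that the variables are separated.
First we examine the equation involving $\theta$:

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{d\Theta}{d\theta}\right) + \left(a - \frac{m^2}{\sin^2\theta}\right)\Theta = 0. \qquad (39)$$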


This equation is familiar as the differential equation of spherical
harmonics. For details, see the Mathematical Addendum, § 4 (p. 819).
We substitute

$$\cos\theta = x, \qquad (40)$$

and write in place of $\Theta(\cos\theta)$ the more common symbol $y(x)$; there results

$$(1 - x^2)\frac{d^2y}{dx^2} - 2x\frac{dy}{dx} + \left(a - \frac{m^2}{1 - x^2}\right)y = 0. \qquad (41)$$

As is shown in the Theory of Spherical Harmonics, this equation has
solutions which satisfy the conditions of single-valuedness and
continuity if $a$ is of the form

$$a = (k + m)(k + m + 1), \qquad (42)$$

where $k$ and $m$ are whole numbers. We set $k + m = l$ and call the
solutions corresponding to $a = l(l + 1)$ spherical harmonics of $l$th
order. There are $l + 1$ different functions of this kind, since $m$ can
assume all values from 0 to $l$. For $m = 0$ we obtain the so-called
surface zonal harmonics or Legendre polynomials. These are simple
polynomials of degree $l$, and are most readily obtained as the coefficients
of the powers of $r$ in the expansion of the expression

$$F(r, x) = (1 - 2rx + r^2)^{-1/2}.$$

This gives

$$\begin{aligned}
P_0 &= 1, & P_1 &= x, \\
P_2 &= \tfrac{3}{2}x^2 - \tfrac{1}{2}, & P_3 &= \tfrac{5}{2}x^3 - \tfrac{3}{2}x, \\
P_4 &= \tfrac{35}{8}x^4 - \tfrac{15}{4}x^2 + \tfrac{3}{8}, & P_5 &= \tfrac{63}{8}x^5 - \tfrac{35}{4}x^3 + \tfrac{15}{8}x, \ \text{\&c.}
\end{aligned} \qquad (43)$$
As one may easily verify, the polynomial of degree $l$ is a solution of
that differential equation for which the value of $a$ is $l(l + 1)$; for
example, $P_5$ corresponds to $a = 30$. When the value of $m$ is different
from zero we again obtain polynomials as solutions. These are called
the associated harmonics, and are related to the surface zonal
harmonics by the equation

$$P_l^m(x) = (1 - x^2)^{m/2}\,\frac{d^m P_l(x)}{dx^m}. \qquad (44)$$

Since the $l$th derivative of $P_l$ is a constant, the spherical harmonic
belonging to $m = l$, which corresponds to the highest azimuthal
subdivision, is

$$P_l^l = K(1 - x^2)^{l/2} = K(\sin\theta)^l. \qquad (45)$$
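The statement that the polynomial of degree $l$ belongs to $a = l(l+1)$ can be verified mechanically. The sketch below is an illustration only: instead of the generating function used in the text it builds the polynomials from the standard three-term recurrence $(n+1)P_{n+1} = (2n+1)xP_n - nP_{n-1}$ (an assumed substitute technique), and the function names are hypothetical. It then forms the left side of the zonal equation (the case $m = 0$) with exact fractions.

```python
from fractions import Fraction

def legendre(l):
    """Coefficients of P_l, lowest power first, from Bonnet's recurrence."""
    prev, cur = [Fraction(1)], [Fraction(0), Fraction(1)]   # P_0 and P_1
    if l == 0:
        return prev
    for n in range(1, l):
        # (n + 1) P_{n+1} = (2n + 1) x P_n - n P_{n-1}
        nxt = [Fraction(0)] * (n + 2)
        for k, c in enumerate(cur):
            nxt[k + 1] += (2 * n + 1) * c
        for k, c in enumerate(prev):
            nxt[k] -= n * c
        prev, cur = cur, [c / (n + 1) for c in nxt]
    return cur

def zonal_residual(coeffs, a):
    """Coefficients of (1 - x^2) y'' - 2 x y' + a y for the polynomial y."""
    out = [Fraction(0)] * len(coeffs)
    for k, c in enumerate(coeffs):
        out[k] += (a - k * (k + 1)) * c       # from -x^2 y'' - 2 x y' + a y
        if k >= 2:
            out[k - 2] += k * (k - 1) * c     # from y''
    return out

print(legendre(4))
print(zonal_residual(legendre(5), 30))
```

With $a = 30$ every coefficient of the residual for $P_5$ vanishes; with any other value of $a$ it does not.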


We have still to determine $R$ as a function of the radius $r$. If we
substitute the value $a = l(l + 1)$ in the separated equation (38) (p. 699),
the differential equation for $R$ becomes

$$\frac{d^2R}{dr^2} + \frac{2}{r}\frac{dR}{dr} + \left(\frac{8\pi^2 m_0 E}{h^2} + \frac{8\pi^2 m_0 e^2}{h^2 r} - \frac{l(l+1)}{r^2}\right)R = 0. \qquad (46)$$

We introduce the abbreviations

$$\frac{8\pi^2 m_0 E}{h^2} = A, \qquad \frac{4\pi^2 m_0 e^2}{h^2} = B, \qquad -l(l+1) = C, \qquad (47)$$

and obtain

$$\frac{d^2R}{dr^2} + \frac{2}{r}\frac{dR}{dr} + \left(A + \frac{2B}{r} + \frac{C}{r^2}\right)R = 0. \qquad (48)$$

The nature of the solutions depends essentially on the sign of $A$, and
hence of $E$, corresponding to the case of an elliptic or a hyperbolic orbit,
as may be seen from the behaviour of the equation for large values of
$r$. In this case the terms not containing $1/r$ dominate, so that the
equation may be replaced approximately by

$$\frac{d^2R}{dr^2} + AR = 0. \qquad (49)$$


According to the sign of A we have either an oscillating function as 
a solution or one which tends to zero or infinity exponentially. 

Consider first the case when $A$, and therefore $E$, is negative. It
turns out that this corresponds to the case of an elliptic orbit. Let

$$A = -\frac{1}{r_0^2}, \qquad (50)$$

where $A$ must have the dimensions of $r^{-2}$, in order to have $AR$ of the
same dimensions as $d^2R/dr^2$. In addition, we set the dimensionless
quantity

$$\frac{r}{r_0} = \frac{\rho}{2}. \qquad (51)$$

In this notation, the asymptotic solution is, according to (49),

$$R = Ke^{\pm\rho/2}. \qquad (52)$$



The positive sign of the exponent is excluded if the solution is to have
a physical meaning. We try the solution

$$R = e^{-\rho/2}\,v(\rho) \qquad (53)$$

for the finite region and thus obtain the differential equation

$$\frac{d^2v}{d\rho^2} + \left(\frac{2}{\rho} - 1\right)\frac{dv}{d\rho} + \left[\frac{r_0B - 1}{\rho} - \frac{l(l+1)}{\rho^2}\right]v = 0, \qquad (54)$$

which we again try to solve by means of a polynomial.* The theory
of differential equations shows that the lowest power whose coefficient
does not vanish is the $l$th in our equation. If, then, we write

$$v = \rho^l w(\rho) = \rho^l(a_0 + a_1\rho + \dots a_\nu\rho^\nu + \dots), \qquad (55)$$

and use this expression and the derivatives obtained from it in equation
(54), we obtain the recurrence formula

$$[(\nu + l + 1)(\nu + l) + 2(\nu + l + 1) - l(l + 1)]\,a_{\nu+1} = [\nu + l + 1 - r_0B]\,a_\nu. \qquad (56)$$

The series terminates with the term $a_\nu\rho^\nu$ if

$$r_0B = \nu + l + 1 = n, \qquad (57)$$

where $n$ is any positive integer; i.e. by (50), if

$$B^2 = -n^2A. \qquad (58)$$

Hence, by (47), the possible values for the energy are

$$E_n = -\frac{2\pi^2 m_0 e^4}{h^2 n^2}. \qquad (59)$$



But these are precisely the energy levels of the hydrogen atom which
were obtained in the old quantum mechanics on the basis of the Bohr
postulates. Corresponding to the spherical harmonic of $l$th order,
then, the lowest value of $n$ is $n = l + 1$. This corresponds to
the empirical rule of the old theory according to which it was
necessary to count the number $l$ from 0 to $n - 1$ instead of from
1 to $n$. In the same way, the $(2l + 1)$ different values of $m$, which
we interpreted as the number of integral projections of $l$, are here
found to be the number of surface harmonics $P_l^m(x)e^{\pm im\phi}$ which, by
equation (44) (p. 700), correspond to a given numerical value of $l$.
The factor 2 results from the double sign of the exponent which,
however, gives but one value for $m = 0$. The proper function
corresponding to the proper value $E_n$ is

$$\psi_{nlm} = e^{-\rho/2}\rho^l\,w_{n-l-1}(\rho)\,P_l^m(\cos\theta)\cos m\phi, \qquad (60)$$

where the degree $\nu$ of the polynomial $w_\nu$ is expressed in terms of $n$
and $l$. The factor $e^{-\rho/2}$ guarantees the vanishing of the $\psi$ function at
infinity. Since we wish to compute with real $\psi$ functions, we have not
put $e^{\pm im\phi}$ for $\Phi$. A second solution, corresponding to the same proper
value, is the same expression with $\sin m\phi$ replacing $\cos m\phi$.

* That the series terminates is a sufficient condition for the fulfilment of the
requirement of single-valuedness. That it is also a necessary condition cannot ...
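The termination of the series and the resulting energy values can be followed numerically. The sketch below is illustrative only: the function names are hypothetical, $a_0$ is set to 1, and modern values of $m_0$, $e$ and $h$ in Gaussian (c.g.s.) units are assumed. It applies the recurrence (56) with $r_0B = n$ and evaluates (59) in electron-volts.

```python
import math
from fractions import Fraction

def radial_coefficients(n, l, terms=8):
    """a_0 .. a_(terms-1) of w(rho) from recurrence (56) with r0*B = n and a_0 = 1."""
    coeffs = [Fraction(1)]
    for nu in range(terms - 1):
        left = (nu + l + 1) * (nu + l) + 2 * (nu + l + 1) - l * (l + 1)
        right = nu + l + 1 - n
        coeffs.append(Fraction(right, left) * coeffs[-1])
    return coeffs

def energy_ev(n):
    """E_n = -2 pi^2 m0 e^4 / (h^2 n^2), Gaussian units, converted to eV."""
    m0 = 9.1093837015e-28    # electron mass, g
    e = 4.80320471e-10       # elementary charge, e.s.u.
    h = 6.62607015e-27       # Planck constant, erg s
    erg = -2.0 * math.pi**2 * m0 * e**4 / (h**2 * n**2)
    return erg / 1.602176634e-12

print(radial_coefficients(3, 0))   # coefficients vanish beyond degree n - l - 1 = 2
print([energy_ev(n) for n in (1, 2, 3)])
```

For $n = 3$, $l = 0$ the coefficients beyond $a_2$ vanish, so that $w$ is a polynomial of degree $n - l - 1$.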

There are $n^2$ different proper functions corresponding to each
definite proper value $E_n$, for $l$ runs from 0 to $n - 1$ for a given value
of $n$, and to every value of $l$ there are, by the above, $2l + 1$ surface
harmonics, so that the number of possible proper functions is given by

$$z_n = \sum_{l=0}^{n-1}(2l + 1) = \frac{n}{2}(1 + 2n - 2 + 1) = n^2. \qquad (61)$$


The case where several proper functions correspond to one and the 
same proper value is what we refer to as a case of degeneracy in 
celestial mechanics, since the number of independent frequencies is 
smaller than the number of degrees of freedom. For this reason we 
also refer to such cases in wave mechanics as degenerate. The degree
of degeneracy, however, is different here: if $z$ proper functions are
associated with one proper value, we say the degree of degeneracy
of the system is $(z - 1)$.
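The count in (61) can be reproduced by simply listing the real proper functions. In the sketch below (the function name and the labels are illustrative assumptions) each proper function is labelled by $l$, $m$ and the choice of $\cos m\phi$ or $\sin m\phi$.

```python
def real_proper_functions(n):
    """Labels (l, m, kind) of the real proper functions belonging to E_n."""
    labels = []
    for l in range(n):                 # l = 0 .. n - 1
        for m in range(l + 1):         # m = 0 .. l
            labels.append((l, m, "cos"))
            if m > 0:                  # sin(m * phi) vanishes identically for m = 0
                labels.append((l, m, "sin"))
    return labels

for n in range(1, 5):
    print(n, len(real_proper_functions(n)))
```

The number of labels is $n^2$, so that the degree of degeneracy in the sense just defined is $n^2 - 1$.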

With regard to positive values of E , which correspond to hyper- 
bolic orbits, we may mention without proof that no discrete proper 
values exist in this case — every positive value of E is compatible 
with our requirement that the solution should be one-valued. This 
corresponds to the continuous spectra found at the series limits; 
formerly their existence was explained in an analogous way by point- 
ing out that the kinetic energy with which an electron departs from 
the atom is not subject to a quantum condition. 


6. Rotation Spectra of Diatomic Molecules. 

For simplicity we shall consider a diatomic molecule, one of whose 
atoms is of infinite mass, the other being of mass M and at a distance 
$a$ from the first. The co-ordinates appropriate to the problem are
again spatial polar co-ordinates. We write

$$a^2M = I, \qquad (62)$$

and since there is otherwise no dependence on $r$ and no potential energy,
the wave equation simplifies to

$$\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\psi}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2\psi}{\partial\phi^2} + \frac{8\pi^2 IE}{h^2}\psi = 0. \qquad (63)$$

This equation holds quite generally if we take $I$ to be the moment of
inertia about an axis through the centre of gravity normal to the
figure axis. Comparing this equation with equation (34) (p. 698),
we see that we again have the equation of surface harmonics, whose
proper values are given by

$$\frac{8\pi^2 IE}{h^2} = l(l + 1), \quad \text{or} \quad E = \frac{l(l + 1)h^2}{8\pi^2 I}. \qquad (64)$$

But this is precisely the form given in § 12 of the preceding chapter
(p. 682) for the rotational energy levels of diatomic molecules; the
old quantum theory gives this incorrectly as $l^2h^2/8\pi^2I$.* Thus the
newer theory justifies the curious rule of the replacement of $x^2$ by
$x(x + 1)$. In general, the result is that wherever the older atom
mechanics yielded $x^2$, the square of spin quantum numbers, these are to
be replaced by $x(x + 1)$, the so-called "quantum-mechanical square".
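The difference between the two rules shows itself in the spacing of successive levels. A minimal sketch follows (function names are illustrative, and energies are expressed in units of $h^2/8\pi^2I$ as an assumption of convenience):

```python
def rotational_levels(l_max, wave_mechanical=True):
    """Energies for l = 0 .. l_max in units of h^2 / (8 pi^2 I)."""
    if wave_mechanical:
        return [l * (l + 1) for l in range(l_max + 1)]   # l(l + 1)
    return [l * l for l in range(l_max + 1)]             # l^2 of the older theory

def spacings(levels):
    """Differences between successive levels."""
    return [upper - lower for lower, upper in zip(levels, levels[1:])]

print(spacings(rotational_levels(4)))                          # l(l + 1) rule
print(spacings(rotational_levels(4, wave_mechanical=False)))   # l^2 rule
```

Both rules give equidistant differences, but the wave-mechanical differences are the even multiples of the unit and those of the older theory the odd multiples.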


7. The Physical Meaning of the $\Psi$ Function. Intensity Relations for
Spectral Lines.

Schrödinger originally interpreted his $\Psi$ function in the following
way. The electron is not a localized charge within the atom, but
charge and mass are "smeared" over a certain region. The density
of charge is assumed to be proportional to $\Psi\bar{\Psi}$, the "norm" or
square of the amplitude of the $\Psi$ function. Assume first that only
a single state of the atom, corresponding to the proper value $E_m$, is
developed. The corresponding $\Psi$ function is

$$\Psi_m = \psi_m e^{2\pi i(E_m/h)t}. \qquad (65)$$

Since $\psi_m$ is real, we see that on multiplying by $\bar{\Psi}_m$ the time factor
disappears, so that the distribution of charge is constant in time.
Such a state of the atom cannot radiate, in agreement with the first
Bohr postulate.

* Writing $l$ instead of $m$.
For what follows it is necessary that the $\psi$ functions be
orthogonal, and "normalized to unity". By the latter expression we mean
that, since there is an arbitrary multiplicative constant in the solution
of the differential equation, we can choose it so as to make the integral
of the square of any proper function $\psi_m$, taken over the entire region,
equal to unity. We thus specify that all proper functions are normalized,
i.e.

$$\int \psi_m^2\,d\tau = 1. \qquad (66)$$

The orthogonal property implies that the integral over the entire
region of the product of two proper functions belonging to different
proper values is zero:

$$\int \psi_m\psi_n\,d\tau = 0. \qquad (67)$$

The proof is based upon Green's Theorem (cf. p. 270):

$$\int (\psi_m\Delta\psi_n - \psi_n\Delta\psi_m)\,d\tau = \oint (\psi_m\,\mathrm{grad}\,\psi_n - \psi_n\,\mathrm{grad}\,\psi_m)\cdot d\mathbf{f}. \qquad (68)$$

The proper function $\psi_m$ satisfies the equation

$$\Delta\psi_m + \frac{8\pi^2 m_0}{h^2}(E_m - U)\psi_m = 0, \qquad (69)$$

and $\psi_n$ satisfies the equation

$$\Delta\psi_n + \frac{8\pi^2 m_0}{h^2}(E_n - U)\psi_n = 0. \qquad (70)$$

If we multiply the first equation by $\psi_n$, the second by $\psi_m$, and subtract
the first from the second, we obtain

$$\psi_m\Delta\psi_n - \psi_n\Delta\psi_m = \frac{8\pi^2 m_0}{h^2}(E_m - E_n)\psi_m\psi_n. \qquad (71)$$

This equation is multiplied by $d\tau$ and integrated over all space. The
integral on the left is transformed into a surface integral over an
infinitely distant boundary by means of Green's Theorem (68). Since
the proper functions vanish exponentially at infinity, this integral
is zero; since $E_m \neq E_n$, the integral of $\psi_m\psi_n$ must then vanish,
and the orthogonality is demonstrated.
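Normalization and orthogonality can be illustrated with the two lowest spherically symmetric proper functions of hydrogen. The sketch below rests on assumptions not written out in the text: the explicit 1s and 2s functions are the standard ones in atomic units (lengths in Bohr radii), the function names are illustrative, and the integrals are evaluated by a simple midpoint rule.

```python
import math

def psi_1s(r):
    """Normalized 1s function in atomic units."""
    return math.exp(-r) / math.sqrt(math.pi)

def psi_2s(r):
    """Normalized 2s function in atomic units."""
    return (2.0 - r) * math.exp(-r / 2.0) / (4.0 * math.sqrt(2.0 * math.pi))

def space_integral(f, g, r_max=60.0, steps=60_000):
    """Integral of f * g over all space for spherically symmetric f and g."""
    dr = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * dr
        total += f(r) * g(r) * 4.0 * math.pi * r * r * dr
    return total

print(space_integral(psi_1s, psi_1s))   # normalization, cf. (66)
print(space_integral(psi_2s, psi_2s))
print(space_integral(psi_1s, psi_2s))   # orthogonality, cf. (67)
```

The first two integrals come out close to unity and the third close to zero, within the accuracy of the numerical integration.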

On account of the linearity of the wave equation any linear
combination

$$\Psi = c_0\Psi_0 + c_1\Psi_1 + \dots c_k\Psi_k = c_0\psi_0e^{2\pi iE_0t/h} + c_1\psi_1e^{2\pi iE_1t/h} + \dots \qquad (72)$$

is also a solution. Schrödinger interpreted this as an atomic state in
which several natural frequencies are simultaneously developed, the
amplitude factors $c_k$ being a measure of the excitation. If we use
this solution to obtain the expression $\Psi\bar{\Psi}$ we obtain

$$\Psi\bar{\Psi} = c_0^2\psi_0^2 + c_1^2\psi_1^2 + \dots c_k^2\psi_k^2 + 2c_0c_1\psi_0\psi_1\cos\frac{2\pi}{h}(E_1 - E_0)t + \dots + 2c_kc_l\psi_k\psi_l\cos\frac{2\pi}{h}(E_l - E_k)t. \qquad (73)$$

Thus the density of charge is composed of a part which is constant in
time and another part whose magnitude oscillates with the frequencies

$$\nu_{kl} = \frac{E_l - E_k}{h}. \qquad (74)$$

This variable charge, together with the nucleus, represents a variable
dipole moment which, in accord with the laws of electrodynamics,
emits light of frequencies $\nu_{kl}$. When averaged in time, the contributions
of the variable terms to the charge density vanish, and since the total
charge must be constant, the amplitude factors must satisfy the
auxiliary condition

$$\sum_k c_k^2 = 1. \qquad (75)$$


The dipole moment, from which the emission may be calculated
according to p. 341, becomes

$$\mathbf{p}_{kl} = -2c_kc_le\int \mathbf{r}\,\psi_k\psi_l\,d\tau, \qquad (76)$$

where $\mathbf{r}$ is the radius vector drawn from the nucleus to the volume
element $d\tau$. The second Bohr postulate, by means of which the
quantity h was introduced a second time without justification, is 
thus fulfilled automatically, on account of (74), in the Schrodinger 
formulation. 
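The appearance of the difference frequency in the charge density can be confirmed directly for two superposed states. The following is a sketch with illustrative names and values: real amplitudes $c_0$, $c_1$ and real values of $\psi_0$, $\psi_1$ at one fixed point are assumed, and $h$ is set to unity for convenience.

```python
import cmath
import math

def density(c0, c1, psi0, psi1, e0, e1, t, h=1.0):
    """Psi times its conjugate for a superposition of two states."""
    big_psi = (c0 * psi0 * cmath.exp(2j * math.pi * e0 * t / h)
               + c1 * psi1 * cmath.exp(2j * math.pi * e1 * t / h))
    return (big_psi * big_psi.conjugate()).real

def density_from_73(c0, c1, psi0, psi1, e0, e1, t, h=1.0):
    """Constant part plus the term oscillating with frequency (E1 - E0) / h."""
    return ((c0 * psi0) ** 2 + (c1 * psi1) ** 2
            + 2.0 * c0 * c1 * psi0 * psi1
            * math.cos(2.0 * math.pi * (e1 - e0) * t / h))

# Illustrative values only (assumptions, not from the text).
args = (0.6, 0.8, 0.3, -0.5, 1.0, 2.5)
for t in (0.0, 0.1, 0.37, 1.0):
    print(t, density(*args, t), density_from_73(*args, t))
```

The two columns agree at every instant, and the oscillating part has the period $h/(E_1 - E_0)$.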

Although the Schrödinger interpretation must be replaced nowadays
by a statistical view, it gives, nevertheless, a very vivid derivation
of the dipole components which determine the intensity of spectral
lines, viz.

$$x_{kl} = e\int x\,\psi_k\psi_l\,d\tau. \qquad (76')$$

These components are identical with the "matrix elements" which
determine the relative intensities of spectral lines in a quite different
theory developed by Heisenberg.* In particular, the matrix theory
gives the old selection principles correctly.

* Heisenberg, Zeitschr. f. Phys., 33, p. 879 (1925). That the wave and matrix
theories are mathematically equivalent was shown independently by Eckart and by
Schrödinger.

The Schrödinger interpretation of the $\Psi$ function met unsurmountable
difficulties when an attempt was made to extend the reasoning
to several electrons. In this case $\psi$ depends upon $3N$ co-ordinates,
and the quantity $\psi$ is determined, not as a function of position in actual
space, but in the $3N$-dimensional configuration space. In place of
the Schrödinger interpretation we then have a statistical one; $\Psi\bar{\Psi}$ is
a measure of the probability of finding an electron in the volume
element $d\tau$, or (for more than one electron) the probability that the
electrons lie in the $3N$-dimensional volume element of configuration
space. We can then average over a large number of atoms and obtain
a mean distribution of charge which corresponds to the Schrödinger
assumption. The chief difference between the original formulation 
and this new one due to M. Born is, however, that, quite in keeping 
with the principles developed in § 2 (p. 690), we continue to think of 
the electrons as points and bring in the waves to take care of the inde- 
terminacy. But since observation always gives an average over many 
atoms and fairly long times, we may in many cases treat the situation 
as though we were really dealing with the instantaneous distribution 
of charge for a single atom. The representations of charge density 
given by many recent papers are to be understood in this sense. 

There is another difference in the two formulations that refers to 
the radiation process. According to (76), it is necessary that several 
proper functions be excited in order to attain a variable dipole moment. 
Translated into the language of statistics this means that in order to 
have emission, not all atoms can be in the same excited state. In 
reality the emission of radiation demands only that the excited state
be occupied, and (76') then gives the transition probability, which is 
independent of the occupation of the lower energy states. 

The continuous X-ray spectrum is also explained in this way by 
transitions between the closely spaced states for which E > 0 (cf. 
p. 702). The largest jump yields a limit frequency $\nu_L = E_v/h$, where $E_v$
is the initial energy of the electron. 



CHAPTER XLI 

Atomic Problems first solved by the Wave Mechanics 


1. The Theory of Perturbations in Wave Mechanics. 

The wave mechanics not only yields the results of the Bohr theory 
— sometimes in an improved form, e.g. in the rotational term in band 
spectra — but succeeds in indicating the solutions of a number of 
important problems in face of which the Bohr theory was powerless. 
Of these we may mention the problem of several electrons (especially 
the helium problem), dispersion, non-polar chemical binding and, 
finally, the problem of nuclear disintegration (radioactivity). The 
concepts of wave mechanics throw new light on these questions. 

A rigorous solution of the differential equation of the problem of 
n bodies is no more possible in wave mechanics than in celestial 
mechanics. In the former we must also apply methods of approxi- 
mation, starting, just as in celestial mechanics, from the simple Kepler 
problem. The mathematical results needed for this perturbation theory 
will be developed in this section. 

Let us add a perturbing term to the potential energy in the wave
equation. For an atom, for example, this would be a series in
descending powers of $r$, corresponding to a core potential. We therefore set

$$U = U^0 + U', \qquad (1)$$

where $U^0$ represents the potential energy of the unperturbed system.
If $U'$ is small, the proper values of the perturbed system will differ
but little from those of the normal system, and we may therefore
take

$$E_k = E_k^0 + \epsilon_k. \qquad (2)$$

In the same way, we can relate the corresponding proper function to
that of the normal system by setting

$$\psi_k = \psi_k^0 + \psi_k'. \qquad (3)$$

We then have for the Schrödinger equation of the perturbed system

$$\Delta(\psi_k^0 + \psi_k') + \frac{8\pi^2 m_0}{h^2}(E_k^0 + \epsilon_k - U^0 - U')(\psi_k^0 + \psi_k') = 0. \qquad (4)$$

Neglecting the products of perturbing terms and making use of the
fact that $\psi_k^0$ satisfies the Schrödinger equation whose parameter is
$E_k^0$, we obtain the following equation for $\psi_k'$:

$$\Delta\psi_k' + \frac{8\pi^2 m_0}{h^2}(E_k^0 - U^0)\psi_k' = -\frac{8\pi^2 m_0}{h^2}(\epsilon_k - U')\psi_k^0. \qquad (5)$$

The left member of this non-homogeneous differential equation is
identical with that of the equation for the normal system; hence for
vanishing perturbation this leads both to the proper value $E_k^0$
and also to the proper function $\psi_k^0$, as it must do. Now the theory
of differential equations shows that a necessary condition for the
existence of a solution of the non-homogeneous equation is the
orthogonality of the perturbation function on the right to the solution of
the homogeneous equation. This is proved by Green's Theorem, as at
(67) and (68) (p. 704). Hence

$$\int (\epsilon_k - U')(\psi_k^0)^2\,d\tau = 0. \qquad (6)$$

But this is an equation determining the change in the proper value
caused by the perturbation, which is what interests us primarily. Since
$\psi_k^0$ is normalized, this yields

$$\epsilon_k = \int U'(\psi_k^0)^2\,d\tau. \qquad (7)$$

This result corresponds to the theorem in celestial mechanics which
states that, to a first approximation, the value of the perturbational
energy is equal to the perturbation function averaged over the normal
orbit.
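A simple numerical case shows how (7) is used. The sketch below makes several assumptions that are not in the text: the unperturbed function is the normalized hydrogen ground state in atomic units, the perturbation is a single core-potential term $U' = \beta/r^2$ with an arbitrarily chosen small $\beta$, and the function name is illustrative. For this case the integral can also be done by hand and equals $2\beta$.

```python
import math

def first_order_shift(perturbation, r_max=40.0, steps=40_000):
    """epsilon = integral of U'(r) * psi_1s(r)**2 over all space (midpoint rule)."""
    dr = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * dr
        psi_squared = math.exp(-2.0 * r) / math.pi    # (psi_1s)^2 in atomic units
        total += perturbation(r) * psi_squared * 4.0 * math.pi * r * r * dr
    return total

beta = 0.01                                           # assumed strength of the perturbation
shift = first_order_shift(lambda r: beta / r**2)
print(shift, 2.0 * beta)
```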

If the perturbations of the proper values are known, the perturbing
terms of the proper functions also may be determined by a
development analogous to Fourier's series. That is, under the very general
continuity conditions, which are always fulfilled in a physical problem,
any function may be developed in a series of proper functions. If,
for example, we set

$$f(x, y, z) = a_1\psi_1 + a_2\psi_2 + \dots a_m\psi_m + \dots, \qquad (8)$$

we obtain (cf. p. 189) the coefficients exactly as for Fourier series by
multiplying both members by $\psi_m$ and integrating over the entire
region. Because of the orthogonality, all terms with subscript different
from $m$ vanish, and there remains

$$\int f(x, y, z)\,\psi_m\,d\tau = a_m\int \psi_m^2\,d\tau = a_m. \qquad (9)$$

Accordingly, we develop the right side of equation (5) according to
proper functions

$$-\frac{8\pi^2 m_0}{h^2}(\epsilon_k - U')\psi_k^0 = \sum_m a_m\psi_m^0, \qquad (10)$$

where

$$a_m = \frac{8\pi^2 m_0}{h^2}\int (U' - \epsilon_k)\,\psi_k^0\psi_m^0\,d\tau \quad \begin{cases} \neq 0 & \text{for } k \neq m, \\ = 0 & \text{for } k = m. \end{cases} \qquad (11)$$


Similarly, we suppose the function $\psi_k'$ to be written as a series of the
type:

$$\psi_k' = \sum_m b_m\psi_m^0. \qquad (12)$$

These series are substituted in equation (5). Then, since the following
relation holds for $\psi_m^0$:

$$\Delta\psi_m^0 + \frac{8\pi^2 m_0}{h^2}(E_m^0 - U^0)\psi_m^0 = 0,$$

comparison of the coefficients of each $\psi_m^0$ on both sides of the equation
yields

$$b_m = \frac{h^2}{8\pi^2 m_0}\,\frac{a_m}{E_k^0 - E_m^0}. \qquad (13)$$

Thus the condition of orthogonality, (6) and (11), prevents the
occurrence of terms with vanishing denominator.

A complication arises here, as in celestial mechanics, in the very
frequently occurring case in which the unperturbed system is
degenerate, i.e. when several proper functions correspond to one and
the same proper value. We say that the system has a degree of
degeneracy $(\alpha - 1)$ if $\alpha$ linearly independent proper functions
correspond to $E_k^0$. Every linear combination of these proper functions also
represents a solution of the normal system. As may be seen from the
derivation on p. 699, the proper functions corresponding to the same
proper value are not, in general, orthogonal. However, by means of
a linear substitution we can always set up linear combinations of the
original proper functions in such a way that the proper functions
corresponding to a single proper value also have the property of
orthogonality; in fact, we still have a number of constants whose
values we must determine when the perturbing force is applied.
If we put

$$\begin{aligned}
\psi_k^1 &= a_k^{11}\phi_k^1 + a_k^{12}\phi_k^2 + \dots + a_k^{1\alpha}\phi_k^\alpha \\
\psi_k^2 &= a_k^{21}\phi_k^1 + a_k^{22}\phi_k^2 + \dots + a_k^{2\alpha}\phi_k^\alpha \\
&\dots \\
\psi_k^\alpha &= a_k^{\alpha 1}\phi_k^1 + a_k^{\alpha 2}\phi_k^2 + \dots + a_k^{\alpha\alpha}\phi_k^\alpha
\end{aligned} \qquad (14)$$

where $\phi_k^1, \dots \phi_k^\alpha$ stand for the original proper functions,
we have $\alpha^2$ coefficients, while the number of equations to be satisfied
is $\alpha + \alpha(\alpha - 1)/2$, viz. $\alpha$ equations involving $\int (\psi_k^i)^2\,d\tau = 1$ and $\alpha(\alpha - 1)/2$
equations involving $\int \psi_k^i\psi_k^h\,d\tau = 0$. Thus we have still $\alpha(\alpha - 1)/2$
coefficients free. But this is the same as the number of independent
coefficients of an orthogonal linear substitution, i.e. a substitution
whose coefficients $b_k^{ih}$ satisfy the equations

$$\sum_i b_k^{ih}b_k^{il} = \begin{cases} 0 & \text{for } h \neq l, \\ 1 & \text{for } h = l. \end{cases} \qquad (15)$$

This transformation applied to the co-ordinates of a point signifies
the rotation of a set of three rectangular axes. Actually, the
orthogonal property of the proper functions is not disturbed by making
such a substitution, as may be verified at once. We make the a priori
assumption that the $\alpha$ initial functions satisfy the condition of
orthogonality. If the degeneracy is removed by the perturbation, i.e.
if the single proper value of multiplicity $\alpha$ is split into $\alpha$ different
proper values $E_k^0 + \epsilon_k$, it is still undetermined which of the possible
linear combinations of proper functions yield the proper functions of
the perturbed system. We take this into account by forming the
following linear combinations of the functions $\psi_k^1, \psi_k^2, \dots \psi_k^\alpha$, which are
already assumed to be orthogonal:

$$\begin{aligned}
\psi_k^{1*} &= b_k^{11}\psi_k^1 + b_k^{12}\psi_k^2 + \dots b_k^{1\alpha}\psi_k^\alpha \\
\psi_k^{2*} &= b_k^{21}\psi_k^1 + b_k^{22}\psi_k^2 + \dots b_k^{2\alpha}\psi_k^\alpha \\
&\dots \\
\psi_k^{\alpha *} &= b_k^{\alpha 1}\psi_k^1 + b_k^{\alpha 2}\psi_k^2 + \dots b_k^{\alpha\alpha}\psi_k^\alpha
\end{aligned} \qquad (16)$$

The coefficients are to satisfy the condition (15).

According to the theory of differential equations it is necessary,
in order that a solution exist for the perturbed system, that the right
side of equation (5) (p. 708) be orthogonal to all of the linearly
independent solutions of the unperturbed system. If we insert a linear
combination $\psi_k^{i*}$ in the perturbation term, we obtain from the
equations

$$\begin{aligned}
\int \psi_k^1(\epsilon_k - U')\,\psi_k^{i*}\,d\tau &= 0 \\
\int \psi_k^2(\epsilon_k - U')\,\psi_k^{i*}\,d\tau &= 0 \\
&\dots \\
\int \psi_k^\alpha(\epsilon_k - U')\,\psi_k^{i*}\,d\tau &= 0
\end{aligned} \qquad (17)$$

the $\alpha$ proper values $E_k^0 + \epsilon_k$ as well as the $\alpha$ initial functions
(approximations of order zero). To show this we write the abbreviation

$$\int U'\psi_k^i\psi_k^h\,d\tau = \epsilon_k^{ih} = \epsilon_k^{hi}, \qquad (18)$$

and substitute the values of $\psi_k^{i*}$ from (16) in (17). Then since $U'$
is assumed known as a function of position and since the functions
$\psi_k^h$ are assumed known, we obtain a system of equations for the
coefficients $b_k^{ih}$, which we may look upon as unknowns. The system

$$\begin{aligned}
b_k^{i1}(\epsilon_k^{11} - \epsilon_k) + b_k^{i2}\epsilon_k^{12} + \dots b_k^{i\alpha}\epsilon_k^{1\alpha} &= 0 \\
b_k^{i1}\epsilon_k^{21} + b_k^{i2}(\epsilon_k^{22} - \epsilon_k) + \dots b_k^{i\alpha}\epsilon_k^{2\alpha} &= 0 \\
&\dots \\
b_k^{i1}\epsilon_k^{\alpha 1} + b_k^{i2}\epsilon_k^{\alpha 2} + \dots b_k^{i\alpha}(\epsilon_k^{\alpha\alpha} - \epsilon_k) &= 0
\end{aligned} \qquad (19)$$

corresponds to $\psi_k^{i*}$. Since the equations are linear and homogeneous,
the existence of a solution differing from zero requires that the
determinant of the coefficients should vanish. This yields the so-called
"secular equation"

$$\begin{vmatrix}
(\epsilon_k^{11} - \epsilon_k) & \epsilon_k^{12} & \dots & \epsilon_k^{1\alpha} \\
\epsilon_k^{21} & (\epsilon_k^{22} - \epsilon_k) & \dots & \epsilon_k^{2\alpha} \\
\dots & \dots & \dots & \dots \\
\epsilon_k^{\alpha 1} & \epsilon_k^{\alpha 2} & \dots & (\epsilon_k^{\alpha\alpha} - \epsilon_k)
\end{vmatrix} = 0. \qquad (20)$$

This equation is of degree $\alpha$ and so has $\alpha$ roots. Because of the
symmetry of the coefficients all roots are real. We again encounter this
equation for each solution $\psi_k^{i*}$. Substitution of one of the roots $\epsilon_k^r$
yields the coefficients $b_k^{ih}$ after solving the system of equations. This
then gives the zero order approximation to the perturbed proper
value, from which the proper function corresponding to the value
$E_k^0 + \epsilon_k^r$ develops when the perturbation is applied to the system.
If the secular equation has several equal roots, this means that the 
degeneracy has not been completely removed by the perturbation. 
Then, without the addition of further perturbations, each linear com- 
bination of the proper functions associated with the multiple roots 
again represents a solution. The new proper functions thus found form
an orthogonal set because of the symmetry of the coefficients of the
secular equation. This represents the multidimensional extension of
the fact that the axes of the second-degree surfaces are mutually per-
pendicular (cf. Ex. 18, p. 37).
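
For the simplest case of two degenerate functions the secular equation can be solved in closed form. The following is a minimal sketch in Python, not part of the original text; the function name `solve_secular_2x2` and the numerical matrix elements used to try it out are illustrative assumptions. It returns both roots together with normalized coefficient pairs, so that one can check that the roots are real and that the two coefficient vectors are orthogonal.

```python
import math


def solve_secular_2x2(e11, e12, e22):
    """Roots and coefficient vectors of the 2x2 secular equation
    (e11 - eps)(e22 - eps) - e12**2 = 0 for symmetric matrix elements."""
    mean = 0.5 * (e11 + e22)
    half_diff = 0.5 * (e11 - e22)
    # The discriminant is a sum of squares, so both roots are real.
    radius = math.hypot(half_diff, e12)
    results = []
    for eps in (mean - radius, mean + radius):
        if e12 != 0.0:
            # Satisfies (e11 - eps) * b1 + e12 * b2 = 0.
            b1, b2 = e12, eps - e11
        elif abs(e11 - eps) <= abs(e22 - eps):
            b1, b2 = 1.0, 0.0
        else:
            b1, b2 = 0.0, 1.0
        norm = math.hypot(b1, b2)
        results.append((eps, (b1 / norm, b2 / norm)))
    return results


if __name__ == "__main__":
    for eps, coeffs in solve_secular_2x2(1.0, 0.5, 2.0):
        print(eps, coeffs)
```

If `e12` vanishes and `e11` equals `e22`, the two roots coincide and the sketch returns the same vector twice; this is the case of a degeneracy that the perturbation does not remove, where any linear combination is admissible.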

The perturbation theory in wave mechanics is confined to perturb-
ing forces which have a potential. It is necessary to extend the
Schrödinger equation in order to be able to compute the proper values
for [...] perturbations, i.e. those not derivable from a potential.
In this instance we are able to derive only the normal Zeeman
effect; only by a very complicated generalization of the wave equation
is it possible to obtain the additional terms corresponding to
[...] by Dirac, but this cannot be discussed in detail here.

2. The Chemical Bond. Formation of the Hydrogen Molecule.

As an example of the application of quantum perturbation theory 
we deduce the solution of one of the most important problems in atomic 
physics— the nature of the chemical bond. There is little doubt re- 
garding the nature of the forces holding together polar compounds, 
which are composed of positive and negative ions, as exemplified by 
salts. The forces in this instance are those of electrostatic attraction. 
A more difficult question is that concerning the cause of the limi- 
tation of the attractive forces, so that the oppositely charged ions 
remain at a definite distance from each other — in other words, what 
is the source of the repulsive forces which appear when the ions are 
brought close together ? The wave mechanics offers information on 
this point. If we imagine the positive ions replaced by protons, they 
will, of course, be attracted by the negative ions, provided that the 
distance apart is not too small. Closer approach will ultimately cause 
the proton to enter the electron “ cloud ” surrounding the negative 
ion. The ions of a simple salt possess closed electron shells of the rare 
gas type, so that the electrons are symmetrically disposed. According 
to a fundamental law of the theory of the potential, only that part 
of the electron cloud which is within the surface of the sphere upon 
which the proton lies exerts any influence on the proton. Thus the 
proton will soon come to a point where the repulsion of the nuclear 
charge preponderates. Since, by p. 702, the density of charge de- 
creases exponentially with distance from the centre, the repulsive 
forces must increase in this manner. Thus the simple law of force 
containing a single exponent is valid only at certain distances, and is 
merely a makeshift for the true law of force. This is manifested by 
the variations in the empirical value of the exponent of the repulsive 
force. 

A much more difficult question to answer is that concerning the 
nature of the forces holding two atoms of the same kind together. The 
simplest instance of such a non-polar bond is given by H 2 ; a more 
complicated instance is the bond between the carbon atoms in diamond. 
A method of explanation was first indicated by the wave mechanics 
through the introduction of exchange forces or, better, exchange energies. 
Heisenberg was led to this concept when computing the large difference 
in energy between the triplet and singlet terms of helium which — 
according to atom models — differ only in having the spins of the two 
electrons parallel or antiparallel. The small magnetic energy of two 
magnetons is not sufficient to account for the difference. If the elec- 
trons are distinguishable from each other, a given solution will always 
be accompanied by a second one belonging to the same characteristic 
value and in which the two electrons are interchanged. In this sense 
the system is degenerate and each linear combination also constitutes
a solution. Here again, as in § 1, we must seek those combinations
from which the new characteristic values develop by the addition of a 
perturbation. The mutual electrostatic energy of the pair of electrons, 
which was previously neglected, constitutes the perturbation in this 
instance. Proceeding in this way, there results a large difference in the 
perturbations of the characteristic values for the two linear combina- 
tions that correspond to parallel and antiparallel spins. We can 
acquaint ourselves with the method by considering the bond between 
two hydrogen atoms: 

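Consider two hydrogen nuclei (protons), a and b, at a large distance
R apart. Let there be also two electrons. Assume the distance of elec-
tron 1 from a to be $r_{a1}$, and from b, $r_{b1}$. Electron 2 is at a distance $r_{a2}$ from
a, $r_{b2}$ from b. Call the distance apart of the two electrons $r_{12}$. If the
nuclei are held fixed, the Schrödinger equation for the system is

\[
\frac{\partial^2\psi}{\partial x_1^2} + \frac{\partial^2\psi}{\partial y_1^2} + \frac{\partial^2\psi}{\partial z_1^2}
+ \frac{\partial^2\psi}{\partial x_2^2} + \frac{\partial^2\psi}{\partial y_2^2} + \frac{\partial^2\psi}{\partial z_2^2}
+ \frac{8\pi^2 m}{h^2}\left\{E + \frac{e^2}{r_{a1}} + \frac{e^2}{r_{b2}}
- \left(\frac{e^2}{r_{12}} - \frac{e^2}{r_{a2}} - \frac{e^2}{r_{b1}}\right)\right\}\psi = 0. \tag{21}
\]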



Assuming that electron 1 belongs to nucleus a, electron 2 to nucleus b,
the quantities within parentheses are perturbing terms. Omitting
these terms, we can arrange the equation as follows:

\[
\Delta_1\psi + \frac{8\pi^2 m}{h^2}\left(E_1 + \frac{e^2}{r_{a1}}\right)\psi
+ \Delta_2\psi + \frac{8\pi^2 m}{h^2}\left(E_2 + \frac{e^2}{r_{b2}}\right)\psi = 0, \tag{22}
\]

where

\[ E = E_1 + E_2. \]


Each pair of terms set equal to zero represents the Schrödinger equa-
tion of the hydrogen atom. If, according to equation (60) on p. 702,
we therefore take the normalized solutions for the hydrogen atom in
the form

\[ \psi_a(1) = K e^{-r_{a1}/r_0}, \qquad \psi_b(2) = K e^{-r_{b2}/r_0}, \tag{23} \]

then

\[ \psi = \psi_a(1)\,\psi_b(2), \tag{24} \]

with $E = 2E_0$, certainly is a solution of the equation
of the combined system from which the interactions have been
omitted.* This may be verified at once by substitution in equation
(22). But we can assume with equal right that electron 1 belongs to
nucleus b and electron 2 belongs to nucleus a. This leads to a second
solution which corresponds to the same characteristic value $2E_0$, and
which has the form

\[ \psi = \psi_a(2)\,\psi_b(1). \tag{25} \]

The original problem, that of two hydrogen atoms assumed to con-
stitute a single system, is thus singly degenerate; each linear com-
bination $\psi_0'$ and $\psi_0''$ of the two proper functions also represents a
solution:

\[
\left.
\begin{aligned}
\psi_0' &= \alpha\,\psi_a(1)\psi_b(2) + \beta\,\psi_a(2)\psi_b(1)\\
\psi_0'' &= \gamma\,\psi_a(1)\psi_b(2) + \delta\,\psi_a(2)\psi_b(1)
\end{aligned}
\right\}
\tag{26}
\]

Four coefficients are at our disposal. Three of these are deter-
mined by the condition of orthogonality and the normalization
to unity:

\[ \int \psi_0'^{\,2}\,d\tau = \int \psi_0''^{\,2}\,d\tau = 1; \qquad \int \psi_0'\,\psi_0''\,d\tau = 0. \tag{27} \]

According to the procedure in § 1 (p. 707), the fourth constant and the
correction term $\epsilon$ in the energy are determined by the condition that,
after inserting the first approximation $\psi_0 + \epsilon\psi_1$, the right member of
the resulting non-homogeneous equation must be orthogonal to all
solutions of the homogeneous equation. Since we have two solutions
here, the secular equation is quadratic in $\epsilon$, yielding two roots which
we designate by $\epsilon_s$ and $\epsilon_a$. The approximations of zero order corre-
sponding to $\epsilon_s$ and $\epsilon_a$, from which the perturbed proper functions are
developed, result after a lengthy calculation:

\[ \psi_0^{s} = \frac{1}{\sqrt{2 + 2S}}\,\{\psi_a(1)\psi_b(2) + \psi_a(2)\psi_b(1)\}, \tag{28} \]

\[ \psi_0^{a} = \frac{1}{\sqrt{2 - 2S}}\,\{\psi_a(1)\psi_b(2) - \psi_a(2)\psi_b(1)\}, \tag{29} \]

where S is an abbreviation for the volume integral

\[ S = \int \psi_a(1)\,\psi_b(1)\,\psi_a(2)\,\psi_b(2)\,d\tau. \]

* The value of the multiplicative constant K corresponding to normalization to
unity is not to be obtained from equation (60). It must be separately computed from
the relation

\[ \int \psi^2\,d\tau = K^2 \iint e^{-2r/r_0}\,r^2\,d\Omega\,dr = 1, \]

from which it follows that

\[ K = \frac{1}{\sqrt{\pi r_0^{3}}}. \]

Thus the proper function which is associated with $\epsilon_s$ is symmetric in
both electrons, i.e. it is not altered by interchanging 1 and 2; the
other solution, that corresponding to $\epsilon_a$, is antisymmetric, i.e. its sign
changes if the two electrons are interchanged. The computation of
the roots of the secular equation is somewhat troublesome, so that only
the results are given here in the form of two curves which exhibit the
value of $\epsilon$ as a function of the parameter R (fig. 1). The important
feature is that the symmetric solution has an energy minimum at a
definite distance from the nucleus. This solution therefore corresponds to
the chemical bond, while the other solution, for which the additional
energy is always positive, corresponds to elastic rebound.
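
The normalizing factors of the symmetric and antisymmetric combinations can be checked in one line. The following worked step is an added sketch, not part of the original text; it assumes that $\psi_a$ and $\psi_b$ are real and each normalized to unity, and it writes $S$ for the volume integral of the product $\psi_a(1)\psi_b(1)\psi_a(2)\psi_b(2)$.

```latex
\begin{aligned}
\int \{\psi_a(1)\psi_b(2) \pm \psi_a(2)\psi_b(1)\}^2\,d\tau
 &= \int \psi_a^2(1)\,\psi_b^2(2)\,d\tau
  + \int \psi_a^2(2)\,\psi_b^2(1)\,d\tau
  \pm 2\int \psi_a(1)\psi_b(1)\,\psi_a(2)\psi_b(2)\,d\tau \\
 &= 1 + 1 \pm 2S = 2 \pm 2S .
\end{aligned}
```

Dividing by the square root of this value normalizes each combination to unity, with the upper sign belonging to the symmetric and the lower sign to the antisymmetric function.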

The wave mechanics was first applied in this manner by Heitler
and London.* It gives a surprisingly accurate value of the heat of
reaction in this case, considering that only the zero order approxi-
mation was computed. But in addition, this method is capable of yield-
ing information on the possibility of forming certain compounds if
Pauli's Principle is carried over to the wave mechanics. According to
this principle, no two electrons may agree in all
four quantum numbers. In wave mechanics, this means that the $\psi$
function cannot be symmetric in the co-ordinates of two electrons.
This might cause one to believe that the symmetric solution for the
hydrogen molecule is excluded. It must be remembered, however, that up to
this point the electron spin has not been included in the wave me-
chanics. According to Pauli's rule, two electrons having opposite
spins may agree in the remaining quantum numbers. Thus, taking
into account the doubling of the possibilities resulting from the spin,
we can adopt Pauli's Principle by saying that the $\psi$ function of an
atom or molecule can at most be symmetric in certain pairs of electrons,
i.e. it may be such that when electron x and electron y are interchanged
the function remains unaltered, but in that case not when x and electron
z are interchanged. The electrons in which the $\psi$ function is symmetric
have opposite directions of spin. For the helium atom, for example,
the proper function for the ground state is symmetric in both
electrons, corresponding to the equal quantum numbers n, l and $m_l$.
The symmetric function for two atoms would then be symmetric in
four electrons, which is impossible according to Pauli's Principle.
But the solution corresponds to that of the chemical bond even in
this case. The symmetric proper function resulting from the com-
bination of two electrons which, in the individual atoms, have no partners
whose three quantum numbers are the same, but which have oppositely
oriented spins, represents the interpretation of chemical valence on the
wave mechanics.

* W. Heitler and F. London, Zeitschr. f. Phys., 44, p. 455 (1927).


3. The Theory of Dispersion and of the Raman Effect in Wave 
Mechanics. 


We now consider the behaviour of an electron in the rapidly alter- 
nating electric field of a light wave. Here, as in the classical dispersion 
theory, we neglect the magnetic forces. The perturbing potential con- 
tains the time explicitly. The previous form of the wave equation is 
not suitable for treating perturbation problems of this kind, for it 
already contains the characteristic parameter, and hence there is a 
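separate differential equation for each state. It is not difficult, how-
ever, to eliminate the parameter E. It follows from our solution

\[ \Psi = \psi\,e^{2\pi i E t/h} \]

that

\[ \frac{\partial \Psi}{\partial t} = \frac{2\pi i E}{h}\,\Psi, \tag{30} \]

hence

\[ E = \frac{h}{2\pi i}\,\frac{1}{\Psi}\,\frac{\partial \Psi}{\partial t}. \tag{31} \]

Substituting this in the actual wave equation and remembering that

\[ \frac{\partial^2 \Psi}{\partial t^2} = -\frac{4\pi^2 E^2}{h^2}\,\Psi, \]

we obtain from

\[ \Delta\Psi - \frac{2m(E - U)}{E^2}\,\frac{\partial^2 \Psi}{\partial t^2} = 0 \]

the form

\[ \Delta\Psi - \frac{8\pi^2 m}{h^2}\,U\Psi - \frac{4\pi i m}{h}\,\frac{\partial \Psi}{\partial t} = 0, \]

which no longer contains a characteristic parameter.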

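Let the electric field of the incident light wave be along the z-axis
and of amplitude A. The perturbing potential is then

\[ V = -eAz\cos 2\pi\nu t, \tag{35} \]

and the new form of the wave equation is

\[ \Delta\Psi - \frac{8\pi^2 m}{h^2}\,(U^0 - eAz\cos 2\pi\nu t)\,\Psi - \frac{4\pi i m}{h}\,\frac{\partial \Psi}{\partial t} = 0, \]

or

\[ \Delta\Psi - \frac{8\pi^2 m}{h^2}\,U^0\Psi - \frac{4\pi i m}{h}\,\frac{\partial \Psi}{\partial t}
 = -\frac{4\pi^2 m}{h^2}\,eAz\,(e^{2\pi i\nu t} + e^{-2\pi i\nu t})\,\Psi. \tag{36} \]

We may substitute the unperturbed kth proper function $\Psi_k^0 =
\psi_k^0 e^{2\pi i\nu_k t}$ for $\Psi$ in the perturbing term on the right.* We then have

\[ \Delta\Psi - \frac{8\pi^2 m}{h^2}\,U^0\Psi - \frac{4\pi i m}{h}\,\frac{\partial \Psi}{\partial t}
 = -\frac{4\pi^2 m}{h^2}\,eAz\,\psi_k^0\,\{e^{2\pi i(\nu_k + \nu)t} + e^{2\pi i(\nu_k - \nu)t}\}. \tag{37} \]

It is natural to take as a solution the sum of the normal solution and
two other terms of frequency $\nu_k + \nu$ and $\nu_k - \nu$ respectively, i.e.

\[ \Psi = \Psi_k^0 + \psi_+\,e^{2\pi i(\nu_k + \nu)t} + \psi_-\,e^{2\pi i(\nu_k - \nu)t}. \tag{38} \]

Substituting this in equation (37) and noticing that $\Psi_k^0$ causes the
left member to vanish, we obtain two differential equations, one for
$\psi_+$ and one for $\psi_-$, which we may write as the single equation

\[ \Delta\psi_\pm + \frac{8\pi^2 m}{h^2}\,(E_k \pm h\nu - U^0)\,\psi_\pm = -\frac{4\pi^2 m}{h^2}\,eAz\,\psi_k^0. \tag{39} \]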

This equation is again of the form of the perturbed Schrödinger
equation which was treated in § 1 (p. 707). It differs from the former
equation, however, by the presence of the term $\pm h\nu$. This has an
important consequence: the homogeneous equation
has no solution whatsoever. The parameter is $E_k \pm h\nu$, and a solution
exists only for the value $E_k$. The theory of differential equations shows
that the non-homogeneous equation always has a solution in this case.

* As in the procedure in § 1, we now omit the superscript 0 on $E_k$ and $\nu_k$;
there can be no doubt here that these quantities refer to the unperturbed
system.

The condition of orthogonality which must be satisfied when the
homogeneous equation has a solution, in order to prevent the ap-
pearance of the term with vanishing denominator $E_k - E_k$, is absent
here. As for the rest, we can then proceed according to the method
of § 1: we expand the function $z\psi_k^0$ of the right member
in a series involving the proper functions

\[ z\psi_k^0 = \sum_n a_{nk}\,\psi_n^0, \tag{40} \]

\[ a_{nk} = \int z\,\psi_k^0\,\psi_n^0\,d\tau, \tag{41} \]

and write $\psi_+$ and $\psi_-$ also as series of the same type:

\[ \psi_\pm = \sum_n b_{\pm n}\,\psi_n^0. \tag{42} \]

This is substituted in (39). Remembering that $\psi_n^0$ satisfies the equation

\[ \Delta\psi_n^0 + \frac{8\pi^2 m}{h^2}\,(E_n - U^0)\,\psi_n^0 = 0, \tag{43} \]

comparison of coefficients yields

\[ b_{\pm n} = -\frac{eA}{2}\,\frac{a_{nk}}{E_k - E_n \pm h\nu}. \tag{44} \]

Thus we can write as the complete solution

\[ \Psi = \psi_k^0\,e^{2\pi i\nu_k t} - \frac{eA}{2}\sum_n a_{nk}\,\psi_n^0
 \left\{\frac{e^{2\pi i(\nu_k + \nu)t}}{E_k - E_n + h\nu} + \frac{e^{2\pi i(\nu_k - \nu)t}}{E_k - E_n - h\nu}\right\}. \tag{45} \]

By using our working hypothesis that $e\Psi\bar\Psi$ gives the density of charge,
we can obtain the electric moment of the atom from this equation.
The polarization is concerned with the z-component:

\[ P_z = e\int z\,\Psi\bar\Psi\,d\tau. \tag{46} \]

Substituting the value resulting from equation (45), and using the
relation

\[ \int z\,\psi_n\,\psi_k\,d\tau = a_{nk}, \]

we obtain the following expression for the temporally variable part
of the moment (the constant part is of no further interest):

\[ P_z(t) = 2e^2 A\cos 2\pi\nu t\,\sum_n a_{nk}^2\,\frac{E_n - E_k}{(E_n - E_k)^2 - h^2\nu^2}. \tag{47} \]

Thus the temporally variable part has the same frequency as the
incident wave. Since the ratio of the amplitude of the electric moment
to that of the electric field strength is the polarizability $\alpha$, this quantity
is thus determined.
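
The ratio of the amplitude of the moment to the field amplitude A is easy to evaluate numerically. The sketch below is an added illustration, not part of the original text: the function name `polarizability`, the two-level toy system and the units with e = h = 1 are assumptions made only for the example. It sums the terms $a_{nk}^2 (E_n - E_k)/((E_n - E_k)^2 - h^2\nu^2)$ over all states n other than k and multiplies by $2e^2$.

```python
def polarizability(levels, k, a, nu, e=1.0, h=1.0):
    """Polarizability of state k at light frequency nu:
    2 e^2 * sum_n a[n][k]^2 (E_n - E_k) / ((E_n - E_k)^2 - (h nu)^2)."""
    total = 0.0
    for n, energy_n in enumerate(levels):
        if n == k:
            continue  # the state itself contributes no dispersion term
        gap = energy_n - levels[k]
        total += a[n][k] ** 2 * gap / (gap ** 2 - (h * nu) ** 2)
    return 2.0 * e ** 2 * total


if __name__ == "__main__":
    levels = [0.0, 1.0]                # hypothetical energies E_0, E_1
    a = [[0.0, 0.5], [0.5, 0.0]]       # hypothetical matrix elements a_nk
    print(polarizability(levels, 0, a, 0.5))  # atom in the lower state
    print(polarizability(levels, 1, a, 0.5))  # atom in the upper state
```

For the same pair of levels the contribution changes sign when the atom starts in the upper state, since the factor $E_n - E_k$ is then negative.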

If this value is substituted in equation (3') (p. 452), the value
obtained for the index of refraction in the state k is

\[ n - 1 = 4\pi N e^2 \sum_n a_{nk}^2\,\frac{E_n - E_k}{(E_n - E_k)^2 - h^2\nu^2}
 = \frac{4\pi N e^2}{h} \sum_n a_{nk}^2\,\frac{\nu_{nk}}{\nu_{nk}^2 - \nu^2}. \tag{48} \]

If we notice, in addition, that the spectral lines of the atom are given
by the frequencies $\nu_{nk}$, we see that we have obtained the characteristic
denominator $\nu_{nk}^2 - \nu^2$ of the classical theory of dispersion by means
of quantum mechanics. Thus, apart from the fact that the constants
in the numerator have a different meaning, we have here the con-
necting link with the old dispersion formula of p. 452.

In the language of the Bohr theory, the quantity $a_{nk}$ is a measure
of the probability of a transition connected with radiation of fre-
quency $\nu_{nk}$ for a spatial distribution of radiant energy as given by an
oscillator vibrating parallel to the z-axis. Thus $a_{nk}$ is a measure of
the strength of the classical equivalent oscillator. Comparing the
constants of the classical and wave mechanics dispersion formulae we
obtain

\[ \ldots = \frac{8\pi^2 m\,a_{nk}^2\,\nu_{nk}}{h}. \]

The terms having the negative sign were not included in the classical
formula. These appear if $E_n < E_k$. For this it is necessary for the sub-
stance examined to be in the excited state, while dispersion measure-
ments are usually made on substances in the ground state. Using
excited hydrogen atoms, Ladenburg succeeded in establishing the
existence of the negative dispersion terms.

The remaining components of the induced electric moment are of 
no consequence for the dispersion, but determine — together with the 
z-component— the scattering power of the atom with respect to the 
incident radiation. We shall not consider the general theory of the 

scattering of light by individual atoms, but shall treat the particularly 
important special case of the Raman effect. 

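If several proper functions of a system are active, i.e. if the
system is in a state of transition, combination frequencies will be
present in the scattered light. This is the Raman effect, predicted theoreti-
cally by Smekal, and found independently by experiment by Raman
and by Landsberg and Mandelstam at about the same time. Let the
normal system contain the frequencies $\nu_k$ and $\nu_l$; by means of
the same perturbation theory as above, the solution for the
perturbed system when a wave of frequency $\nu$ is incident on the
system is *

\[
\begin{aligned}
\Psi = \psi_k^0\,e^{2\pi i\nu_k t} + \psi_l^0\,e^{2\pi i\nu_l t}
 &- \frac{eA}{2}\sum_n \psi_n^0\,a_{nk}
 \left\{\frac{e^{2\pi i(\nu_k + \nu)t}}{E_k - E_n + h\nu} + \frac{e^{2\pi i(\nu_k - \nu)t}}{E_k - E_n - h\nu}\right\}\\
 &- \frac{eA}{2}\sum_n \psi_n^0\,a_{nl}
 \left\{\frac{e^{2\pi i(\nu_l + \nu)t}}{E_l - E_n + h\nu} + \frac{e^{2\pi i(\nu_l - \nu)t}}{E_l - E_n - h\nu}\right\}.
\end{aligned}
\tag{49}
\]

The conjugate solution is

\[
\begin{aligned}
\bar\Psi = \psi_k^0\,e^{-2\pi i\nu_k t} + \psi_l^0\,e^{-2\pi i\nu_l t}
 &- \frac{eA}{2}\sum_n \psi_n^0\,a_{nk}
 \left\{\frac{e^{-2\pi i(\nu_k + \nu)t}}{E_k - E_n + h\nu} + \frac{e^{-2\pi i(\nu_k - \nu)t}}{E_k - E_n - h\nu}\right\}\\
 &- \frac{eA}{2}\sum_n \psi_n^0\,a_{nl}
 \left\{\frac{e^{-2\pi i(\nu_l + \nu)t}}{E_l - E_n + h\nu} + \frac{e^{-2\pi i(\nu_l - \nu)t}}{E_l - E_n - h\nu}\right\}.
\end{aligned}
\tag{49'}
\]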

The scattering is determined by the moments

\[ M_x = e\int x\,\Psi\bar\Psi\,d\tau, \qquad M_y = e\int y\,\Psi\bar\Psi\,d\tau, \qquad M_z = e\int z\,\Psi\bar\Psi\,d\tau. \tag{50} \]

It is immediately evident that, in forming these expressions, terms of
frequency $\nu - \nu_{lk}$ and $\nu + \nu_{lk}$ are obtained, i.e. the effect of the
scattering is the same as if a quantum of the characteristic frequency
$\nu_{lk}$ were added to or subtracted from the quantum $h\nu$ of the incident
light. The constants appearing when the moment is determined are
of great importance. If, for example, we compute $M_y$, and use, in
addition to

\[ a_{nk} = \int z\,\psi_n^0\,\psi_k^0\,d\tau, \]

the abbreviation

\[ b_{nk} = \int y\,\psi_n^0\,\psi_k^0\,d\tau, \]

then the term containing $\nu + \nu_{lk}$ has coefficients which contain the
products $b_{nk}a_{nl}$. This implies that the intensity of a Raman scattered
ray is not determined by the intensity of the light of frequency $\nu_{lk}$
but by the intensities of lines resulting from transitions from higher
levels n to the levels k and l. Thus is explained the fact that lines
which are absent from the infra-red spectrum of a molecule on account
of the vanishing of $a_{lk}$ can nevertheless give rise to intensive Raman
lines.

* On account of the smallness of the perturbation we can neglect all products of
perturbing terms.

The theory developed here is still incomplete to the extent that it
predicts equal intensity for the two lines $\nu + \nu_{lk}$ and $\nu - \nu_{lk}$, and
connects the appearance of Raman lines with excitation of the mole-
cule. Actually, the “Stokes” line $\nu - \nu_{lk}$ can appear even at absolute
zero, while the “anti-Stokes” line $\nu + \nu_{lk}$ only sets in upon excitation,
i.e. at higher temperatures.* This is readily understood from the stand-
point of quanta, for at absolute zero a quantum of radiation can transmit
energy to the scattering molecule but can obtain no energy from it.

The questions of intensity and of polarization have been solved by 
a more refined theory of the effect. It is possible to gain an insight into 
the nature of what is going on by means of the following considerations 
which are in the spirit of the actual calculation: In questions dealing 
with dispersion, the classical concepts generally prove to be far-reach- 
ing in their validity. Classically, the scattering of light arises from the 
forced vibration of the electron shells, with consequent radiation. If 
the polarizability of, say, a diatomic molecule changes with the nuclear 
separation, then the amplitude of the radiation will vary with the same 
frequency as the nuclear vibration; but a spectroscope analyses this 
modulated wave into its component frequencies, as detailed on p. 58. 
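
The classical picture just described can be verified numerically. The following sketch is an added illustration, not part of the original text; the function names and the linear form assumed for the polarizability, `alpha0 + alpha1 * cos(2 pi nu_vib t)`, are assumptions chosen for simplicity. It compares the modulated moment with the sum of a carrier of frequency nu and two side lines of frequencies nu + nu_vib and nu - nu_vib.

```python
import math


def modulated_moment(t, nu, nu_vib, alpha0, alpha1, amplitude=1.0):
    """Induced moment when the polarizability oscillates with the nuclei."""
    alpha_t = alpha0 + alpha1 * math.cos(2 * math.pi * nu_vib * t)
    return alpha_t * amplitude * math.cos(2 * math.pi * nu * t)


def sideband_sum(t, nu, nu_vib, alpha0, alpha1, amplitude=1.0):
    """The same moment written as carrier plus two shifted lines."""
    carrier = alpha0 * amplitude * math.cos(2 * math.pi * nu * t)
    upper = 0.5 * alpha1 * amplitude * math.cos(2 * math.pi * (nu + nu_vib) * t)
    lower = 0.5 * alpha1 * amplitude * math.cos(2 * math.pi * (nu - nu_vib) * t)
    return carrier + upper + lower


if __name__ == "__main__":
    for t in (0.0, 0.013, 0.4, 1.7):
        print(modulated_moment(t, 10.0, 1.5, 2.0, 0.3),
              sideband_sum(t, 10.0, 1.5, 2.0, 0.3))
```

In this purely classical sketch the two side lines have equal amplitude, which is the same shortcoming noted above for the simple wave-mechanical treatment.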


4. The Surmounting of Potential Barriers in the Wave Mechanics.


The phenomenon of radioactive disintegration, which will be con-
sidered in Chapter XLII (p. 739), presents us with the following problem.

By experiments on α-particle scattering it has been shown that
for Uranium I the Coulomb field holds down to a distance of $3 \times 10^{-12}$
cm. Apart from a nuclear charge which is two units less, the same
potential curve is valid also for the element UX$_1$ which results by the
ejection of an α-particle. For a closer approach, the potential curve
must finally bend downward, for in the U I atom the α-particle is
generally bound, and hence must be at a minimum of the potential
curve (fig. 2). Again, an α-particle ejected by U I has an amount of
energy which corresponds to its being released from rest at a distance
$r_1 = 6 \times 10^{-12}$ cm. and being allowed to accelerate in the field of the
nucleus. The question arises: How does the α-particle cross the
potential barrier between its normal position and the point $r_1$? There
is no possibility of explaining this fact on the basis of classical me-
chanics. Gamow and, independently, Condon first recognized that in
wave mechanics it is possible to encounter a particle where it does
not belong on the basis of its classical energy, i.e. every potential
barrier is surmountable. Indeed, according to the statistical inter-
pretation of the Schrödinger $\psi$ function there is a definite probability
that the electron of the hydrogen atom will be found far from the
nucleus at a place where it could not be expected at all on the basis
of its energy (proper value) parameter.

* The terminology is based on Stokes' law of fluorescence, according to which
the fluorescent radiation is always of longer wave-length than the exciting light. The
law follows from a consideration of absorption and emission in terms of an
energy-level diagram. If the atom or molecule is not in an excited state to begin with,
the energy of the re-radiated photon can never be greater than that of the one that is
absorbed.

[Fig. 2]

We shall consider the surmounting of the potential barrier for the
simple model investigated by Gamow, since we are interested only in
the essential features here. Assume the potential energy of a particle
to have a constant value U from x = 0 to x = l and to vanish in the
remainder of the region. Now allow a pencil of α-particles (or, equally
well, of electrons) to pass from left to right. According to classical
mechanics, all particles whose kinetic energy is smaller than U will
be thrown back, while all particles possessing greater energy will pass
the barrier. According to the quantum mechanics, neither of these
statements is exactly true. We limit our present considerations to
the determination of the number of particles with smaller energy which
succeed in crossing the obstacle. To this end we must divide the space
into three regions x < 0, 0 < x < l, x > l, and assume a Schrödinger
wave for each. The boundary conditions must also be taken into account.

[Fig. 3]

The situation is simplest in the third region; there we certainly
have but one wave travelling to the right, while partial reflec-
tions are to be expected at the boundaries, so that we must have
waves moving in both directions in the other two regions (cf. fig. 3).
The Schrödinger equation in Regions I and III is

\[ \frac{d^2\psi}{dx^2} + \frac{8\pi^2 M}{h^2}\,E\,\psi = 0, \]

while in Region II

\[ \frac{d^2\psi}{dx^2} + \frac{8\pi^2 M}{h^2}\,(E - U)\,\psi = 0. \]
Since we are concerned only with the ratios of the amplitudes, we set
the amplitude in Region III equal to unity, and thus obtain the three
solutions for the three regions:

\[
\begin{aligned}
\psi_1 &= a_1 e^{ik_0 x} + b_1 e^{-ik_0 x},\\
\psi_2 &= a_2 e^{ikx} + b_2 e^{-ikx},\\
\psi_3 &= e^{-ik_0 x},
\end{aligned}
\]

where

\[ k_0 = \frac{2\pi}{h}\sqrt{2ME}, \qquad k = \frac{2\pi}{h}\sqrt{2M(E - U)}. \]

The solutions must merge continuously at the boundaries in such a
way that at each boundary the number of arriving particles must, on
the average, be equal to the sum of the number reflected and the
number transmitted. This signifies the continuity of the $\psi$ function.
On account of the existence of the Schrödinger equation the first
derivative must be continuous also, otherwise $\Delta\psi$ and hence $\psi$ would
be infinite at the boundaries. These conditions yield the following
equations at x = l:

\[
\left.
\begin{aligned}
e^{-ik_0 l} &= a_2 e^{ikl} + b_2 e^{-ikl}\\
-\frac{k_0}{k}\,e^{-ik_0 l} &= a_2 e^{ikl} - b_2 e^{-ikl}
\end{aligned}
\right\}
\tag{55}
\]

Solved for $a_2$ and $b_2$ these become

\[ a_2 = \frac{1}{2}\left(1 - \frac{k_0}{k}\right) e^{-i(k_0 + k)l}, \qquad
   b_2 = \frac{1}{2}\left(1 + \frac{k_0}{k}\right) e^{-i(k_0 - k)l}. \]


Similarly, at x = 0 we obtain

\[ a_1 + b_1 = a_2 + b_2, \qquad a_1 - b_1 = \frac{k}{k_0}\,(a_2 - b_2). \tag{56} \]

We are primarily interested in $b_1$, i.e. the amplitude of the incident
wave relative to that of the wave traversing Region III. Substituting
the values of $a_2$ and $b_2$ from (55) in (56), there results

\[ b_1 = \frac{e^{-ik_0 l}}{4}\left\{\left(1 + \frac{k}{k_0}\right)\left(1 + \frac{k_0}{k}\right) e^{ikl}
 + \left(1 - \frac{k}{k_0}\right)\left(1 - \frac{k_0}{k}\right) e^{-ikl}\right\}. \]

The intensity of the wave is obtained by multiplying the amplitude by
its complex conjugate value. For E > U this yields

\[ b_1\bar b_1 = \frac{1}{2}\left\{1 + \frac{1}{4}\left(\frac{k_0}{k} + \frac{k}{k_0}\right)^2\right\}
 + \frac{1}{2}\left\{1 - \frac{1}{4}\left(\frac{k_0}{k} + \frac{k}{k_0}\right)^2\right\}\cos 2kl. \]

According to classical mechanics, all particles would reach Region III
when E > U; the above formula gives the ratio of the number of
incident particles to the number of particles passing through as given
by wave mechanics. The case E < U is, however, of greater impor-
tance. In this case we take

\[ k = i\kappa = \frac{2\pi i}{h}\sqrt{2M(U - E)}, \]

and obtain

\[ b_1\bar b_1 = \frac{1}{2}\left\{1 - \frac{1}{4}\left(\frac{k_0}{\kappa} - \frac{\kappa}{k_0}\right)^2\right\}
 + \frac{1}{2}\left\{1 + \frac{1}{4}\left(\frac{k_0}{\kappa} - \frac{\kappa}{k_0}\right)^2\right\}\cosh 2\kappa l. \]

Since cosh x behaves like $e^x$ when x is large, we arrive at the essential
feature of the connexion between energy and radioactive constant
in spite of the simplicity of the model used. The intensity of the inci-
dent wave, referred to that of the wave in Region III, increases ex-
ponentially with the difference U − E, or, reversing the statement,
the ratio of the number of particles which succeed in crossing the
barrier to the number arriving there decreases exponentially with
increasing U − E.
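
The matching procedure for the rectangular barrier can be carried out directly with complex arithmetic. The sketch below is an added illustration, not part of the original text; the function names and the dimensionless sample values are assumptions. It takes the wave in Region III as exp(-i k0 x), matches value and slope at x = l and x = 0, and compares the resulting intensity of the incident wave with closed expressions in cos 2kl (for E > U) and cosh 2κl (for E < U, where k is replaced by iκ).

```python
import cmath
import math


def incident_intensity(k0, k, l):
    """|b1|^2 from matching at x = l and x = 0; k may be imaginary (k = i*kappa)."""
    a2 = 0.5 * (1 - k0 / k) * cmath.exp(-1j * (k0 + k) * l)
    b2 = 0.5 * (1 + k0 / k) * cmath.exp(-1j * (k0 - k) * l)
    b1 = 0.5 * ((1 + k / k0) * b2 + (1 - k / k0) * a2)
    return abs(b1) ** 2


def closed_form_above(k0, k, l):
    """Closed expression for E > U (real k)."""
    s = 0.25 * (k0 / k + k / k0) ** 2
    return 0.5 * (1 + s) + 0.5 * (1 - s) * math.cos(2 * k * l)


def closed_form_below(k0, kappa, l):
    """Closed expression for E < U (k = i*kappa)."""
    d = 0.25 * (k0 / kappa - kappa / k0) ** 2
    return 0.5 * (1 - d) + 0.5 * (1 + d) * math.cosh(2 * kappa * l)


if __name__ == "__main__":
    print(incident_intensity(1.0, 0.6, 2.0), closed_form_above(1.0, 0.6, 2.0))
    print(incident_intensity(1.0, 0.8j, 2.0), closed_form_below(1.0, 0.8, 2.0))
```

The reciprocal of the returned value is the fraction of particles that cross the barrier; with these sample numbers it falls quickly as the width l is increased.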

The result has an importance reaching far beyond the special 
problem from which it has been derived here. The overcoming of 
obstacles in quantum mechanics which appear insurmountable in 
ordinary mechanics is important in considering such problems as the 
escape of electrons from metallic surfaces and the kinetics of chemical 
reactions, but these questions cannot be entered into in this work.

5. Energy Bands of Electrons in Metals.

(a) General Survey of the Problem 

The concept of an electron gas moving freely among the ions of the 
lattice, when used in conjunction with the Fermi statistics, is able to
account correctly for many features of the properties of metals (see 
p. 602). As mentioned there, however, further development of the 
theory must take account of the interaction of the electrons with the 
lattice ; we can no more neglect the large electrostatic forces here than 
we could in dealing with electrolytes. In this way only we arrive at an 
understanding of phenomena which were formerly not even quali- 
tatively explained, e.g. the reversed sign of the Hall effect occurring 
for some metals (cf. p. 733). Further, such a theory would be expected 
to furnish a criterion as to whether or not a given lattice made up of 
atoms of a specified kind will possess metallic properties. These ques-
tions are actually answered by incorporating the Fermi statistics into
a wave-mechanical treatment of the motion of electrons in the periodic 
potential field of the lattice. Because of the singular behaviour of 
particles with respect to potential barriers (cf. § 4), this motion differs 
from that to be expected on the basis of particle mechanics. This is 
most easily seen in the one-dimensional case, to which we restrict our 
treatment. This calculation involves less work than the one in three 
dimensions, but qualitatively— and that is all that concerns us for the 
present — it yields the same result. 

Suppose an electron moving in the direction of the positive x-axis
(fig. 4) to be momentarily in the potential trough at P. According to
quantum mechanics it can surmount the next crest even if its kinetic
energy is less than the amount corresponding to the crest. On the
other hand, even in instances where it has sufficient energy, there
exists a given, although slight, possibility of reflection, as mentioned
above. If the de Broglie waves reflected from the several crests agree
in phase, the result may be a reflection of such magnitude that the
motion to the right becomes impossible. This is nothing more nor less
than the reflection in depth for X-rays, which was treated in detail on
p. 388. Precisely this circumstance is the key to an understanding of
the original questions.

(b) Wave-Mechanical Expression for Current Density

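On p. 703 the expression $\Psi\bar\Psi$ was shown to be a measure of the
density of charge. Now an analogous expression for the current density
will be deduced. Since a current is a process occurring in time we
must start from the Schrödinger equation (p. 717), which depends on
the time:

\[ \Delta\Psi - \frac{8\pi^2 m}{h^2}\,U\Psi - \frac{4\pi i m}{h}\,\frac{\partial \Psi}{\partial t} = 0. \tag{63} \]

According to a theorem in the theory of functions of a complex
variable, the conjugate function $\bar\Psi$ satisfies an equation in which all
coefficients are replaced by their conjugates, since the real and the
imaginary parts must separately satisfy the same equation. Hence $\bar\Psi$
satisfies the equation

\[ \Delta\bar\Psi - \frac{8\pi^2 m}{h^2}\,U\bar\Psi + \frac{4\pi i m}{h}\,\frac{\partial \bar\Psi}{\partial t} = 0. \tag{64} \]

Multiplying (63) by $\bar\Psi$ and (64) by $\Psi$ and subtracting, we have

\[ \frac{4\pi i m}{h}\,\frac{\partial(\Psi\bar\Psi)}{\partial t} = \bar\Psi\,\Delta\Psi - \Psi\,\Delta\bar\Psi
 = \operatorname{div}\,(\bar\Psi\,\operatorname{grad}\Psi - \Psi\,\operatorname{grad}\bar\Psi). \tag{65} \]

The appearance of the gradient of a complex number may seem strange;
but one may compute exactly as with the gradient of a real quantity,
since it represents a combination of the gradient of the real part and
that of i times the imaginary part.

Since $\Psi$ has been normalized in such a way as to make $\int \Psi\bar\Psi\,d\tau = 1$,
the time rate of change of the charge density $\rho$ is given by $\partial(e\Psi\bar\Psi)/\partial t$.
Also, as for any current, the equation of continuity of the electrical
current is (cf. p. 196)

\[ -\frac{\partial \rho}{\partial t} = \operatorname{div}\,\mathbf{i}, \tag{66} \]

where $\mathbf{i}$ is the current density vector. By (65) the left member is
represented in wave-mechanical form as

\[ -e\,\frac{\partial(\Psi\bar\Psi)}{\partial t} = \frac{eh}{4\pi i m}\,\operatorname{div}\,(\Psi\,\operatorname{grad}\bar\Psi - \bar\Psi\,\operatorname{grad}\Psi). \]

Thus the current density is

\[ \mathbf{i} = \frac{eh}{4\pi i m}\,(\Psi\,\operatorname{grad}\bar\Psi - \bar\Psi\,\operatorname{grad}\Psi). \tag{67} \]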

The total current carried is given by 


1= AY grad Y - Y grad Y) dr. 
immJ 


• • 


( 68 ) 
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While this cannot be done for (67), it is possible to put (68) into a 
simpler form. Inasmuch as T vanishes exponentially at i n fin ity, the 

integral dr over an infinite sphere is zero. Hence by equation 

(59), p. 31, 

0 = 'F'FdS = J grad'F'F dr=J('¥ gradT -j-T grad T*) dr 9 


and so the total current becomes 


- 2= Z' 1 ' 


(69) 


Since we have T* = ipe iuit and 'F = ipe~ ioit } it is also possible to write 

eh 


1 = ^ * dT (69<) 


If $\Psi = \psi e^{-i\omega t}$ and if $\psi$ is real, the expression for $\mathbf{i}$ vanishes. This may at first sight seem surprising, for on p. 698 we used real $\psi$ functions in treating the hydrogen atom. However, we know from its magnetic behaviour that a rotational current is present in this case, as required even by the revolving electron of the simple atom model. In reality, the existence of a magnetic field compels us to carry out the hydrogen calculation using complex $\psi$ functions, which yield a current. According to wave mechanics there can be no current without a magnetic field, since the functions give the mean values, and in the absence of a magnetic field all orientations of the orbits occur with equal frequency and so cancel on the average.
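The vanishing of the current for real $\psi$ may be checked numerically. The following is only a sketch in Python: the units are chosen so that $e = h/2\pi = 1$ and the electron mass is 1 (an assumption made for convenience), and the name `current_density` and the finite-difference step are illustrative choices. It evaluates the one-dimensional form of (67) for a travelling wave $e^{ikx}$ and for the real function $\cos kx$.

```python
import cmath
import math


def current_density(psi, x, step=1e-5, e=1.0, hbar=1.0, mass=1.0):
    """One-dimensional form of (67): (e*hbar / 2i*mass) * (psi* psi' - psi psi'*).

    hbar stands for h / 2 pi, so e*hbar/(2i*mass) equals e*h/(4 pi i mass).
    The derivative psi' is approximated by a central difference.
    """
    dpsi = (psi(x + step) - psi(x - step)) / (2.0 * step)
    value = (e * hbar / (2j * mass)) * (
        psi(x).conjugate() * dpsi - psi(x) * dpsi.conjugate()
    )
    return value.real


k = 2.0  # illustrative wave number


def travelling(x):
    # complex proper function: a wave moving from left to right
    return cmath.exp(1j * k * x)


def standing(x):
    # real proper function: a standing wave
    return complex(math.cos(k * x))


i_travelling = current_density(travelling, 0.3)
i_standing = current_density(standing, 0.3)
print(i_travelling, i_standing)
```

In these units the travelling wave carries the current $k$ per unit density, while the real function carries none, in agreement with the remark above.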


(c) Electron in a Periodic Potential Field 

Let us again restrict our consideration to the one-dimensional case, which depicts all the essential facts concerning the behaviour of electrons in a space lattice. The form of the Schrödinger equation that does not involve the time is

$$\frac{d^{2}\psi}{dx^{2}} + \frac{8\pi^{2}m_e}{h^{2}}(E - V)\psi = 0, \qquad (70)$$

where $V$ is assumed only to vary periodically in space with an interval equal to the lattice constant $a$, i.e.

$$V(x + a) = V(x). \qquad (71)$$

If $\psi_m(x)$ is a solution corresponding to the proper value $E_m$, then $\psi_m(x + a)$ is another solution corresponding to the same characteristic value; because if we set $x + a = x'$ in (70) we get

$$\frac{d^{2}\psi}{dx'^{2}} + \frac{8\pi^{2}m_e}{h^{2}}\,[E - V(x' - a)]\,\psi = 0. \qquad (72)$$
On account of the periodicity of $V$, this is again identical with (70).

Since we rule out the degenerate case, this is possible only if both solutions differ by a constant factor:

$$\psi(x + a) = \rho\,\psi(x). \qquad (73)$$

In a crystal of some extent, the condition $\psi$ must recur after a certain distance. Even if we take the whole crystal to be a unit for purposes of calculation, we may imagine it to be succeeded by an identical region. In the one-dimensional case, assume this fundamental unit region to be

$$L = Ga, \qquad (74)$$

where $G$ is a very large number. Since the potential is due to the ions of the lattice, $G$ also is the number of ions in the unit region. By repeated application of (73),

$$\psi(x + Ga) = \psi(x) = \rho^{G}\psi(x),$$

from which it follows that

$$\rho = e^{2\pi i m/G} \quad\text{and}\quad \psi(x + a) = e^{2\pi i m a/Ga}\,\psi(x), \qquad m = 1, 2, \ldots, G. \qquad (75)$$

This property is possessed by the function

$$\psi(x) = e^{2\pi i m x/Ga}\,u(x), \qquad (76)$$

if $u$ is a periodic function whose period is $a$. If $V$ is zero, the solution is simply

$$\psi(x) = e^{2\pi i m x/Ga} \times \text{const}. \qquad (77)$$

This represents a plane wave which moves from left to right or from 
right to left, depending on the algebraic sign. The periodic potential 
field, in conjunction with u(x), results in a spatial modulation of the 
wave (cf. p. 57 and fig. 5 here). 
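The property (75) of a function of the form (76) is easily verified numerically. The sketch below is in Python; the particular periodic function `u`, the values of `a`, `G` and `m`, and the test point are arbitrary assumptions chosen only for illustration.

```python
import cmath
import math

a = 1.0   # lattice constant (illustrative value)
G = 8     # number of cells in the fundamental region (illustrative value)
m = 3     # index of the state, 1 <= m <= G


def u(x):
    # any function with period a will serve; this one is an arbitrary choice
    return 1.0 + 0.3 * math.cos(2.0 * math.pi * x / a)


def psi(x):
    # function of the form (76): plane wave modulated by the periodic u(x)
    return cmath.exp(2j * math.pi * m * x / (G * a)) * u(x)


rho = cmath.exp(2j * math.pi * m / G)   # constant factor of (73) and (75)

x0 = 0.37
shift_one_cell = psi(x0 + a) - rho * psi(x0)   # should vanish by (75)
shift_region = psi(x0 + G * a) - psi(x0)       # should vanish by periodicity
print(abs(shift_one_cell), abs(shift_region), abs(rho ** G - 1.0))
```

A displacement by one lattice constant multiplies the function by $\rho$, and $G$ such displacements restore it, since $\rho^{G} = 1$.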

Introduce $k_m$ as an abbreviation for $2\pi/\lambda$. Then

$$k_m = \frac{2\pi m}{Ga}, \qquad (78)$$

$$\lambda_m = \frac{Ga}{m}, \qquad m = 1, 2, \ldots, G. \qquad (79)$$

According to p. 695, the relation between the wave-length and speed of a free electron is

$$\lambda = \frac{h}{m_e v} = \frac{2\pi}{k}, \quad\text{or}\quad v = \frac{kh}{2\pi m_e}. \qquad (80)$$

Consequently the energy of the free electron is

$$E = \tfrac{1}{2}m_e v^{2} = \frac{h^{2}k^{2}}{8\pi^{2}m_e} = \frac{k^{2}}{\mu}, \qquad (81)$$

where

$$\mu = \frac{8\pi^{2}m_e}{h^{2}}. \qquad (82)$$

Since a free electron may have any speed, $E$ and $k$ are related parabolically.

As long as the potential is small compared with the electron energy, we can start from the solution for free electrons and proceed by the established methods of quantum-mechanical perturbation theory, considering the periodic potential as a small perturbation. For this purpose we develop $V$ in a Fourier series:

$$V = \sum_{n=-\infty}^{+\infty} v_n e^{2\pi i n x/a}, \qquad (83)$$

where, according to p. 55, $v_{-n} = v_n^{*}$. Further, let $V$ be normalized to make the mean value $v_0$ equal zero.

The coefficients $v_n$ are to be considered given, and we seek the Fourier coefficients of the function $u_m$ that replaces $u$ in (76):

$$\psi_m = e^{ik_m x}\,u_m(x). \qquad (84)$$

For simplicity, we omit the index $m$ and write

$$u = \sum_{n=-\infty}^{+\infty} c_n e^{2\pi i n x/a}. \qquad (85)$$


When these Fourier series are substituted in (70), the product of two series appears in the term $V\psi$. If the terms having the same exponential factor are collected, equations determining the $c_n$ are obtained. Thus, on collecting the terms involving $e^{2\pi i\cdot 4x/a}$, the linear parts of (70) yield a term containing $c_4$, but the product $V\psi$ yields a sum of terms whose index sum is 4, e.g. $v_1c_3$, $v_2c_2$, $v_3c_1$. The equations, solved for $c_n$, are

$$c_n = \frac{\sum_{\nu} v_\nu c_{n-\nu}}{E - \dfrac{(k + 2\pi n/a)^{2}}{\mu}}. \qquad (86)$$

As long as the denominator does not vanish, all of the $c_n$, with the exception of $c_0$, may be allowed to approach zero as $v_n$ does. This means that the proper functions and proper values of the electrons in the weak, periodic field differ but slightly from those in the absence of forces. However, it must be noted that the $k_m$ now form a discrete, yet very dense, set. The situation is different, however, if the denominator vanishes; then the $c_n$ can no longer be small. Thus for $k = -\pi/a$ we have

$$c_1 = \frac{\sum_{\nu} v_\nu c_{1-\nu}}{E - \dfrac{\pi^{2}}{a^{2}\mu}}, \qquad (87)$$

where, according to (81), $E = \pi^{2}/a^{2}\mu$.

The same denominator is obtained for $c_0$. In order to investigate the character of the result at these places we approach $-\pi/a$ from neighbouring points by trying to determine $c_0$ and $c_1$ for $k = (-\pi/a) + \delta$, the corresponding proper value being $E = (\pi^{2}/a^{2}\mu) + \epsilon$.

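Neglecting all coefficients shown above to be small, we get the following equations for $c_0$ and $c_1$:

$$c_0\left(\epsilon + \frac{2\pi\delta}{a\mu}\right) = v_{-1}c_1, \qquad c_1\left(\epsilon - \frac{2\pi\delta}{a\mu}\right) = v_1c_0. \qquad (88)$$

These linear, homogeneous equations can be satisfied only by the vanishing of the determinant. This leads to an equation for $\epsilon$:

$$\epsilon^{2} = v_{-1}v_1 + \frac{4\pi^{2}\delta^{2}}{a^{2}\mu^{2}}, \qquad (89)$$

or, because $v_{-1} = v_1^{*}$,

$$\epsilon = \pm\sqrt{|v_1|^{2} + \frac{4\pi^{2}\delta^{2}}{a^{2}\mu^{2}}}.$$

In the limit, with $\delta = 0$,

$$\epsilon = \pm|v_1|. \qquad (90)$$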
Thus for $k = -\pi/a$ we get two values of $E_k$, of which the smaller is a continuation of the $E$ values that correspond to smaller values of $k$. But this means that the parabola mentioned on p. 729 has a break at $k = -\pi/a$, and the same thing occurs at $k = \pm 2\pi/a$, $3\pi/a$, &c. Thus there are breaks in the spectrum of proper values. These places correspond exactly to the reflection in depth of the de Broglie waves from the row of points, similar to that of X-rays considered on p. 388. For $k = \pm n\pi/a$ the waves reflected from the several points are in phase, so that no wave moving from left to right can arise. In the three-dimensional case these breaks occur for those directions of propagation and those wave-lengths for which, according to p. 388, reflection takes place. The trend of the proper values is as shown in fig. 6.
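The break of magnitude $2|v_1|$ implied by (90) can be reproduced by solving the pair of equations for $c_0$ and $c_1$ without approximation. The following Python sketch assumes units in which $a = \mu = 1$ and an illustrative value of $|v_1|$; the function name `two_band_energies` is hypothetical. It retains only the two free waves $k$ and $k + 2\pi/a$, as in the text.

```python
import math


def two_band_energies(k, a, mu, v1):
    """Proper values of the two coupled equations for c_0 and c_1.

    The determinant condition (E - e0)(E - e1) = |v1|^2 is solved in closed
    form, where e0 and e1 are the free-electron energies k^2/mu of the waves
    k and k + 2*pi/a.
    """
    e0 = k ** 2 / mu
    e1 = (k + 2.0 * math.pi / a) ** 2 / mu
    mean = 0.5 * (e0 + e1)
    half_difference = 0.5 * (e0 - e1)
    root = math.sqrt(half_difference ** 2 + abs(v1) ** 2)
    return mean - root, mean + root


a, mu, v1 = 1.0, 1.0, 0.05   # illustrative values only
lower, upper = two_band_energies(-math.pi / a, a, mu, v1)
gap = upper - lower
print(lower, upper, gap)
```

At $k = -\pi/a$ the two free-electron energies coincide, the square root reduces to $|v_1|$, and the two proper values lie $|v_1|$ below and above $\pi^{2}/a^{2}\mu$.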

It is important, for what follows, to notice that the curve is hori- 
zontal at the points of discontinuity, as computation shows. 

With increasing perturbation the breaks in the energy spectrum become broader, since the jump in $E$ at $k = n\pi/a$ is conditioned by $v_n$. However, for large perturbation, the present point of departure of the calculation is no longer the correct one. In that case, one starts with the proper functions of the electrons bound to an ion of the lattice. The result is that for very strongly bound electrons (K shell) the influence of the neighbours ceases to be important. For the outer electrons the sharp term values of the free atom change to broad bands that correspond to the above, except that the breaks no longer occur exactly at $n\pi/a$. These bands are designated according to the terms of the free atoms from which they developed. Thus one speaks of the 4s or of the 3d band of potassium.
(d) Relation between Current and Energy Values

In order to connect the behaviour of electrons in a periodic potential field with the conduction properties of matter we shall deduce an important relation between the current and the function $E(k)$. Despite the fact that the proper values constitute a discrete set, they lie so close together that they may be considered continuous inside a band. If $k$ is assumed to increase by a small amount $\alpha$, the $E_k$ will change by an amount $\beta$ and the function $u_k$ (which has a period $a$) will change by a function $v$. Thus to $k + \alpha$ there corresponds the proper value $E_k + \beta$ and the proper function $e^{i(k+\alpha)x}(u_k + v)$, or, putting

$$\chi = e^{ikx}\,v, \qquad (91)$$

we have

$$\psi = (\psi_k + \chi)\,e^{i\alpha x}. \qquad (92)$$


To determine $\beta$ and $\chi$ we substitute into the Schrödinger equation. Since $\psi_k$ satisfies equation (70), we get the following equation for $\chi$ by discarding the products $\chi\alpha$ and $\chi\beta$:

$$\frac{d^{2}\chi}{dx^{2}} + \mu(E_k - V)\chi = -\beta\mu\psi_k - 2i\alpha\frac{d\psi_k}{dx}. \qquad (93)$$

According to p. 708 this inhomogeneous equation will have a solution only if the solution $\chi = \psi_k$ of the homogeneous equation is orthogonal to the right side. The situation differs from that on p. 708, however, in that $\psi_k$ is complex. In this case the condition for orthogonality is similar to that on p. 708 except that in place of $\psi_k$ we have the conjugate value $\psi_k^{*}$. Then, since $\int\psi_k\psi_k^{*}\,d\tau = 1$,

$$\beta = -\frac{2i\alpha}{\mu}\int\psi_k^{*}\frac{d\psi_k}{dx}\,d\tau. \qquad (94)$$


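By equation (69), p. 727, the current is given by

$$\mathbf{I} = \frac{eh}{2\pi i m_e}\int\psi_k^{*}\operatorname{grad}\psi_k\,d\tau. \qquad (95)$$

Also, in the one-dimensional case,

$$\frac{\beta}{\alpha} = \frac{dE}{dk}.$$

Then, on account of (82), p. 729, we have the important relation

$$\frac{\beta}{\alpha} = \frac{dE}{dk} = \frac{h}{2\pi e}\,I, \qquad (96)$$

or

$$I = \frac{2\pi e}{h}\,\frac{dE}{dk}. \qquad (97)$$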
The quantity $dE/dk$ vanishes at the edges of the bands, as mentioned above. Hence the current, which is carried by electrons having the limiting energy, is zero. Physically, this remarkable result means that because of the reflection in depth of the de Broglie waves there is now a standing wave rather than a progressive one.
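The vanishing of $dE/dk$ at the edge of a band can be illustrated with the two-wave approximation used above for weak potentials. The Python sketch below is only indicative: it assumes units with $a = \mu = 1$, an illustrative $|v_1|$, and hypothetical function names. It differentiates the lower proper value numerically at the edge $k = -\pi/a$ and at a point well inside the band.

```python
import math


def lower_band(k, a=1.0, mu=1.0, v1=0.05):
    """Lower proper value when only the waves k and k + 2*pi/a are coupled."""
    e0 = k ** 2 / mu
    e1 = (k + 2.0 * math.pi / a) ** 2 / mu
    return 0.5 * (e0 + e1) - math.sqrt((0.5 * (e0 - e1)) ** 2 + abs(v1) ** 2)


def slope(k, step=1e-5):
    """Central-difference estimate of dE/dk, to which the current (97) is proportional."""
    return (lower_band(k + step) - lower_band(k - step)) / (2.0 * step)


slope_at_edge = slope(-math.pi)            # edge of the band, k = -pi/a
slope_inside = slope(-0.3 * math.pi)       # well inside the band
free_slope_inside = 2.0 * (-0.3 * math.pi) / 1.0   # dE/dk = 2k/mu for a free electron
print(slope_at_edge, slope_inside, free_slope_inside)
```

Inside the band the slope is practically that of the free-electron parabola, whereas at the edge it vanishes, so that by (97) the electrons of limiting energy carry no current.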

(e) Conduction Properties of Matter; Algebraic Sign of the Hall Effect

We are now in possession of all the essentials for building the wave- 
mechanical theory of metals. To be added, as in the case of an electron 
gas, is Pauli’s principle, i.e. the Fermi statistics. We are then in a
position to give qualitative answers to the more important questions. 
Space does not allow a description of the quantitative three-dimensional 
theory, for which the special literature should be consulted.* 

A band contains G closely spaced energy levels. The Pauli principle 
requires that every quantum state be only singly occupied, and on 
account of the two possible spin orientations we can thus accommo- 
date 2 G electrons in each band. There are G atoms in the unit region. 
Without regard to spin, each electron in an atomic configuration is 
characterized by the quantum numbers n, l, m (cf. p. 679). For each
trio of numbers, designating a single band, there are two possible 
electrons per atom. This is precisely the case for closed shells. For 
example, this is true for the inner as well as for the outermost shells 
of the alkali halide ions, which are like the rare gases in this respect. 
The alkalis, on the other hand, have a single electron outside the rare- 
gas configuration. But, because of the spin, the Pauli principle allows 
for the presence of two such electrons, so that the band corresponding 
to the valence electron is only half-occupied. 

What influence on conductivity does the extent of occupation of the 
bands have? In an electric field the electron obtains energy and so 
moves up to the higher levels within a band. In general, however, the 
energy that can be obtained from the field is not enough to force a 
transition to the next band. If the upper levels are already occupied 
(rare gases) a displacement of this kind is not possible, and the material is an insulator. The case is otherwise for the alkali atoms, where the half-filled state of the bands provides enough room for such displacements, and so the alkalis in the solid state are typical examples of metallic conductors.

The case of the alkaline earths is more complicated. The ns shell is fully occupied by two electrons. That these elements are nevertheless conductors is explained by the fact that the adjoining band lies so close to the ns band that the two overlap. The effect of the field is a double one: the number of electrons in the next higher band is increased, and vacancies occur in the ns band. The effect of these vacancies will be explained below. Another situation that may arise is that two bands are separated, but that the distance between them is so small that thermal motion suffices to elevate individual electrons to the upper band. Substances where this occurs are the simplest electronic semi-conductors. For such materials the conductivity does not decrease with increasing temperatures, as it does for metals; rather, corresponding to the increasing number of electrons reaching the next band, the conductivity increases exponentially.

* See, for example, F. Seitz, The Modern Theory of Solids, McGraw-Hill Book Co., 1940.


[Fig. 7: rectangular plate carrying a current, with the points $P_1$ and $P_2$]


The Hall effect, mentioned above on several occasions, is the following phenomenon: Let a current flow through a rectangular plate in the direction of its longer side (fig. 7). The equipotential lines will be perpendicular to the lines of flow. Between two points $P_1$ and $P_2$ lying on an equipotential there will be no potential difference, and a voltmeter joining them will read zero. If we now apply a magnetic field directed outward and perpendicular to the page, there will be a force $-e[\mathbf{v}\mathbf{H}]$ on each moving electron and it will be deviated to the right.* The accumulation of electrons on the right side gives rise to a transverse electric field held in equilibrium by the magnetic force. The net effect is that the lines of flow remain parallel to the side of the sample, but the points $P_1$ and $P_2$ no longer are at the same potential. For electrons, which are carriers of negative charge, $P_2$ must be negative with respect to $P_1$. This is the case, for instance, for alkalis.

* Notice that $\mathbf{v}$ for an electron is directed from − to +.
For many metals, such as zinc, the sign is reversed, as if positive carriers were taking part in the flow of current. The band theory accounts for this readily: It seems fairly obvious (and may be proved strictly) that the “hole” in an almost fully occupied band acts like a positive charge. In an instance where two bands overlap as mentioned above, which happens for the elements of the second column of the periodic table, the conduction is made up of two parts: that of the holes and that of the electrons of the higher band. If the former preponderates, the sign of the Hall effect reverses.

The next task is to compute the conductivity and its temperature 
dependence on the basis of our picture of electron waves permeating 
the metallic crystal. As long as the lattice is a perfect one there will 
be no possibility, according to the above treatment, of transferring 
energy to the atoms of the lattice, i.e. a uniform metal has zero resis- 
tance at absolute zero. At higher temperatures, however, the perio- 
dicity is disturbed by thermal motions and the randomly distributed 
departures may be introduced into the computation as a perturbation. 
The calculation is cumbersome but it succeeds in giving a satisfactory 
picture of conduction. 


6. The Role of Lattice Defects in Dielectric Crystals. 

It was stated in the preceding section that a material is an insulator 
or a conductor according as the uppermost energy band is completely 
or partially occupied. In rare cases, the nearest unoccupied band may 
lie such a short distance above the highest fully-occupied one that even 
at room temperature occasional electrons may find their way into the 
upper band. Each electron thus elevated contributes in two ways to 
the conduction: firstly, because this electron is itself mobile, and 
secondly, the hole now remaining in the previously fully-occupied band 
is able to move (“ hole conduction ”). Since the number of electrons 
elevated in this way is governed by the Boltzmann distribution, the 
conductivity of such truly electronic semiconductors increases with the 
temperature according to $e^{-W/kT}$, where $W$ is the excitation energy. This
temperature dependency is well founded experimentally. 
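To give an idea of the strength of this temperature dependence, the following Python sketch compares the Boltzmann factor $e^{-W/kT}$ at two temperatures. The excitation energy of 0.5 electron volt and the two temperatures are assumed values chosen only for illustration, and all factors other than the exponential are ignored.

```python
import math

BOLTZMANN_EV_PER_K = 8.617333e-5   # Boltzmann constant in electron volts per degree


def boltzmann_factor(excitation_ev, temperature_k):
    """Fraction e^(-W/kT) governing the number of electrons raised to the upper band."""
    return math.exp(-excitation_ev / (BOLTZMANN_EV_PER_K * temperature_k))


W = 0.5                 # assumed excitation energy in electron volts
T_low, T_high = 300.0, 350.0

ratio = boltzmann_factor(W, T_high) / boltzmann_factor(W, T_low)
print(ratio)
```

A rise of fifty degrees thus multiplies the number of elevated electrons, and with it the conductivity, by a factor of more than ten under these assumptions.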

With other substances, e.g. boron, the conductivity of different samples is often found to vary by several orders of magnitude, even at the same temperature. The cause, recognised only relatively recently, is the great influence of lattice defects or imperfections. These may be atoms of impurities, holes, or atoms of the material itself which occupy locations between lattice points. If they are present even in very small concentrations they are able to alter the conduction properties completely. The lattice defects constitute the critical factor in the application of a tremendous number of materials: oxide cathodes of electron tubes, dry rectifiers, detectors, transistors, selenium photocells, phosphors, and photographic emulsions. Preparation of suitable materials for such uses amounts to finding ways of introducing the proper defects into the substance. We shall consider two classes of materials here: phosphors and photographic emulsions.

One of the oldest and most widely used phosphors is ZnS con- 
taminated with traces of other heavy metals or rare earths. It may be 
fabricated in forms that show long-lasting phosphorescence, or others 
where the decline is rapid, such as those used to coat cathode-ray and 
television receiving tubes. In the level scheme of such a phosphor there 
is an impurity band C, of finite width, lying higher than the fully- 
occupied band A of the main material (fig. 8). An electron at the 
location of the impurity ion has somewhat higher energy than one in 
the main lattice. In every phosphor there is a second kind of defect 
fulfilling the requirement that the whole crystal be electrically neutral. 
If, for example, the place of a Zn⁺⁺ ion is taken by a Cu⁺, there will
be an excess of negative charge in the neighbourhood. This may be
compensated, for instance, by the removal of one S⁻⁻ ion for each two
Cu⁺ ions present. Since a vacancy of this kind corresponds to a positive
charge, it will attract and hold electrons, acting like an electron trap. 
The corresponding energy level D lies just below the conduction band B. 

The mechanism of phosphorescence is then visualized as follows: 
A photon absorbed by the main substance ejects an electron from A 
and sends it upward to B.* In B the freed electron interacts with the
lattice elements, losing energy thereby and descending to the lower 
edge of band B. Its return from this point to A , which would violate 
a selection rule, is prevented by the circumstance that its former place 
in A was soon filled by an electron from the impurity level. A lattice 
defect in the latter level is completely surrounded by normal lattice 
elements, and so such a transition can take place readily. 

The conduction electron, whose existence can be directly verified 
electrically, moves through the lattice and is trapped in D. Passage 
from here to a vacant place in C will be possible only if the level D 
happens to lie near an impurity atom in which there is a vacancy. This 
seldom happens, and the electron usually is thrown back into the con- 
duction band by the absorption of thermal energy. This happens 
repeatedly until the electron chances to fall into a properly placed trap 
from which it can return to an available place in an impurity ion. The 
latter transition is accompanied by the emission of radiation, and the 
phosphor is then back in its original condition. 

This picture of what goes on, while apparently somewhat artificial, succeeds in bringing order to a wealth of observational material. For one thing, it explains at once the strong temperature dependence of the luminescence which enables the substance to “freeze” the absorbed light and then give it out again rapidly upon warming. In order to pass from an unfavourably placed trap to a better one located near an impurity ion that is able to accept an electron, the conduction electron must take up heat energy to bring it back to the conduction band. The band theory also explains the selective effect of infra-red light whose quanta correspond to the energy difference B − D. The mechanism is simpler only for certain solid organic phosphors, where the absorption and emission processes take place within the same molecule, the excited state being metastable.

* The rarer transitions C to B and C to D occur also, but will be neglected here, since there are many more absorbing atoms in the ground level.

There is a connexion between the explanation of the luminescence 
of inorganic phosphors and that of the latent photographic image. The 
energy level scheme corresponds to that of fig. 8. The effective im- 
purity centres are known to be S ions, which find their way into the 
silver bromide granules from the gelatin of the emulsion. When the 
necessity of sulphur impurities was finally discovered, many puzzling 
cases of insensitivity in emulsions were cleared up. 


[Fig. 8: energy-level scheme of a phosphor, showing from top to bottom the conduction band B, the trap level D, the impurity band C and the fully-occupied band A]


In order to compensate the double charge of each S⁻⁻ ion there must somewhere be a missing Br⁻ ion. These Br⁻ vacancies are the electron traps. Compared with phosphors, the new feature in addition to the purely electronic process is this: Passage of an electron to a trap site is followed by a motion of ions, which leads to the formation of sensitivity centres. Even at room temperature, the silver halides show an appreciable electrolytic conduction, borne mainly by the Ag⁺ ions. This has to do with the smallness of these ions. In the case of a photographic emulsion, there is also a shift of the Br⁻ ions to be taken into account, and there is a corresponding movement of [...]. This may happen, for example, if a second electron should fall into the Br⁻ hole. Experiments with alkali halides bear this out. In such circumstances, little clusters consisting of Ag⁺ ions and Br⁻ [...] nothing more nor less than a tiny crystal of metallic silver, which serves as a sensitivity centre.

The decrease in sensitivity at low temperatures is then understandable
on the ground of reduced ionic mobility. Further, the quenching of 
the latent image by infra-red light (Herschel effect) is explained as the 
elevation of the trapped electron to the conduction band. Also, the 
fact that the photographic density is found to depend not only on the 
product of intensity by time but also on the way in which the energy 
is introduced during exposure is explained by the above considerations. 



CHAPTER XLII 


Nuclear Physics 

1. General Remarks on the Physics of the Atomic Nucleus.

Now that the structure of the electronic shells of the atom and the processes going on in them have been revealed in their main outlines, the interest of physicists since about 1930 has chiefly been centred on the atomic nucleus. Here both experiment and theory indicate relationships essentially different from those which apply to the outer layers of electrons, and we shall accordingly begin by discussing these differences.

The most important distinctive feature of the physics of the nucleus is the immense amount of energy which, as has long been known from the facts of radioactivity, is involved in any nuclear phenomenon. While the amounts of energy involved in changes in the outermost shell of electrons (excitation, ionization, chemical binding) are of the order 5 electron volts, the amounts of energy transformed in nuclear processes are over $10^{6}$ times as great. This energy ratio is even greater than we should expect from the dimensions of the nucleus and of the outer shells, that is, by comparing the potential energy of two elementary charges, first when at a distance apart of the order of the diameter of the atom ($10^{-8}$ cm.) and again when separated by a distance of the order of size of the diameter of the nucleus ($10^{-13}$ to $10^{-12}$ cm.). Owing to the magnitude of the energy liberated it is possible, in contradistinction to the physics of the outer shells, to observe individual phenomena. In this way we are made aware of uncommon occurrences, which, though they elude the methods of large-scale [...], may have an extremely important bearing on cosmological problems, because there the rarity of the event is compensated for by the immense periods of time available. These large amounts of energy manifest themselves in the form either of fast corpuscular rays [...] or of radiation of very short wavelength (γ-rays). In the case of γ-rays, however, it must be borne in mind that [...] experimental determinations [...] of the magnitude of the light quantum $h\nu$. These energy-rich corpuscular rays and γ-rays may be used to bring about changes in the nucleus. We have them at our
disposal in the rays emitted by the radioactive elements and in the 
cosmic rays. The latter were discovered by V. F. Hess as far back as 
1910 and have since been the subject of an enormous number of ex- 
periments. It may be taken as established that these rays are diffusely 
incident on the earth from without, coming from all directions in 
space, and that they are a mixture of the most diverse types of ray, 
whose energy may reach 10 17 electron volts and perhaps even more. 

In recent years we have found means of giving protons, deuterons and α-particles velocities far in excess of those of “natural” radiations (except those of cosmic-ray particles). Thus, in the cyclotron α-particles may be given energies amounting to several thousand million electron volts, as compared with α-rays from radioactive substances, whose energy is never as much as ten million electron volts. Further, particularly energetic particles are obtained also from artificially radioactive elements (cf. § 4).

Several methods are available for detecting the individual phenomena. The simplest of these is the scintillation method, which is very suitable for α-particles and hydrogen nuclei or protons. The energy of these is so great that when an individual particle hits a screen of zinc blende it gives rise to a flash of light which is visible in a microscope. Owing to the differing brightness of the flashes it is possible to distinguish between α-particles and the feebler protons. The velocity of the particles is determined from their range, i.e. the distance from the source of radiation, in air at 0° C. and 760 mm. pressure, at which they can no longer produce scintillation. The relationship is an empirical one; for α-particles the formula

$$R = 0.965 \times 10^{-27}\,v^{3}, \qquad (1)$$

where $R$ is the range and $v$ the velocity, is widely applicable. According to Blackett’s experiments, in which the velocity was obtained from the magnetic deviation, the formula

R = 1-77 X lO-®** 4 (2) 


is more satisfactory for protons. A formula which has a more accurate theoretical basis, but which is very inconvenient in practice, has been given by Bethe. Reference may be made to equation (5) on p. 742.
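The use of the empirical relation for α-particles may be illustrated by a short calculation. The Python sketch below assumes that formula (1) is the cube law $R = 0.965 \times 10^{-27}\,v^{3}$, with $R$ in cm. and $v$ in cm./sec.; the function names and the velocity chosen are illustrative only.

```python
RANGE_CONSTANT = 0.965e-27   # cm. per (cm./sec.)^3, as read from formula (1)


def alpha_range_cm(velocity_cm_per_sec):
    """Range in air from the velocity, assuming the cube law of formula (1)."""
    return RANGE_CONSTANT * velocity_cm_per_sec ** 3


def alpha_velocity_cm_per_sec(range_cm):
    """Velocity from the observed range, by inverting the cube law."""
    return (range_cm / RANGE_CONSTANT) ** (1.0 / 3.0)


v = 1.6e9                          # illustrative velocity in cm./sec.
R = alpha_range_cm(v)              # close to 4 cm. for this velocity
v_back = alpha_velocity_cm_per_sec(R)
print(R, v_back)
```

In practice the second function is the one required, since it is the range that is observed and the velocity that is sought.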

Still more elegant and direct is the cloud chamber method due to 
C. T. R. Wilson. In this method the paths of the particles are made 
visible owing to the ions formed along these paths being caused to act 
as nuclei for the condensation of supersaturated water vapour. Here 
again the nature of the particles is revealed simply by the intensity 
of the track (the paths of electrons can also be made visible in the 
same way). As before, the velocity is obtained from the range, that is, 
the length of the track. A better method consists in placing the chamber in a magnetic field and measuring the resultant curvature of the track. This method is especially useful in revealing the sign of the charge carried by the particle. Recently it has been found that α-particles and protons leave tracks in a photographic emulsion, a method that is valuable for recording rare events without requiring special attention.

Again, the ions produced by a particle can be detected directly by means of specially sensitive electrometers, such as those due to Hoffmann. Or the ions first formed may be made to give rise by collision to a large number of new ions, so that less sensitive instruments may then be used to detect the total current. This is the principle of the Geiger and Geiger-Müller counting apparatus. In both types of instrument the ions first produced give rise to a gaseous discharge which is then rapidly extinguished by a suitable device. Depending upon the magnitude of the applied potential, a counter may be made to give a current pulse proportional to the number of initially produced ions or else each particle may be made to produce a pulse of the same strength. It is necessary to use weakly ionizing particles (electrons) in the breakdown range. Counting and recording devices may be actuated by means of suitable amplifiers. Recently, in place of some of the above instruments, scintillation counters have come into use. Here the flashes are amplified by photo-electric means and then registered. Suitable scintillation materials permit the recording of β- and γ-rays as well.

For very energetic particles such as those found in cosmic rays (§ 15) it is often difficult to distinguish their nature. The charge is usually recognized as a multiple $z$ of the electron charge, but the rest mass and the velocity (or the kinetic energy) are unknown. Although the requirements are usually prohibitive, it may be possible to observe the path of the particle in a strong magnetic field directed normal to its path. In such case, equation (14) of p. 429 furnishes one relation. The product $H\rho$, called the magnetic rigidity, tells us the value of $mvc/ze$. But at the extremely high velocities encountered, the relativistic mass increase becomes important, and the value sought is the rest mass $m_0$, which may be very much smaller than $m$.

Strictly speaking, the product $H\rho$ gives us a relation between $m_0$ and the kinetic energy $E$. Thus in equation (14), referred to above, $m$ is to be replaced by $m_0/\sqrt{1 - v^{2}/c^{2}}$. From equation (82) on p. 257,

$$E = m_0c^{2}\left(\frac{1}{\sqrt{1 - v^{2}/c^{2}}} - 1\right) \qquad (3)$$

may be solved for $v$. Using the result and also $m = m_0/\sqrt{1 - v^{2}/c^{2}}$,

$$H\rho = \frac{1}{ze}\sqrt{E^{2} + 2m_0Ec^{2}}. \qquad (4)$$
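The relation (4) may be checked against the direct expressions for momentum and kinetic energy in terms of the velocity. The following Python sketch is offered only as an illustration: energies are expressed in electron volts, the conversion factor 299.792458 (volts per electrostatic unit of potential) is used to obtain $H\rho$ in gauss cm., and the function names and the choice of a proton with $v/c = 0.8$ are assumptions.

```python
import math

VOLTS_PER_STATVOLT = 299.792458      # converts pc in electron volts to gauss cm.
PROTON_REST_ENERGY_EV = 938.272e6    # m_0 c^2 for the proton in electron volts


def momentum_times_c(kinetic, rest_energy):
    """pc from the kinetic energy E and the rest energy m_0 c^2, as in (4)."""
    return math.sqrt(kinetic ** 2 + 2.0 * rest_energy * kinetic)


def magnetic_rigidity(kinetic_ev, rest_energy_ev, z=1):
    """H*rho in gauss cm. for a particle of charge number z."""
    return momentum_times_c(kinetic_ev, rest_energy_ev) / (z * VOLTS_PER_STATVOLT)


def kinetic_from_beta(beta, rest_energy):
    """Kinetic energy from beta = v/c, equation (3)."""
    return rest_energy * (1.0 / math.sqrt(1.0 - beta ** 2) - 1.0)


def momentum_from_beta(beta, rest_energy):
    """pc = m v c with the relativistic mass m = m_0 / sqrt(1 - beta^2)."""
    return rest_energy * beta / math.sqrt(1.0 - beta ** 2)


beta = 0.8
E = kinetic_from_beta(beta, PROTON_REST_ENERGY_EV)
pc_from_energy = momentum_times_c(E, PROTON_REST_ENERGY_EV)
pc_from_velocity = momentum_from_beta(beta, PROTON_REST_ENERGY_EV)
rigidity = magnetic_rigidity(E, PROTON_REST_ENERGY_EV)
print(pc_from_energy, pc_from_velocity, rigidity)
```

The two values of $pc$ agree, which confirms that eliminating $v$ between (3) and the relativistic mass leads to the square root appearing in (4).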
A second relation is found from the energy loss per unit path length which the particle experiences by ionization of the air or other material through which it travels. Bethe derived a relation for very high energies $E \gg eV_i$, where $V_i$ is the complete ionization potential of the atoms composing the medium; it may range as high as 10 kv. His equation for the energy loss per unit distance is


$$-\frac{dE}{dx} = \frac{4\pi e^{4}z^{2}N}{m_e v^{2}}\,[\,\cdots\,] \qquad (5)$$




The mass of the particle does not appear, so that all particles having the same charge and speed will suffer the same loss. The charge number of the particle is $z$, while the atomic number of the atoms of the medium is $Z$ and their mean ionization potential is $V_i$. The number of atoms per unit volume of the medium is $N$ and $C$ is a correction term which may be neglected at high velocities.

Inspection of the equation discloses that the energy loss, and hence 
the intensity of ionization, decreases rapidly as the particle speed in- 
creases, approaching a flat minimum. For protons in air, this minimum 
corresponds to energies of about 2300 Mev. The number of droplets 
per unit length in a cloud chamber or the number of developed grains 
per unit length on a photographic plate increases with the number of 
ion pairs, so that the energy loss can be found empirically from such 
observations. The thickness and curvature of the tracks in a magnetic 
field almost always permit of unique identification. In the absence of 
a magnetic field, the energy may be found from the range of the particles. 
Suppose the energy loss has been determined as a function of the dis- 
tance traversed. Numerical integration will then give E as a function 
of the distance x and hence $-dE/dx$ as a function of E. Then the range
is given by

$$R = \int_0^{E_0} \frac{dE}{-dE/dx}. \qquad (6)$$


Since R is observed, this is an equation for the determination of the 
unknown initial energy $E_0$. Because of the uncertainty of the energy
loss measurements, one usually starts from the empirical range-energy 
relation for 15 Mev particles and integrates from this point up to the 
observed range. As a result of the simple dependence on z and on the 
particle mass M, equation (6) is especially suitable for computing the 
range of a particle of mass M' and charge z' from the known range of 
a particle of mass M and charge z having the same velocity.* Taking 
the energy, as usual, to be the abscissa, corresponding points will have 
different abscissas as well as different ordinates. (Cf. Ex. 134 opposite.) 

* $dE/dx$ is independent of M, but E, and therefore dE, is proportional to M.
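The scaling argument of the footnote can be put into a few lines. Since the energy loss at a given velocity is proportional to $z^2$ and independent of M, while dE is proportional to M, the range at a given velocity scales as $M/z^2$. The sketch below assumes exactly this simple scaling and ignores any low-velocity corrections; the function names are introduced here for illustration only.

```python
def range_at_same_velocity(known_range_cm, mass_known, charge_known,
                           mass_new, charge_new):
    """Range of a second particle having the same velocity as the first.

    Assumes R is proportional to M / z**2 at fixed velocity.
    """
    return known_range_cm * (mass_new / mass_known) * (charge_known / charge_new) ** 2


def energy_at_same_velocity(known_energy_mev, mass_known, mass_new):
    """Kinetic energy of the second particle at the same velocity (E proportional to M)."""
    return known_energy_mev * mass_new / mass_known


if __name__ == "__main__":
    # Alpha-particle data as quoted in Ex. 134: M = 4, z = 2, 12 Mev, 14.42 cm.
    r_proton = range_at_same_velocity(14.42, 4.0, 2, 1.008, 1)
    e_proton = energy_at_same_velocity(12.0, 4.0, 1.008)
    print(f"proton of {e_proton:.3f} Mev has a range of about {r_proton:.2f} cm")
```

This gives one point of the proton range-energy curve from one point of the α-particle curve; finding the proton energy for a prescribed range still requires the range-energy relation near that point.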
In the case of electrons, there are two other mechanisms of energy
loss in addition to that due to ionization: Compton effect and
Bremsstrahlung (according to even the classical theory, an accelerated
electron or other charged particle radiates electromagnetic waves, the
continuous X-ray spectrum). Further, the formula for $dE/dx$ includes
other terms that can no longer be neglected when dealing with electrons,
so that the range formula (6) is no longer applicable to electrons. For
heavier particles the retardations are never so great as to make the 
Bremsstrahlung important. 

So far as the theory of nuclear physics is concerned, the goal we set 
ourselves is the deduction, by application of wave mechanics, of the 
various nuclear states from some sort of model, as was accomplished 
in the case of the electron shells. Matters are still very much in a state 
of flux, but a number of important results have been obtained from 
the principles of conservation of momentum, energy and charge, whose 
validity is assumed to extend to the nucleus. Of quite special impor- 
tance is the law of the inertia of energy (cf. p. 258), which takes the place 
of the conservation of matter. It enables us to calculate the release of 
energy in a nuclear reaction from the mass defect. In this connexion 
the following transformation formulae should be noted: 

$$1\ \text{Mev} = 1.60 \times 10^{-6}\ \text{erg} = 1.78 \times 10^{-27}\ \text{gm.}, \qquad (7)$$

$$\text{Mass of electron} = 5.12 \times 10^{5}\ \text{electron volts}. \qquad (8)$$

Thus if a mass decrease of $10^{-3}$ a.m.u. (p. 765n) should occur in a nuclear
reaction, energy amounting to about 1 Mev would be liberated.
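These conversion factors are easily reproduced. The sketch below uses rounded modern values of the velocity of light, the electron volt, and the atomic mass unit; these constants are assumptions of the example and are not taken from the text, so the last digits may differ slightly from the figures quoted above.

```python
C_CM_PER_S = 2.998e10     # velocity of light in cm/sec (assumed rounded value)
ERG_PER_EV = 1.602e-12    # one electron volt in erg (assumed rounded value)
GRAM_PER_AMU = 1.6605e-24 # one atomic mass unit in gm (assumed rounded value)


def mev_to_erg(energy_mev):
    """Convert an energy in Mev to erg."""
    return energy_mev * 1.0e6 * ERG_PER_EV


def mev_to_gram(energy_mev):
    """Mass equivalent in gm of an energy in Mev, from E = m c^2."""
    return mev_to_erg(energy_mev) / C_CM_PER_S ** 2


def amu_to_mev(mass_amu):
    """Energy equivalent in Mev of a mass in a.m.u."""
    return mass_amu * GRAM_PER_AMU * C_CM_PER_S ** 2 / (1.0e6 * ERG_PER_EV)


if __name__ == "__main__":
    print(f"1 Mev = {mev_to_erg(1.0):.3e} erg = {mev_to_gram(1.0):.3e} gm")
    print(f"0.001 a.m.u. = {amu_to_mev(1.0e-3):.3f} Mev")
```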


Ex. 134. An α-particle (M = 4, z = 2) with an energy of 12 Mev has an
observed range of 14.42 cm. Find the energy of a proton (M = 1.008, z = 1)
which will have this range.


2. The Hyperfine Structure of Spectral Lines as a Connecting Link
between the Physics of the Outer Shells and that of the Nucleus. 

In addition to the methods peculiar to nuclear physics which were 
described in the previous section, spectroscopy, which is a method of 
ordinary large-scale physics, has its contributions to make to the 
problems of nuclear structure. One way in which spectra are affected 
by the nucleus depends in the first instance on the joint motion of the 
nucleus and its surrounding shells; this, however, is only of importance 
in the lightest elements. In single-electron systems the motion about
the common centre of mass can be calculated directly: we have already
carried out this calculation on p. 652 for the interval between the H
and He+ lines. This effect of the mass of the nucleus has gained fresh
significance in connexion with the lines of the isotope of hydrogen of
mass 2 discovered by Urey, Brickwedde, and Murphy, whose nucleus
is called a deuteron. The lines of the heavy isotope are obtained by
replacing $M_n$ in equation (31) (p. 652) by twice its value. The proportion in
which the two isotopes occur may be deduced from the ratio of the 
intensities of their spectral lines. While ordinary hydrogen contains 
only one part in 5000 of the heavy isotope, it is easy to obtain a richer 
mixture, owing to the magnitude of the relative difference of mass. 
(With other elements, as is well known, the difficulties of separating 
isotopes in weighable quantity are very great.) One way of carrying out 
the separation, for example, is by the electrolysis of water, in which a 
preponderance of the lighter hydrogen isotope is set free. 

Apart from this effect of the nucleus due purely to its mass, there 
is an electrodynamic interaction which gives rise to a splitting up of 
terms additional to that involved in the systematic theory of spectral 
terms developed on p. 669. This is referred to as hyperfine structure. 
The phenomenon has long been known in the case of the mercury 
lines; owing to the slightness of the splitting these lines have often 
served as tests for interferometers of high resolving power. 

If we ascribe an angular momentum of magnitude $Ih/2\pi$ to the
nucleus it is possible to develop a systematic theory of the hyperfine
structure. The total spin of the atom is then given by F (fine quantum
number), the resultant of I, the moment of the nucleus, and J (inner
quantum number), the moment of the electronic shells. The spin
vectors I and J are combined to form the resultant F in the same way
as L and S are combined to form J, J corresponding to the vector L.
When J > I a figure the same as that on p. 670 but with a different
interpretation of the vectors gives a hyperfine structure with (2I + 1)
sublevels to every term. The Zeeman effect may be treated in the
same way. It is to be noted, however, that owing to the smallness of
the splitting in fields of the necessary intensity there is an uncoupling
of the vectors J and I corresponding to the Paschen-Back effect,
which gives the Zeeman type of the multiplet component in question
a hyperfine structure of (2I + 1) components. The mechanical moment
of the nucleus is accordingly determined by the number of components.
Very many observations of fine structure have been made and they
show that atoms of even atomic weight have integral, but in most cases
zero, nuclear moment; whereas in atoms of odd atomic weight the nuclear
moment is always equal to an odd multiple of one-half.
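The counting of hyperfine sublevels follows from the usual rule for combining two angular momenta: F runs in unit steps from |J - I| to J + I. A minimal sketch, assuming this vector-addition rule (the function name is introduced here for illustration):

```python
from fractions import Fraction


def hyperfine_levels(nuclear_spin, electronic_j):
    """Return the possible values of F for nuclear moment I and inner quantum number J."""
    i = Fraction(nuclear_spin)
    j = Fraction(electronic_j)
    lowest, highest = abs(j - i), j + i
    # highest - lowest = 2 * min(I, J), which is always a whole number
    count = int(highest - lowest) + 1
    return [lowest + k for k in range(count)]


if __name__ == "__main__":
    for i, j in [(Fraction(1, 2), Fraction(3, 2)), (Fraction(3, 2), Fraction(2))]:
        levels = hyperfine_levels(i, j)
        print(f"I = {i}, J = {j}: F = {[str(f) for f in levels]}")
```

When J is at least as large as I the list has 2I + 1 entries, in agreement with the statement above; when J is smaller than I it has only 2J + 1 entries, so that the nuclear moment cannot then be read off from the number of sublevels alone.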

The magnetic moment of the nucleus, however, is not determined 
by the mechanical moment of momentum, as we do not know whether 
the latter is due to circulating electrons or to nuclei. If we replace m, 
the mass of the electrons, in equation (60) (p. 666) by $M_n$, the mass
of the proton, the magnitude so obtained is called a nuclear magneton;
it is accordingly 1840 times as small as the Bohr magneton. If we
make an approximate calculation of the term perturbation due to a 
magnetic dipole situated at the nucleus, the very small splitting gives 
the order of magnitude of the magnetic moment as that of a nuclear 
magneton. Direct measurements, using molecular-beam techniques,
lead to the same result. This furnishes one of the first objections to the
earlier idea of nuclear electrons, whose spin would give rise to one Bohr
magneton per electron.

From the discrepancy between the observed hyperfine structure 
and the separations to be expected from the multiplet rules, Schüler
has drawn far-reaching conclusions about an ellipsoidal distribution 
of charge in the nuclei of many atoms (electric quadrupole moment). 

In addition to this hyperfine structure occurring in a single isotope, 
there is another observed in elements which consist of a mixture of 
isotopes; here the lines or groups of lines correspond to the individual 
isotopes. The combination of these two effects makes it difficult to
analyse the hyperfine structure in elements such as mercury which 
have many isotopes. Inasmuch as the mass effect (p. 651) of the nucleus 
is least for the heavy elements, this type of hyperfine structure shows 
a difference in the nuclear field which is not that of a point charge. 
In fact, this isotope effect is greatest for the lowest S terms (e.g. the
6S), whose electrons approach the nucleus very closely owing to their
large eccentricity.


3. Radioactivity. 

Radioactive phenomena are amongst the earliest known facts of 
atomic physics. On them the establishment of an experimental basis 
for the model atom essentially depended. As is well known, the heaviest 
elements spontaneously emit rays which consist partly of radiation of
short wave-length (γ-rays), partly of fast electrons (β-rays), and partly
of He++ particles (α-rays), the nature and energy of the rays being
characteristic of the particular heavy atom in question. It was soon
recognized that the source of these rays is the nucleus of the atom, so
that the study of radioactivity represents the first phase of nuclear
physics. As α-rays and β-rays carry electric charges, the loss of one of
these particles alters the charge on the nucleus and a new element is
formed. Loss of an α-particle diminishes the nuclear charge by two
units, so that the element moves back two places in the periodic table;
the loss of an electron raises the positive nuclear charge by one unit, so
that the new element is one place ahead of the former element in the
periodic table. These are the celebrated displacement laws of Fajans
and Soddy. If we follow out a disintegration series, we keep coming
back to the same place in the periodic table, but with a different atomic
weight. Scientists were thus forced to recognize the existence
of isotopes, even before these were established by the methods of
mass spectroscopy.

To describe the course of a transformation process as time goes on,
we have to start from the experimental fact that the probability of
any particular atom disintegrating in the following instant is quite
independent of external conditions, depending only on the internal
structure of the nucleus. We accordingly represent this probability
by a characteristic constant λ. The probability that a selected nucleus
will disintegrate during the elementary interval of time dt is then λ dt,
and if there are a large number (N) of nuclei, the number of nuclei
disintegrating in the interval dt is Nλ dt, by the fundamental principles
of statistics. We therefore have the differential equation

$$dN = -N\lambda\,dt \qquad (9)$$

(the negative sign being necessary, as the number N is decreasing).

Integrating and taking $N_0$ as the number of atoms when t = 0,
we obtain the number of atoms left at the end of a finite time,

$$N = N_0 e^{-\lambda t}. \qquad (10)$$

That is, the disintegration constant λ is the reciprocal of the time
required for the number of atoms of the original substance present to
fall to 1/e of its initial value. In addition the half-value period (T) is
often given, i.e. the time in which the number of atoms present is
reduced to half its value. Substituting $N_0/2$ for N in equation (10),
we see that

$$T = \frac{1}{\lambda}\log_e 2. \qquad (11)$$
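The law of disintegration and the relation between the disintegration constant and the half-value period can be illustrated as follows. This is a small sketch only; the function names and the numerical value of the constant are chosen for illustration.

```python
import math


def half_value_period(decay_constant):
    """Half-value period T = (1/lambda) * ln 2."""
    return math.log(2.0) / decay_constant


def atoms_remaining(initial_atoms, decay_constant, time):
    """Number of atoms left after the given time, N = N0 * exp(-lambda * t)."""
    return initial_atoms * math.exp(-decay_constant * time)


if __name__ == "__main__":
    lam = 0.1  # hypothetical disintegration constant, per unit time
    period = half_value_period(lam)
    print(f"T = {period:.3f}")
    print(f"N(T)/N0 = {atoms_remaining(1.0, lam, period):.3f}")
    print(f"N(1/lambda)/N0 = {atoms_remaining(1.0, lam, 1.0 / lam):.3f}")
```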

As a rule the product of disintegration is itself radioactive. If, as 
in the case of uranium deposits, we have a long-lived parent substance 
whose diminution may be neglected, a state of equilibrium among the 
disintegration products is ultimately reached, atoms of each type 
being produced and being destroyed at the same rate. If we let the 
suffix 0 refer to the parent substance and the suffix 1 to the first
disintegration product, the increase in $N_1$ due to the disintegration of the
parent substance is

$$dN_1 = N_0\lambda_0\,dt,$$

whereas the loss by further disintegration is

$$dN_1' = N_1\lambda_1\,dt.$$

In a state of equilibrium we accordingly have

$$N_0\lambda_0 = N_1\lambda_1, \qquad (12)$$

or

$$\frac{N_0}{N_1} = \frac{T_0}{T_1}.$$

That is, the numbers of atoms of the disintegration products of a long-lived
parent substance present when radioactive equilibrium has been reached
are in the ratio of the half-value periods. This relationship serves to
determine the half-value periods of very long-lived types of atoms
from those of other atoms which can be measured directly.
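The approach to radioactive equilibrium can be followed numerically. The sketch below assumes a chain of only two members, with no atoms of the product present at t = 0, and uses the standard solution of the two coupled decay equations; the half-value periods are hypothetical figures chosen for illustration.

```python
import math


def decay_constant(half_value_period):
    """Disintegration constant from the half-value period."""
    return math.log(2.0) / half_value_period


def product_to_parent_ratio(time, period_parent, period_product):
    """N1(t)/N0(t) for a product that is absent at t = 0 (two-member chain)."""
    lam0 = decay_constant(period_parent)
    lam1 = decay_constant(period_product)
    # N0(t) = N00 exp(-lam0 t)
    # N1(t) = N00 lam0/(lam1 - lam0) [exp(-lam0 t) - exp(-lam1 t)]
    return lam0 / (lam1 - lam0) * (1.0 - math.exp(-(lam1 - lam0) * time))


if __name__ == "__main__":
    t_parent, t_product = 1.0e6, 1.0  # hypothetical half-value periods, in years
    for t in (1.0, 5.0, 50.0):
        ratio = product_to_parent_ratio(t, t_parent, t_product)
        print(f"t = {t:5.1f}: N1/N0 = {ratio:.3e}")
    print(f"equilibrium value T1/T0 = {t_product / t_parent:.3e}")
```

After a time of many half-value periods of the product the ratio settles at $T_1/T_0$, as equation (12) requires.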

The fact that in the case of the α-rays there is a definite connexion
between the probability of disintegration and the energy of the par- 
ticles shot out has long served as a stimulus to theoretical speculations 
on the mechanism of disintegration. For the shorter the life of an 
element, the greater is the energy set free in its disintegration. One 
might be tempted to compare this with the great velocity of strongly 
exothermic chemical reactions; here, however, the phenomena are 
quite different. The high velocity of a chemical reaction arises from 
the fact that the heat developed in the reaction increases the number 
and violence of the atomic collisions, whereas in radioactive transfor- 
mations the relationship mentioned above applies to the individual 
atom. The relationship may be represented in a range extending over 
fifteen powers of ten by the following empirical formula involving two 
constants which are characteristic of the particular radioactive series: 

$$\log\lambda = A + B\log E \quad \text{(Geiger-Nuttall)} \qquad (13)$$

or

$$\log\lambda = a + b\sqrt{E} \quad \text{(Swinne)}. \qquad (14)$$

The second form exhibits the rapid (exponential) growth of the
probability of disintegration with the energy. This feature, however, is also
brought out if we regard disintegration from the quantum mechanics
point of view as the crossing of a potential barrier, such as we
investigated in a very simple case on p. 721. For this phenomenon, which
plays a large part in nuclear physics, we shall in future use the
expression “quantum mechanics tunnel effect”, as the particle, so to
speak, gets past the potential hill by going through a tunnel.

Attempts have been made to find analogous formulas for β-disintegration,
but here, as we shall see later (§ 9, p. 763), the theoretical
aspects are more complicated. For artificially radioactive nuclei an
empirical relationship (Sargent's formula) has been obtained which
corresponds to equation (13), except that the constant A has
different values according as the nuclear momentum (nuclear spin)
is altered by 0, 1 or 2 in the atomic transformation in question.


4. Artificial Transformation and Excitation of the Nucleus. 

The cohesion between the constituents of the nucleus is so great
that we may expect that it will be possible to disrupt nuclei only
by using corpuscular rays of very high energy.

Rutherford succeeded as long ago as 1921 in bringing about a
nuclear transformation by means of α-rays, and proceeded to investigate
the protons (hydrogen nuclei) liberated from hydrogen by α-rays.
For the collision of α-particle and proton the calculation carried
out on p. 641 must be modified, inasmuch as the mass of the nucleus
struck is not large compared with that of the α-particle, but is actually
smaller than the latter. There is no difficulty in calculating the motion
of the two bodies relative to the common centre of mass. The velocity
imparted to the hydrogen nucleus is very large; if the impact were
exactly central it would be 1.6 times that of the α-particle. Hence in
the Wilson chamber the hydrogen nuclei behave in the same way as
the α-particles except that their tracks are somewhat finer. Their
velocities are found to be distributed in accordance with theoretical 
expectations. 
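The factor 1.6 follows at once from the laws of elastic central impact on a body at rest. A short sketch (the function names are introduced here for illustration, and mass numbers are used in place of exact masses):

```python
def struck_velocity_factor(mass_projectile, mass_target):
    """Velocity of the struck body divided by the initial velocity of the projectile,
    for an elastic central impact on a body at rest."""
    return 2.0 * mass_projectile / (mass_projectile + mass_target)


def projectile_velocity_factor(mass_projectile, mass_target):
    """Velocity of the projectile after the impact divided by its initial velocity."""
    return (mass_projectile - mass_target) / (mass_projectile + mass_target)


if __name__ == "__main__":
    # alpha-particle (mass number 4) striking a proton (mass number 1)
    print(struck_velocity_factor(4.0, 1.0), projectile_velocity_factor(4.0, 1.0))
```

Momentum and kinetic energy are both conserved by these two factors, as may be verified directly.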

The really significant feature of this work of Rutherford’s, however, 
was that he established the occurrence of these hydrogen nuclei or 
protons in nitrogen which previously contained no hydrogen whatever. 
Fig. 1 (Plate II) shows the liberation of a proton from nitrogen. The 
short track going sideways from the point of collision is that of the 
nucleus ${}_8\mathrm{O}^{17}$, which consists of what is left after the nitrogen nucleus
${}_7\mathrm{N}^{14}$ and the α-particle have united and discarded a proton.* Thus
the process consists not merely in the destruction of the nitrogen
nucleus, but in the building-up of a new and heavier nucleus, a
superfluous proton being ejected. Protons were subsequently found to be
liberated from a very considerable number of elements, even the
heavier ones, especially when very fast artificially produced α-particles
are used. In all these transformations, so far as Wilson chamber
experiments show, the α-particle remains in the nucleus.

The fact that in many cases the energy of the protons far surpasses
that of the α-particles used to liberate them (the range of the protons
from aluminium is actually equivalent to a range in air of as much as
80 cm., the range of the original α-particles being only 5 cm.) shows
that what we are dealing with here is an elementary nuclear reaction,
involving a transfer of energy (ΔE). This we shall reckon positive, as
in thermochemistry, when energy is set free in the course of the
transformation. If we attach the suffixes α, H, k to all quantities referring
to the α-particle, the proton, and the newly-formed nucleus
respectively, we have the energy equation

$$\tfrac{1}{2}M_\alpha v_\alpha^2 + \Delta E = \tfrac{1}{2}M_H v_H^2 + \tfrac{1}{2}M_k v_k^2. \qquad (15)$$

To express the conservation of momentum we denote the angles 
which the paths of the proton and of the nucleus make with the path 

* In nuclear physics it is customary to designate a given isotope by attaching the
atomic weight as an index (upper right) and the nuclear charge as a suffix (lower left)
to the chemical symbol. Further, a reaction is often written down in abbreviated form
by putting first the bombarding particle and then the emitted particle in brackets
between the initial nucleus and the final nucleus. Here, for example, we should write
${}_7\mathrm{N}^{14}(\alpha, p)\,{}_8\mathrm{O}^{17}$. The symbols used are α = He++, p = ${}_1\mathrm{H}^1$, d = ${}_1\mathrm{H}^2$, n = neutron,
e⁻ = electron, e⁺ = positron, γ = light quantum, ν = neutrino, m = meson.



[Plate II. Fig. 1: Ejection of a proton from nitrogen by an α-particle
(p. 748) (Proc. Roy. Soc. A, 136, Plate 8, fig. 1). Fig. 2: Disintegration
of boron by protons (p. 751); note the two distinct groups of α-particles
(photograph by F. Kirchner).]
of the incident α-particle by θ, φ respectively; we then have the
equations

$$M_H v_H \sin\theta - M_k v_k \sin\phi = 0, \qquad (16)$$

$$M_H v_H \cos\theta + M_k v_k \cos\phi = M_\alpha v_\alpha. \qquad (17)$$

From these we readily obtain

$$M_k^2 v_k^2 = M_H^2 v_H^2 - 2M_H M_\alpha v_H v_\alpha \cos\theta + M_\alpha^2 v_\alpha^2, \qquad (18)$$

or, since $M_\alpha = 4M_H$,

$$\tfrac{1}{2}M_k v_k^2 = \frac{M_H^2}{2M_k}\{v_H^2 - 8v_H v_\alpha \cos\theta + 16v_\alpha^2\}. \qquad (19)$$

Substituting this in (15) above, we obtain an equation which for a
given value of the energy transfer gives $v_H$ as a function of the angle θ
and the energy of the α-particle. In many cases the energy transfer
can be calculated from the mass defect and the distribution of $v_H$ is
in excellent agreement with theory. (The value of $v_H$ is determined by
the range, that of θ from stereoscopic photographs.) In some cases
several groups of protons were found, corresponding to several distinct
values of ΔE. This is obviously explained by the fact that when the
energy of the protons is fairly small the nucleus is left in the excited
state. The energy of excitation is probably given out later as a
γ-quantum; the occurrence of γ-rays in nuclear disintegration has
actually been established. All these observed facts justify the
assumption of the validity of the energy and momentum theorems, as well as
that of the existence of definite energy states of the nucleus.
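The use of equations (15) and (18) to find the energy transfer from an observed proton velocity and angle may be sketched as follows. The function names are introduced here for illustration, the units are arbitrary but must be consistent, and the numerical check uses the elastic central impact of an α-particle on a proton, for which the energy transfer must vanish.

```python
import math


def residual_kinetic_energy(m_alpha, v_alpha, m_h, v_h, theta, m_k):
    """Kinetic energy of the residual nucleus, from momentum conservation (equation 18)."""
    p_k_squared = ((m_h * v_h) ** 2
                   - 2.0 * m_h * m_alpha * v_h * v_alpha * math.cos(theta)
                   + (m_alpha * v_alpha) ** 2)
    return p_k_squared / (2.0 * m_k)


def energy_transfer(m_alpha, v_alpha, m_h, v_h, theta, m_k):
    """Energy set free, Delta E, from the energy equation (15)."""
    final = 0.5 * m_h * v_h ** 2 + residual_kinetic_energy(
        m_alpha, v_alpha, m_h, v_h, theta, m_k)
    return final - 0.5 * m_alpha * v_alpha ** 2


if __name__ == "__main__":
    # Elastic central impact: the "residual nucleus" is the alpha-particle itself,
    # and the proton leaves with 1.6 times the alpha-particle velocity.
    print(energy_transfer(4.0, 1.0, 1.0, 1.6, 0.0, 4.0))
```

For a genuine transformation such as that of nitrogen one would insert the mass of the newly-formed nucleus for the last argument and the measured values of the proton velocity and angle.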

The penetration of the α-particle into the nucleus is obviously
another case of the quantum mechanics tunnel effect. Here, however,
the direction of motion is exactly opposite to that in radioactive
disintegration, in which we first encountered the tunnel
effect: the α-particle advances in opposition to the potential
of the positively-charged nucleus, crosses the potential barrier, and
reaches the interior of the nucleus. According to p. 747 the
probability of crossing the barrier decreases rapidly as the difference
between the energy required and the energy available increases. ...
On the other hand, the potential barrier, as was first recognized ...
The first of these artificial transformations brought about without
the use of radioactive substances was the decomposition of lithium by
protons which had been subjected as hydrogen canal rays to a potential
of about 100,000 volts, the experiments being made by Cockcroft and
Walton working in Rutherford's laboratory. By means of the
scintillations produced on a fluorescent screen they were able to detect the
presence of α-particles of energy $8.6 \times 10^6$ electron volts. The
behaviour of lithium, from which the yield is particularly great as
compared with other atoms which can also be disintegrated by protons,
was subsequently investigated very thoroughly. Wilson chamber
photographs by Kirchner clearly showed that in each reaction two
α-particles appear. We accordingly have the equation

$${}_3\mathrm{Li}^7 + {}_1\mathrm{H}^1 + E_p = 2\,{}_2\mathrm{He}^4 + 2E_\alpha, \qquad (20)$$

or, in numbers,*

$$7.0180 + 1.0081 + 0.0001 = 8.0077 + 2E_\alpha.$$


Using the convenient relation that 1 a.m.u. (atomic mass unit) is 
equivalent to 932 Mev (million electron volts), the above numbers 
show that an energy of 17.2 Mev has been released. The measured
kinetic energy of both α-particles exceeds that of the initial proton
by about 17.0 Mev, in excellent agreement with the value computed
by conservation of mass-energy. 
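The arithmetic of the mass balance is simple enough to be set out in full. The sketch below uses only the figures quoted above (atomic masses in a.m.u., the mass equivalent 0.0001 of the proton's kinetic energy, and 932 Mev per a.m.u.); the function name is introduced here for illustration.

```python
MEV_PER_AMU = 932.0  # conversion factor as quoted in the text


def energy_release_mev(initial_masses, final_masses):
    """Energy set free, in Mev, from the difference of the summed masses in a.m.u."""
    return (sum(initial_masses) - sum(final_masses)) * MEV_PER_AMU


if __name__ == "__main__":
    # Li 7 + H 1 + kinetic energy of the proton -> two He 4
    released = energy_release_mev([7.0180, 1.0081, 0.0001], [8.0077])
    print(f"energy shared by the two alpha-particles: {released:.1f} Mev")
```

The same function applies to any reaction for which the masses on both sides are known.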

The fact of the conservation of momentum is also established by 
Wilson cloud photographs. It is found that the two tracks of the 
α-particles going in opposite directions enclose an angle of 5°, the
value calculated for protons with the velocity used being 4° 50'. The 
discrepancy of 10' is of course less than the experimental error. 

Apart from these energy-rich α-particles, the existence of two
definitely slower groups of α-particles has been established. These are
the fragments arising from the disintegration of ${}_3\mathrm{Li}^6$ according to the
reaction ${}_3\mathrm{Li}^6(p, \alpha)\,{}_2\mathrm{He}^3$. That is, one particle is an “α-particle of mass
3”. Hard γ-radiation is also observed, arising from the reaction
${}_3\mathrm{Li}^7(p, \gamma)\,{}_4\mathrm{Be}^8$. The yield of disintegrations is represented by an
exponential function, as one would expect in view of the occurrence of a
tunnel effect. With the lowest potentials (20,000 volts) 10 lD protons 


* In nuclear reactions we are of course concerned only with the masses of the
nuclei, but we actually use the masses of the atoms, i.e. of the nuclei plus the electrons,
in our calculations. This does not introduce any error, since the number of electrons is
the same on both sides of an equation representing a nuclear reaction.
stability, in fact, we must carry out the calculations in this way (cf. Exercise loo, 

p. 703). 
are required to bring about one disintegration, whereas with potentials
of 200,000 volts only $10^9$ protons are required.*

The disintegration of boron has also been investigated very 
thoroughly. Here, in addition to α-particles of certain definite ranges,
there is a continuous velocity spectrum having a distinct maximum.
This is perhaps due to the fact that the reaction takes place according
to the equation

$${}_5\mathrm{B}^{11} + {}_1\mathrm{H}^1 + E_p = 3\,{}_2\mathrm{He}^4 + 3E_\alpha, \qquad (21)$$

or, in numbers,

$$11.0129 + 1.0081 + 0.0001 = 12.0117 + 3E_\alpha.$$

When, as here, there are three fragments the energy and momentum
theorems are insufficient to determine the distribution among the
individual particles. The reaction ${}_5\mathrm{B}^{11}(p, \alpha)\,{}_4\mathrm{Be}^8$ also takes place. This
transformation gives rise to particles of definite range (see fig. 2,
Plate II, facing p. 748), while the reaction of equation (21) corresponds
to the shorter tracks seen in the picture.

Particular importance attaches to disintegration experiments with 
fast deuterons produced like the protons as hydrogen canal rays. When 
lithium is bombarded by a mixture of deuterons and protons a fresh 
group of α-particles with the high energy of $12 \times 10^6$ electron volts
is produced, over and above the groups liberated by bombardment
with protons only. On the basis of the mass defect this new group
corresponds within the limits of experimental error to the
disintegration of the lithium isotope 6 in accordance with the equation

$${}_3\mathrm{Li}^6 + {}_1\mathrm{H}^2 + E_d = 2\,{}_2\mathrm{He}^4 + 2E_\alpha, \qquad (22)$$

or, in numbers,

$$6.0169 + 2.0147 + 0.0001 = 8.0077 + 2E_\alpha.$$

While nuclear transformations by means of fast electrons have 
hitherto been observed in a few cases only, γ-rays of sufficiently high
frequency are more effective. Of particular importance for what 


* For the reaction ${}_1\mathrm{H}^2(d, p)\,{}_1\mathrm{H}^3$ Döpel obtained the surprising result that the lower
limit is experimentally fixed by the least potential required to produce canal rays and
not by the nuclear reaction coming to an end. The fact that nuclear reactions do take
place, if seldom, at velocities as low as those corresponding roughly to the velocities
of molecules in the interior of a star at a temperature of a million degrees at last
provides us with a source of energy for stellar radiation.
follows is the observation that bombardment of materials containing
deuterium gives rise to protons. But what is the residue? If from a 
particle of mass 2 and charge 1 we remove a proton of mass 1 and 
charge 1, we are left with an entity having zero charge and mass 1. 
Here for the first time we meet with a new elementary particle, the 
neutron. Strong evidence for its existence was already forthcoming 
from other directions before the above-mentioned observations were 
made ; this evidence we shall discuss in the next section. 

5. The Neutron. 

Up to the year 1932 it was reckoned as among the incontrovertible 
facts of physics that the positive charge is always associated with the 
mass of hydrogen, the negative with the mass of the electron, and that 
these two particles, protons and electrons, form the ultimate con- 
stituents of matter. However, in that year, in rapid succession, two 
new elementary particles were discovered, the neutron mentioned 
above and the positive electron, or positron. Of these, the neutron has
revolutionized our ideas of nuclear structure. 

In 1930, Bothe and Becker observed that beryllium and other light
elements when bombarded by α-particles, emit not only a hard
γ-radiation but also a penetrating radiation with entirely different properties.
In substances containing hydrogen this radiation sets free protons of
high velocity, although in the Wilson chamber it leaves no traces of
itself; that is, the proton tracks, so to speak, start from nothing. If
we begin by assuming that here we are again dealing with an extremely
hard γ-radiation (owing to the great penetrating power α- and
β-rays are at once ruled out) we are forced to interpret these protons
as a Compton effect which has taken place in the nucleus, to which
the calculations of p. 688 apply if we replace the mass of the electrons
by the mass of the protons. In this way we obtain the value $55 \times 10^6$
electron volts for the hardness of the γ-rays, while according to the
mass relationships the bombardment of beryllium can only yield a
much smaller amount of energy. The frequency of these proton tracks,
moreover, is about a thousand times as great as we should expect
from the assumed Compton effect. Further, a γ-radiation would
necessarily set free photo-electrons also; no trace of these is to be
found. Finally, the velocity imparted to nitrogen nuclei by the
radiation is three times as great as would be produced by a light quantum
of $55 \times 10^6$ electron volts in a Compton effect on the nitrogen nucleus.
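The difficulty with the γ-ray hypothesis can be made quantitative. In a Compton effect on a particle of rest energy $Mc^2$ the greatest kinetic energy that a quantum $h\nu$ can impart (backward scattering) is $2(h\nu)^2/(Mc^2 + 2h\nu)$. The sketch below evaluates this standard expression; the rest energies used (938 Mev for the proton, 14 × 932 Mev for the nitrogen nucleus) are assumptions of the example and are not quoted in the text.

```python
def max_recoil_energy_mev(quantum_mev, rest_energy_mev):
    """Greatest kinetic energy given to a particle at rest by a Compton collision."""
    return 2.0 * quantum_mev ** 2 / (rest_energy_mev + 2.0 * quantum_mev)


PROTON_REST_MEV = 938.0           # assumed value
NITROGEN_REST_MEV = 14.0 * 932.0  # assumed value, mass number times 932 Mev

if __name__ == "__main__":
    for name, rest in (("proton", PROTON_REST_MEV), ("nitrogen", NITROGEN_REST_MEV)):
        recoil = max_recoil_energy_mev(55.0, rest)
        print(f"{name}: greatest recoil energy {recoil:.2f} Mev")
```

A quantum hard enough to account for the proton recoils would thus give the much heavier nitrogen nucleus only a small fraction of the energy it gives the proton, which is the point of the comparison made above.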

We can extricate ourselves from all these difficulties if we assume 
with Chadwick that this radiation consists of charge-free corpuscles, 
which owing to the absence of electric charge cannot have any 
ionizing effect and hence are not retarded by the outer shells of the
atom. Such particles do not differ from light quanta, which also have
a mass $h\nu/c^2$, except that they do not move with the velocity of light.





Their collision with a nucleus is taken as being like the collision of 
elastic spheres. This means that the forces between the colliding par- 
ticles decrease according to a very high power of the distance. The 
nature of these forces is still a matter of speculation. If we calculate 
the velocities which a proton and a nitrogen nucleus will acquire in 
colliding centrally (these can both be obtained from their ranges), 
the principle of the conservation of momentum gives the two equations 



$$v_p = \frac{2M_n v_n}{M_p + M_n}, \qquad v_N = \frac{2M_n v_n}{M_N + M_n},$$

where the indices p, N, n refer to proton, nitrogen nucleus, and neutron
respectively; hence

$$\frac{v_p}{v_N} = \frac{M_N + M_n}{M_p + M_n}.$$

From this equation Chadwick obtained the value 1.15 ± 0.1 for the
mass of the neutron.
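Solving the ratio of the two recoil velocities for the mass of the neutron gives $M_n = (M_N v_N - M_p v_p)/(v_p - v_N)$. The sketch below evaluates this; the recoil velocities inserted are illustrative figures of the order of those reported by Chadwick and are assumptions of the example, as is the use of the mass numbers 1 and 14.

```python
def neutron_mass(v_proton, v_nitrogen, m_proton=1.0, m_nitrogen=14.0):
    """Neutron mass from the greatest recoil velocities of protons and nitrogen nuclei.

    Follows from v_p / v_N = (M_N + M_n) / (M_p + M_n).
    """
    return (m_nitrogen * v_nitrogen - m_proton * v_proton) / (v_proton - v_nitrogen)


if __name__ == "__main__":
    # Illustrative recoil velocities in cm/sec (assumed, not quoted in the text)
    mass = neutron_mass(3.3e9, 4.7e8)
    print(f"neutron mass = {mass:.2f} (proton = 1)")
```

The result depends on a small difference between two measured quantities, which explains the considerable uncertainty attached to the value.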

The exact mass of the neutron can be found by comparing masses 
in various nuclear transformations. The simplest to use is the reaction 
${}_1\mathrm{H}^2(\gamma, n)\,{}_1\mathrm{H}^1$ (see top of p. 752), which has also been observed to take
place in the reverse direction, ${}_1\mathrm{H}^1(n, \gamma)\,{}_1\mathrm{H}^2$. We have ${}_1\mathrm{H}^1 = 1.00814$
and ${}_1\mathrm{H}^2 = 2.01473$, from observations with the mass spectrograph,
taken relative to ${}_8\mathrm{O}^{16} = 16$. Also $h\nu = 2.23$ Mev and $E_n$ is
approximately zero, so that

$${}_0n^1 = 1.00898.$$
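The mass balance behind this figure reads: mass of the deuterium atom plus the mass equivalent of the quantum equals the mass of the hydrogen atom plus the mass of the neutron, the kinetic energies being neglected. A minimal sketch using the numbers quoted above and the factor of 932 Mev per a.m.u. given earlier (the function name is introduced here for illustration):

```python
MEV_PER_AMU = 932.0  # conversion factor as quoted earlier in the text


def neutron_mass_from_photodisintegration(m_deuterium, m_hydrogen, quantum_mev):
    """Neutron mass in a.m.u. from H2 + h*nu = H1 + n, kinetic energies neglected."""
    return m_deuterium - m_hydrogen + quantum_mev / MEV_PER_AMU


if __name__ == "__main__":
    mass = neutron_mass_from_photodisintegration(2.01473, 1.00814, 2.23)
    print(f"neutron mass = {mass:.5f} a.m.u.")
```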

Owing to the absence of charge on the neutron, which means that 
it can react only with nuclei, it exhibits very remarkable properties 
when made to pass through matter. As there are more nuclei in 1 c.c. 
of paraffin-wax than in 1 c.c. of lead, beams of neutrons are more
weakened by paraffin-wax than by an equally thick layer of lead.
This serves to distinguish between γ-rays and neutrons. Further,
embedding the ionization chamber in paraffin-wax reveals an
important effect of neutrons, for then beams of protons are shot out
by the paraffin-wax and ionize the gas in the chamber in their turn.
Neutrons, however, are detected most readily by the fact that they
produce radioactive isotopes of various elements, in particular of
rhodium.

The mass (1.00898) and hence the energy content of the neutron is
greater than the sum of the masses of the electron and proton (1.00814),
so that the neutron should be unstable with respect to a breakdown into
the other two particles. In fact, such “radioactive decay” of the free
neutron has been observed, the half-life being about 12 minutes. 
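The energy available for this breakdown follows directly from the two masses just quoted, since the figure 1.00814 is the mass of the hydrogen atom, i.e. of proton and electron together. A short sketch, using the factor 932 Mev per a.m.u. given earlier and the half-life of about 12 minutes stated above (the function names are introduced here for illustration):

```python
import math

MEV_PER_AMU = 932.0  # conversion factor as quoted earlier in the text


def breakdown_energy_mev(m_neutron, m_hydrogen_atom):
    """Energy set free when the neutron breaks down into proton and electron."""
    return (m_neutron - m_hydrogen_atom) * MEV_PER_AMU


def disintegration_constant_per_minute(half_life_minutes):
    """Disintegration constant lambda = ln 2 / T."""
    return math.log(2.0) / half_life_minutes


if __name__ == "__main__":
    print(f"energy set free: {breakdown_energy_mev(1.00898, 1.00814):.2f} Mev")
    print(f"lambda = {disintegration_constant_per_minute(12.0):.4f} per minute")
```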

Nuclear disintegrations brought about by beams of neutrons have 
been observed in great numbers. Curiously enough, the very slow 
thermal neutrons are often the most effective and very sharp
“resonance energies” are found. As might be expected, neutrons are much
more effective than charged particles, for the majority of the latter fail 
to act because their energy is spent in detaching outer electrons from 
the bombarded atoms. 


We may amplify this last point a little. As we saw in the case of 
the disintegration of lithium, most nuclear processes are highly exo- 
thermic reactions. The question naturally arises why these reactions 
do not spread like the explosion of a mass of gunpowder. The reason 
is that it is only in an infinitesimal number of cases that the energy 
of the charged products of disintegration, though very great, can again 
act effectively in a nuclear collision. The possibility of bringing about 
atomic transformations on the large scale is therefore exceedingly 
small. If neutrons are used the conditions are much more favourable, 
as then the only particles which suffer retardation are those which 
actually collide with a nucleus. The usual means of producing neutrons 
by means of a nuclear disintegration yields only a small supply. If, 
however, there were a process which would set free more neutrons than 
were shot in to bring it about, the possibility of a chain reaction would 
exist, provided that the volume of material is so large that only a few 
neutrons are lost. This possibility was opened up by the discovery in 
1939 by Hahn and Strassmann of the fission of the uranium nucleus. 
Whereas all previous artificial nuclear processes involved the splitting- 
off of elementary particles only, fission causes the uranium nucleus to 
break up into two fragments of intermediate weight and a number of 
neutrons. This will be discussed more fully in § 10. 
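
The condition just stated can be illustrated by a deliberately crude count of successive neutron generations. In the sketch below the function name, the figure of 2.5 neutrons set free per disintegration, and the fractions lost are hypothetical values chosen only for illustration; they are not data from the text.

```python
def neutron_generations(n_start, released_per_capture, fraction_lost, generations):
    """Follow a neutron population through successive generations.

    Every neutron that is not lost brings about one disintegration, which
    sets free `released_per_capture` new neutrons (hypothetical model).
    """
    k = released_per_capture * (1.0 - fraction_lost)  # multiplication per generation
    population = [float(n_start)]
    for _ in range(generations):
        population.append(population[-1] * k)
    return k, population


# Small block of material: most neutrons escape and the reaction dies out.
k_small, small = neutron_generations(1000, 2.5, 0.70, 10)
# Large block of material: only a few neutrons are lost and the reaction grows.
k_large, large = neutron_generations(1000, 2.5, 0.10, 10)

print(f"small block: k = {k_small:.2f}, after 10 generations {small[-1]:.1f} neutrons")
print(f"large block: k = {k_large:.2f}, after 10 generations {large[-1]:.1f} neutrons")
```

The reaction spreads only when the multiplication per generation exceeds unity, that is, when more neutrons are set free than are shot in and lost together.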


6. The Positron. 

Anderson and, independently, Kunze found that when a strong 
magnetic field was used, cloud chamber photographs of cosmic-ray 
particles showed tracks of a new type whose curvature corresponded 
to positive particles, but their intensity to electrons. Confusion with 
protons is ruled out by the fact that proton tracks so strongly curved 
could not have the range actually found, and the possibility of a mistake 
about the direction in which the track was traversed was excluded by 
making the track pass through a plate of lead, when the side where the 
curvature is greatest of course represents the later portion of the track. 
Subsequently a large number of cloud photographs showed that these 
tracks almost always occur along with an electronic track originating 
at the same place; * in fact, whole “showers” of positive and negative 
electronic tracks are sometimes found starting from a single point 
(cf. fig. 1, Plate III). These double tracks may also be produced using 
γ-rays; however, a photon of over $10^6$ electron volts is always 
required (fig. 2, Plate III). Soft γ-rays do not produce the phenomenon. 

[Plate III. Fig. 1: “Showers” of electrons and positrons liberated by 
cosmic rays (p. 754) [Proc. Roy. Soc. A, 139, Plate 22]. Fig. 2: Electron 
pair, positive and negative, released by γ-rays (p. 755).] 

This minimum value of $10^6$ electron volts serves as a hint for theory, 
for it corresponds exactly to the mass of two electrons. It is therefore 
feasible to interpret the occurrence of positive and negative electrons 
as meaning that a light quantum becomes materialized, transforming 
itself into one or more “electron pairs”. The conservation of charge 
and energy is guaranteed, but the conservation of momentum cannot 
hold, the reason being that the light quantum of all particles of the 
same energy (rest energy + kinetic energy) is that with the greatest 
momentum. For by formula (72) on p. 254 and formula (82) on p. 257, 
the momentum of a material particle of rest-mass $m_0$ is 

$$p_m = \frac{m_0 v}{\sqrt{1 - v^2/c^2}}, \tag{25}$$

and its energy is 

$$E = \frac{m_0 c^2}{\sqrt{1 - v^2/c^2}}; \tag{26}$$

for the light quantum, on the other hand, 

$$p_l = \frac{E}{c}, \tag{27}$$

so that 

$$p_m = \frac{v}{c}\,p_l < p_l.$$



Hence the phenomenon can occur only in the presence of a nucleus 
which will take up the surplus momentum. The small energy transfer 
connected therewith may be neglected. 
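
The two statements above, a threshold of about $10^6$ electron volts and a surplus of momentum on the side of the light quantum, can be checked numerically. The sketch assumes the standard electron rest energy of 0.511 Mev and the usual relativistic relation between energy and momentum; the symbols p_m and p_l for the momenta of the material particle and of the light quantum are notation adopted here.

```python
import math

ELECTRON_REST_ENERGY_MEV = 0.511  # rest energy of one electron (standard value)


def pair_threshold_mev():
    """Smallest photon energy that can supply the rest energy of an electron pair."""
    return 2.0 * ELECTRON_REST_ENERGY_MEV


def momentum_ratio(total_energy_mev, rest_energy_mev):
    """Ratio p_m / p_l for a particle and a light quantum of equal total energy.

    For the particle p_m * c = sqrt(E^2 - (m0 c^2)^2); for the quantum p_l * c = E.
    """
    if total_energy_mev < rest_energy_mev:
        raise ValueError("total energy cannot be below the rest energy")
    pc = math.sqrt(total_energy_mev ** 2 - rest_energy_mev ** 2)
    return pc / total_energy_mev  # equals v/c, hence always below 1


print(f"threshold = {pair_threshold_mev():.3f} Mev")
for energy in (0.6, 1.0, 5.0, 50.0):
    ratio = momentum_ratio(energy, ELECTRON_REST_ENERGY_MEV)
    print(f"E = {energy:5.1f} Mev: p_m/p_l = {ratio:.4f}")
```

However large the energy, the ratio stays below unity, so some third body must take up the difference in momentum.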

Apart from this “materialization” of light quanta, however, 
Curie and Joliot have observed that positrons also appear when 
aluminium is disintegrated by α-particles. In this process protons of 
very great range had previously been obtained as a product. Now 
a process in which a neutron and, subsequently, a positron are emitted 
clearly exists as well. Later, positrons were recognized to be entirely 
normal accompaniments to nuclear changes. 

Now why is the positron found less frequently than the electron? 
Even before the discovery of the positron Dirac had developed a 
generalized wave-mechanical theory of the electron (p. 711) in which 
the spin appears quite naturally. We could not go into this theory in 
detail, owing to the advanced nature of the mathematics involved. 
Formerly it was regarded as a defect in the theory that it permitted 
the appearance, even if only for the time being, of positive electrons; 
but these very entities have now been observed. The annihilation of the 
positrons obviously results from their union with electrons with the 
emission of γ-radiation. Here the momentum theorem can only be 
satisfied if two or more quanta are produced simultaneously. Observations 
on the passage of γ-rays through matter make it probable that 
materialization and dematerialization of the γ-quantum play a part 
in addition to normal absorption. 

7. Artificial Radioactivity. 

Curie and Joliot made the discovery that the emission of positrons 
from aluminium continues for some little time after the bombarding 
radiation is cut off, dying away exponentially. They interpreted this as 
meaning that when aluminium is bombarded with α-rays an unstable 
nucleus is formed which is transformed into a stable one according to 
the laws of atomic disintegration. 

Subsequently this induced radioactivity was found to occur in hun- 
dreds of nuclear transformations. This is not surprising, for in most 
cases the balancing of the masses gives isotopes which differ from those 
already known. They are not to be found in nature because, if they ever 
did exist, they must long since have been transformed into the stable 
forms. 

We can, in fact, understand the nuclear transformations discussed 
in § 4 (p. 747) better if we assume that in every case the bombarding 
particle is captured to form an unstable intermediate nucleus, which 
in these cases has a life too short to measure. 

Atoms rendered artificially radioactive have found many important 
applications in medicine and in industry. For example, by administer- 
ing small amounts of radioactive sodium to a patient, its progress 
through the blood stream can be followed by means of a Geiger counter. 
Similar experiments can be performed with plants in order to find out 
how they assimilate and distribute foods. In engineering, the rate of 
wear of bearings has been determined by previously introducing a 
small amount of a radioactive metal into the bearing metal and then 
measuring the activity of the lubricating oil. 

In all artificially radioactive elements the emission of either electrons 
or positrons is observed and sometimes γ-radiation as well. The 
reason for the preponderance of the first two phenomena, collectively 
referred to as β-transformations, will be made clear in what follows; 
the occurrence of γ-rays accompanies many processes. In addition, 
there is a process in which the nucleus captures an external electron. 
One might expect that a positron ejected from the nucleus in a 
β-process could combine with one of the external electrons. If this were 
the case, one would always observe γ-radiation of 0.5 Mev energy as 
described above for positron annihilation. However, this radiation is 
never found, and only the K series of the element in question is observed. 
It must then be assumed that in this process a K electron plunges into 
the nucleus and then, as is usual in K emission, the place thus vacated 
is filled by another electron from the L, M, or some higher shell. 
However, in order to preserve the energy balance in such K capture, it has 
become necessary to assume the existence of another kind of particle, 
the neutrino, which has only recently become available to direct 
observation. This will be discussed further in § 13. 


8. The Role of Protons and Neutrons in Nuclear Structure. 

The neutron opens up a new way of explaining the difference be- 
tween atomic weight and nuclear charge, and thus enables us to drop 
the assumption of nuclear electrons. The following are the chief objec- 
tions to nuclear electrons: 

(1) The small magnetic moment of the nucleus, already mentioned 
on p. 744, which according to all the observations is of the order of a 
nuclear magneton, not of an electronic magneton. 

(2) From other observations, the proton and probably every 
elementary particle has a spin of 1/2. Experimentally it is known 
(p. 744) that nuclei of even atomic weight have spins of even number, 
nuclei of odd atomic weight spins of odd number. This is incompatible 
with the assumption of nuclear electrons. For if $Z_p$ is the number of 
protons, the assumption that the nucleus is built up of protons and 
electrons means that the atomic mass $A$ is equal to $Z_p$ and that $Z_e$, 
the number of electrons in the nucleus, is equal to $Z_p - Z$. The sum 
of the numbers of particles is then $Z_p + Z_e = 2Z_p - Z = 2A - Z$. Whether 
this is even or odd, then, depends on the atomic number $Z$, not on the 
atomic mass $A$. On the other hand, by the laws of combination of 
spins given on p. 669, an even number of particles always gives a total 
spin which is a whole number, and an odd number of particles a total 
spin which is an odd multiple of 1/2. Hence $_1\mathrm{H}^2$ and $_7\mathrm{N}^{14}$, which have odd 
atomic numbers, would each have a spin which is an odd multiple of 1/2, 
whereas it is actually found to be 1 in both cases. 
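
The counting argument of objection (2) can be written out in a few lines. The sketch below compares the number of spin-1/2 particles on the two assumptions; the function names are illustrative only.

```python
def particle_count_proton_electron(A, Z):
    """Spin-1/2 particles if the nucleus held A protons and A - Z electrons."""
    return 2 * A - Z


def particle_count_proton_neutron(A, Z):
    """Spin-1/2 particles if the nucleus holds Z protons and A - Z neutrons."""
    return A


def predicted_spin_kind(n_particles):
    """Even number of spin-1/2 particles: whole-number spin; odd number: half-odd spin."""
    return "whole number" if n_particles % 2 == 0 else "half-odd"


for name, A, Z in (("H2", 2, 1), ("N14", 14, 7)):
    old = predicted_spin_kind(particle_count_proton_electron(A, Z))
    new = predicted_spin_kind(particle_count_proton_neutron(A, Z))
    print(f"{name}: proton-electron model -> {old}; "
          f"proton-neutron model -> {new}; observed spin 1")
```

Only the proton-neutron count agrees with the observed whole-number spin of both nuclei.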

(3) According to the indeterminacy relation, if the electron is to 
be restricted to a space of nuclear dimensions it must have a momentum 
of the order of $6 \times 10^{-15}$ g cm/sec and hence an energy of about 
100 Mev. Thus, although we obtain for the electronic shells term 
values of the energy states of the correct order of magnitude, the 
values in the case of the nucleus are too high by more than a factor 
of 10. Hence, in spite of β-disintegration, there are no electrons 
in the nucleus. 
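
The order of magnitude in objection (3) is easily verified. The sketch below takes the momentum quoted in the text to be $6 \times 10^{-15}$ g cm/sec and converts it into an energy; the physical constants are standard values, while the confinement length of $2 \times 10^{-13}$ cm is an assumption made here only to show that the indeterminacy relation leads to a momentum of this order.

```python
HBAR_ERG_SEC = 1.054e-27    # Planck's constant divided by 2*pi, erg sec
C_CM_PER_SEC = 2.998e10     # velocity of light, cm/sec
ERG_PER_MEV = 1.602e-6      # 1 Mev expressed in ergs
ELECTRON_REST_MEV = 0.511   # rest energy of the electron, Mev


def confinement_momentum(delta_x_cm):
    """Order-of-magnitude momentum (g cm/sec) from delta_p * delta_x ~ hbar."""
    return HBAR_ERG_SEC / delta_x_cm


def electron_energy_mev(p_g_cm_per_sec):
    """Relativistic total energy (Mev) of an electron of momentum p."""
    pc_mev = p_g_cm_per_sec * C_CM_PER_SEC / ERG_PER_MEV
    return (pc_mev ** 2 + ELECTRON_REST_MEV ** 2) ** 0.5


p_text = 6e-15  # momentum quoted in the text, g cm/sec
print(f"p = {p_text:.1e} g cm/sec -> E = {electron_energy_mev(p_text):.0f} Mev")

dx = 2e-13  # assumed confinement length in cm (hypothetical)
print(f"dx = {dx:.0e} cm -> p = {confinement_momentum(dx):.1e} g cm/sec")
```

At such momenta the rest energy of the electron is negligible, so the energy is practically the product of momentum and the velocity of light.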

The following account of the state of affairs may be made 
plausible by analogy. If an atom emits a quantum of visible light we 
do not conclude from this that the electronic shells contain light quanta, 
but on the contrary that the light quantum originates during the 
passage of the electronic shells to a lower energy state. Now the energy 
relationships in the nucleus are such that the energy set free in the 
passage to a lower state is sufficient to cover the rest mass of the elec- 
trons and their velocities. We therefore assume that the emission of a 
nuclear electron is associated with the transformation of a neutron into a 
proton, and the emission of a positron with the transformation of a proton 
into a neutron. But, as in the case of the emission of light, we cannot 
attempt to describe exactly how the process takes place. The charged 
particle emitted then serves to balance the charges. Instead of the 
emission of positrons, however, an electron of the K shell may be cap- 
tured by the nucleus, in which case only lines of the K series are 
emitted (p. 654). 

If we assume that nuclei are built up of protons and neutrons we 
obtain the energy of binding as the equivalent of the total mass of all 
the protons and neutrons less the actual nuclear mass.* This is always 
less than the sum of the masses of the individual neutrons and protons; 
that is, when they combine to form a nucleus, energy is set free. Other- 
wise, of course, nuclei would not be stable. The matter may be illus- 
trated in a particularly clear way by means of a graph: Relative to 
a set of axes in three dimensions we plot $Z_p$, the number of protons 
(which is equal to the charge on the nucleus) along the x direction; 
$Z_n$, the number of neutrons (which is equal to the difference between 
the atomic mass and the atomic number) along the y direction; and 
the (negative) energy of binding downward in the z direction, so that 
the nuclei of least energy lie lowest. In this way we obtain a surface, 
although it is true that we know only a narrow strip of it corresponding 
to the known natural and artificially produced isotopes. But from this 
we can surmise that the surface has the form of a trough falling away 
from the origin. The bottom of this valley contains the stable nuclei 
and at first runs diagonally in the first quadrant of the xy plane, the 

origin being the highest point (fig. 1). 

* In computing this difference, the mass of the H atom is to be used in place of the 
proton mass and in any case the mass of the entire atom rather than merely the nuclear 
mass. (See p. 763.) 

If we look at the numerical values of the energies of binding we 
see that they run from 0 to 1800 Mev. Since 932 Mev = 1 a.m.u., this 
corresponds to a change in atomic weight of up to 1.9 units. The fact 
that pure isotopes have atomic weights which are approximately whole 
numbers when all the values intermediate between 0 and 1.9 can occur 
is due to the fortunate choice of O = 16 as a standard of atomic weight. 
As the atomic weights of the proton and the neutron exceed unity by 
0.008 or 0.009, 230 particles will make the difference mount up to 1.9, 
so that the mean binding energy per particle is about 0.0085. Hence, 
as the surface falls away fairly evenly (see fig. 2), the atomic weights 
are approximately whole numbers. 
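
The figures in this paragraph hang together, as a few lines of arithmetic show. The sketch uses only numbers quoted in the text; the variable names are illustrative.

```python
MEV_PER_AMU = 932.0  # conversion used in the text


def mev_to_amu(energy_mev):
    """Convert an energy in Mev into its mass equivalent in a.m.u."""
    return energy_mev / MEV_PER_AMU


max_binding_mev = 1800.0  # largest binding energy quoted in the text
n_particles = 230         # number of particles taken in the text
mean_excess = 0.0085      # mean excess over unity of proton and neutron masses

print(f"1800 Mev corresponds to {mev_to_amu(max_binding_mev):.2f} a.m.u.")
print(f"230 particles x 0.0085 = {n_particles * mean_excess:.2f} a.m.u.")
print(f"mean binding energy per particle = {mean_excess * MEV_PER_AMU:.1f} Mev")
```

The last figure agrees with the value of about 8 Mev per particle used in the sections that follow.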

The dots in fig. 1 represent the known stable isotopes. At the start, 
the dots lie on the diagonal line in the first quadrant because the atomic 
weight is twice the atomic number for the light nuclei, i.e. the number 
of neutrons is equal to the number of protons. All points lying in the 
plane $Z_p$ = const. parallel to the $Z_n$ axis and perpendicular to the 
plane of the paper have the same nuclear charge: they are isotopes. 
All points lying on a vertical plane ($Z_n + Z_p$ = const.) parallel to the 
diagonal of the second quadrant of the xy plane have the same mass: 
they are isobars. 

[Fig. 1] 

This method of approach enables us to deduce much information 
about the stability of nuclei relative to the transformations we know 
to occur on the emission of α-rays, electrons or positrons (both of the 
latter referred to as β-transformations). Only the state of least energy 
corresponds to a system in equilibrium. In thermodynamics it was 
emphasized that it is the minimum of the free energy, not of the total 
energy, that is involved; nevertheless, this distinction is not important 
in nuclear reactions taking place on earth because the thermal energy 
is negligible compared with the nuclear energy, which means that we 
are practically at absolute zero, so far as nuclear reactions are 
concerned. Thermodynamic considerations, however, are quite relevant 
in the interior of stars, where temperatures of millions of degrees are 
believed to exist. In α-disintegration, where both $Z_p$ and $Z_n$ are 
diminished by 2, the magnitude of the energy $E$ must satisfy the 
relation 

$$E(Z_p, Z_n) > E(Z_p - 2, Z_n - 2) + E(2, 2). \tag{29}$$

As we may regard the change by 2 units approximately as a differential, 
we may write instead 

$$2\left(\frac{\partial E}{\partial Z_p} + \frac{\partial E}{\partial Z_n}\right) > E(2, 2), \tag{30}$$

i.e. the steeper the fall of the energy surface, the more assured is the 
stability. If, for the known nuclei, $E$ is now expressed as a function 
of the atomic weight, this corresponds to the intersections of the 
bottom of the valley with the planes $Z_p + Z_n$ = const. We thus 
obtain fig. 2. From it we see that, beginning with atomic weights of 
about 120, the inclination of the surface decreases, and α-disintegration 
occurs only among heavy nuclei. 



In order to discuss the stability of β-disintegrations, the surface 
concept must be further refined. The energies of binding calculated 
from the atomic weights are such that with a more accurate plot we 
can no longer make all the points lie on a smooth surface; the surface 
becomes crinkled. If, however, we join the points for which $Z_p$ is even 
and $Z_n$ is even ($A$ even), those points for which $Z_p$ is odd and $Z_n$ is 
odd ($A$ even), and, finally, the points for which $A$ is odd (either $Z_p$ or 
$Z_n$ odd), we obtain three smooth surfaces which we denote by $E_{ee}$, 
$E_{ue}$* and $E_{uu}$. Then an isobaric section gives a single curve of 
intersection, or two curves, according as it corresponds to an odd or even 
atomic weight. This is shown for an even atomic weight in fig. 3a. 
The $E_{uu}$ curve lies above the $E_{ee}$ curve, from which we conclude that 
in general nuclei for which $Z_p$ and $Z_n$ are odd are unstable with respect 
to β-transformations, since, of course, the next lower point can be 
reached by such a change. Actually there are only four stable nuclei of this 
type: $_1\mathrm{D}^2$, $_3\mathrm{Li}^6$, $_5\mathrm{B}^{10}$, $_7\mathrm{N}^{14}$. The existence of these nuclei is possible 
because at the start the walls of the nuclear energy surface are not far 
apart and are steep, so that we have the state of affairs shown in 
fig. 3b. The favouring of the type ee is rightly associated with pairwise 
cancelling of the spins. As the β-disintegration takes place in an 
isobaric section, it is only a question of whether there is a nucleus lower 
down in the immediate neighbourhood; at any rate, double 
transformations with emission of two electrons have not so far been 
observed. If we make the section through a nucleus of the type ue, there 
are other nuclei at unit distance, which are β-unstable since they lie 
higher. Hence if the atomic weight is odd, there are no stable isobaric 
substances. This is Mattauch’s first law of isobars. 

* It is not necessary to distinguish between $E_{ue}$ and $E_{eu}$. 

[Fig. 3a and Fig. 3b: isobaric sections showing stable and unstable nuclei.] 

Since the $E_{uu}$ surface has no stable nuclei beyond $_7\mathrm{N}^{14}$, all the points 
corresponding to stable nuclei of even atomic weight must lie on the 
$E_{ee}$ surface, but they are 2 units apart. That is, a β-disintegration 
results in passage to the $E_{uu}$ surface, with the formation of a new 
unstable nucleus. From the position of the two surfaces we see from 
fig. 3a that here 3 stable nuclei are possible, while the two next ones, 
with the relationships we have assumed, have a lower neighbour in 
the $E_{uu}$ surface, i.e. they are β-unstable. We thus obtain Mattauch’s 
second law of isobars: If the atomic weight is even there are several stable 
isobaric substances, but they are never those for which the nuclear charge 
differs by unity. 
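
Both laws of isobars can be tried out on a smoothed energy surface. The sketch below is an illustration under stated assumptions, not the procedure of the text: it takes the energy formula (33) of § 9 in the form $\delta M/A = -u_0 + b\,((N-Z)/(N+Z))^2 + cA^{-1/3} + (3e^2/5r_0)Z^2A^{-4/3} + \gamma kA^{-7/4}$ with the constants (34), compares the energies of isobars directly, and neglects the small difference between the masses of the neutron and the hydrogen atom, which matters in critical cases. The function names are hypothetical.

```python
# Constants of formula (33) as quoted in (34), in thousandths of a mass unit.
U0, B, C, COULOMB, K = 15.7, 22.0, 16.5, 0.646, 36.0


def energy(A, Z):
    """Total energy A * (delta M / A) of the nucleus (A, Z); lower means more firmly bound."""
    N = A - Z
    if A % 2 == 1:
        gamma = 0      # type ue
    elif Z % 2 == 0:
        gamma = -1     # type ee
    else:
        gamma = +1     # type uu
    return (-U0 * A + B * (N - Z) ** 2 / A + C * A ** (2 / 3)
            + COULOMB * Z ** 2 * A ** (-1 / 3) + gamma * K * A ** (-3 / 4))


def stable_isobars(A):
    """Charges Z whose two isobaric neighbours Z - 1 and Z + 1 both lie higher."""
    return [Z for Z in range(2, A - 1)
            if energy(A, Z) < energy(A, Z - 1) and energy(A, Z) < energy(A, Z + 1)]


for A in (79, 80, 81, 124):
    print(f"A = {A}: stable charges {stable_isobars(A)}")
```

For odd atomic weight the section is a single smooth curve with one lowest point; for even atomic weight the lowest points all belong to the type ee and therefore can never be neighbours.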

In order to arrive at a statement about the possible isotopes, we 
project the points of the energy surface and the bottom of the valley 
on to the xy plane. We can immediately omit all the nuclei of type uu 
as unstable (in fig. 4, in which the stable nuclei which actually occur 
are shown by black circles, the unstable nuclei are crossed out). If, 
for example, we consider the isotopes of Br, the two nearest the bottom 
of the valley are $_{35}\mathrm{Br}^{79}$ and $_{35}\mathrm{Br}^{81}$. If we describe isobaric sections 
through them, within which β-transformations take place, we see that 
they lie lowest in these also, as they are nearest to the bottom of the 
valley. The nearest possible isotopes, however, lie higher in their 
isobaric sections than other nuclei, i.e. they are β-unstable. We thus 
obtain Aston’s isotope rule: elements of odd atomic number (such as 
Br) have at most two stable isotopes. In the same way, we see 
immediately from the figure that a larger number of stable isotopes is 
possible when the nuclear charge is even, especially when the $E_{ee}$ and 
$E_{uu}$ surfaces are far apart. A good many of the β-unstable atoms 
marked by empty or crossed-out circles actually have been found in 
nuclear transformations, so that now we know something of the region 
around the valley bottom.* 

* See, for example, Sullivan, W. H., Trilinear Chart of Nuclear Species, New York, 
Wiley, 1949. 

By means of the energy surface we can also explain the remarkable 
“resonance energies” mentioned in § 5 as being observed in the 
binding of neutrons, i.e. particularly effective velocities which often 
amount to only a few electron volts and thus are of quite a different 
order of magnitude from the nuclear energies. If we go along the 
valley of stable nuclei from one isobaric section to the next, the surface 
falls by about 8 million electron volts. In adding a neutron we are, it 
is true, not moving in this direction of greatest fall, but in the isotopic 
section; however, the new nucleus will in any case always lie several 
million electron volts lower. On the other hand, an excited state of 
this nucleus may come to about the same level as the original nucleus, 
and this energy of excitation corresponds to the order of magnitude 
of the nuclear energies within a few million electron volts. If now the 
approaching neutron has sufficient kinetic energy to supply exactly the 
amount needed to make the energies correspond, binding takes place 
without any accompanying phenomena. The excited nucleus then 
passes to its normal state with the emission of γ-rays and there follows 
a β-transformation to the lowest point of the isobaric section in which 
the new nucleus lies. Excited nuclei, like excited atoms, can be 
metastable, and then have a life which is long in comparison with that of 
metastable atoms, amounting even to hours. This explains the fact of 
nuclear isomerism: the occurrence of nuclei of the same atomic weight 
and the same charge, but with differing lengths of life. 

The representation by means of the nuclear energy surface, although 
very easy to follow, does not give a perfectly accurate description of 
the stability relationships in the isobaric section. It is on the difference 
in atomic weight that the relative stability of two nuclei depends (see 
Ex. 135). Owing to the slight difference between the mass of the hydro- 
gen atom and that of the neutron, however, the difference in energy 
between two nuclei in the same isobaric section is not exactly equal 
to the difference in atomic weight, for in advancing along the isobaric 
section the ratio of the number of protons to the number of neutrons 
changes. Hence in critical cases it is advisable to plot the deviation of 
the atomic weight from integral values over the ZN plane in the various 
isobaric sections. If this is done, however, the various isobaric sections 
can no longer be brought into relation with one another. 


Ex. 136. Show that a nucleus of atomic mass $A$ and nuclear charge $Z$ is 
stable as regards β-transformation and transformations involving K-capture if 
its atomic weight is less than that of its neighbours $Z + 1$ and $Z - 1$. (Write 
down the equation for the conservation of energy, taking into consideration the 
mass of any electrons produced or destroyed.) 


9. The Liquid Drop Model of the Nucleus. 

If we really want to calculate the binding energies and the stationary 
states of the nuclei, we must make a model of the configuration of the 
nuclear components. Here, unlike the situation for the atom as a whole, 
there is no central body, so that the planetary model is not appropriate. 
The similarity between the constituents (the difference between 
charged and uncharged particles does not play a big part here and is 
neglected in what follows) suggests that we take a liquid drop as our 
model. To explain the very strong cohesion of this droplet we have to 
introduce a new kind of “nuclear force” acting between the 
constituents, no matter whether they are charged or not. The fact that 
the energy always comes to about the same amount, about 8 Mev 
for each new particle taken in, shows that the forces (like those binding 
a liquid together) fall off rapidly with the distance, so that only the 
interactions with immediately neighbouring particles are important. 
With forces which, like electrostatic forces, diminish relatively slowly 
with distance, the energy of the mutual action between the newly 
arrived particle and all those previously there must be taken into 
account, so that the binding energy per particle must rise rapidly with 
the number of particles already present (see below). In actual fact, 
experiments on the scattering of fast protons by protons have shown 
that for very close approaches there is a deviation from the classical 
scattering formula derived from the laws of electrostatics, which 
proves that, contrary to the Coulomb repulsion, there exists a force of 
attraction which decreases rapidly with distance. 

The droplet model even yields the finer details of the situation, for 
example the existing slight progressive change in the binding energy 
per particle, i.e. the sum of all the particle masses minus the nuclear 
mass, all divided by the nuclear mass, or the quantity $\delta M/A$. If we 
think of the constituents of the nucleus as in the closest spherical 
packing, each particle has 12 neighbours, and the energy set free by 
bringing in another particle is $6U_0$, where $U_0$ is the binding energy 
between two particles. From this, however, we have to subtract two 
quantities: (1) a diminution of the binding energy of the particles lying 
on the surface, each of which interacts with only six neighbours, and 
(2) the electrostatic energy of the protons which owing to mutual 
repulsion has the opposite sign to the energy of nuclear attraction. So 
far as the former is concerned, we can consider its variation with the 
number of particles $A$, as follows: The volume increases in proportion 
to $A$, so that the radius of the droplet increases as $A^{1/3}$. Its surface, 
therefore, varies as $A^{2/3}$, and the contribution of this quantity per 
particle is proportional to $A^{2/3}/A = A^{-1/3}$. 

For the electrostatic energy, the slow decay of the forces makes it 
necessary to calculate the energy of a sphere of radius $a$ having a 
uniform charge density $\rho$ as in Ex. 80, p. 290. The energy is 
proportional to $\rho^2 a^5 \propto \rho^2 A^{5/3}$. Since the ratio of protons and neutrons does 
not change much with atomic weight, we may regard $\rho$ as constant to 
the degree of approximation sufficing here, and the contribution per 
particle becomes approximately proportional to $A^{2/3}$. Hence the 
relation between $A$ and the binding energy per particle is expressed by 
a formula with three constants, 

$$\frac{\delta M}{A} = 6U_0 - K_0 A^{-1/3} - K_c A^{2/3}. \tag{31}$$

With the empirically determined values of $U_0$, $K_0$, and $K_c$ this formula 
expresses the relation surprisingly well, as shown by fig. 5 (p. 766). 

The left-hand scale gives the binding energy per particle expressed 
in millions of electron volts and in thousandths of an atomic mass unit. 
The right-hand scale is the so-called packing fraction, given in 
ten-thousandths of a unit. This quantity varies in close correspondence 
with the binding energy per particle.* 

The increase in the electrostatic energy has the further consequence 
that, with increasing atomic weight, the lowest energy state of a 
nucleus no longer occurs when the number of neutrons equals the 
number of protons. Instead, for the stable nuclei, the number of neutrons 
increases more and more with respect to the number of protons until 
finally only α-unstable nuclei are possible. 

We shall refine equation (31) somewhat and at the same time put 
it into a form which makes it easier to recognize the meaning of 
the constants. In order to account for the slopes of the sides of the 
nuclear energy surface we must add a term representing the 
neutron-proton ratio. The lighter stable nuclei have $N = Z$, while for the 
heavier ones there is a progressive increase in the neutron excess. From 
this we conclude that the binding energy is greatest for $N = Z$, and 
that the deviations in the total energy are attributable to the increase 
in the electrostatic energy. Consequently, we add a term in $(N - Z)^2$. 
By analogy with the case of binary solutions in physical chemistry, 
this energy term will depend only on the relative concentrations, so that 
it will take the form $b(N - Z)^2/(N + Z)^2$. Further, we represent the 
nuclear radius by 

$$r = r_0 A^{1/3}. \tag{32}$$

Finally, we insert a term representing the splitting of the energy surfaces 
attributable to the pairing of the spins. This term is somewhat harder to 
justify theoretically. Using $u_0 = 6U_0$, the result is 

$$\frac{\delta M}{A} = -u_0 + b\left(\frac{N - Z}{N + Z}\right)^2 + cA^{-1/3} + \frac{3}{5}\frac{Z^2 e^2}{r_0}A^{-4/3} + \gamma kA^{-7/4}, \tag{33}$$

where 

$$\gamma = +1 \text{ for } uu, \qquad \gamma = 0 \text{ for } ue, \qquad \gamma = -1 \text{ for } ee.$$

The numerical values of the constants obtained after smoothing out the 
small irregularities of the nuclear energy surface are, expressing $\delta M$ in 
thousandths of a mass unit, 

$$u_0 = 15.7, \quad b = 22, \quad c = 16.5, \quad \frac{3e^2}{5r_0} = 0.646, \quad k = 36. \tag{34}$$

* It may be useful to summarize the relevant definitions at this point. One a.m.u. 
is 1/16 the mass of $\mathrm{O}^{16}$ and the isotopic mass of an atom species is its atomic mass 
expressed in a.m.u. The mass number is the nearest whole number to the isotopic mass, 
and all known isotopic masses fall within 0.1 of the closest integer. The small difference 
is the mass defect (or mass correction). Finally, the packing fraction is the mass 
correction divided by the mass number. 

The constant in the nuclear radius formula (32) has the value 

$$r_0 = 1.41 \times 10^{-13} \text{ cm}; \tag{35}$$

curiously, this is the radius of a sphere over which the charge $e$ could be 
spread uniformly to have the electromagnetic energy corresponding to 
the mass of the electron. The “classical electron radius” is represented 
by $2r_0$. 

It must be mentioned that the energy $u_0$ representing the binding 
energy of a nucleon involves not only potential energy [represented by 
the negative term in equation (33)] but a kinetic part (positive terms) 
as well. 
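
Formula (33) with the constants (34) is easily evaluated. The sketch below assumes the reading $\delta M/A = -u_0 + b\,((N-Z)/(N+Z))^2 + cA^{-1/3} + (3e^2/5r_0)Z^2A^{-4/3} + \gamma kA^{-7/4}$, in thousandths of a mass unit, and the conversion 932 Mev = 1 a.m.u.; the function names are illustrative. As a check it computes the energy set free when $\mathrm{U}^{235}$ and $\mathrm{U}^{238}$ each take up a neutron, for which § 10 quotes 6.81 and 5.37 Mev.

```python
MEV_PER_MMU = 0.932  # 932 Mev = 1 a.m.u., hence 1/1000 a.m.u. = 0.932 Mev

# Constants as quoted in (34), in thousandths of a mass unit.
U0, B, C, COULOMB, K = 15.7, 22.0, 16.5, 0.646, 36.0


def gamma(A, Z):
    """+1 for type uu, 0 for type ue (odd A), -1 for type ee."""
    if A % 2 == 1:
        return 0
    return -1 if Z % 2 == 0 else +1


def dm_per_particle(A, Z):
    """delta M / A in thousandths of a mass unit (assumed reading of (33))."""
    N = A - Z
    return (-U0 + B * ((N - Z) / (N + Z)) ** 2 + C * A ** (-1 / 3)
            + COULOMB * Z ** 2 * A ** (-4 / 3) + gamma(A, Z) * K * A ** (-7 / 4))


def neutron_binding_mev(A, Z):
    """Energy set free (Mev) when the nucleus (A, Z) takes up one more neutron."""
    before = A * dm_per_particle(A, Z)
    after = (A + 1) * dm_per_particle(A + 1, Z)
    return (before - after) * MEV_PER_MMU


for A in (235, 238):
    print(f"U{A} + n: {neutron_binding_mev(A, 92):.2f} Mev")
print(f"binding energy per particle of U238: "
      f"{-dm_per_particle(238, 92) * MEV_PER_MMU:.2f} Mev")
```

With the rounded constants the two values come out close to, though not identical with, those quoted in § 10; their difference of about 1.4 Mev is due almost entirely to the γ-term.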



10. The Fission of Uranium. 

This process, mentioned at the end of § 5, may be visualized re- 
markably well by using the liquid drop model of the nucleus. Firstly, 
because of the existence of surface energy, the most favourable form is 
the sphere. However, because of Coulomb forces, a more elongated 
shape may become more advantageous inasmuch as this allows the 
charges to be farther apart, on the average. In fact, a splitting into two 
spheres can result in an increase in energy. As for water drops, supplying 
energy can bring about vibrations that result in a necking-in of the 
drop near the middle. The question arises as to when such a situation 
will become unstable, leading to actual splitting into two fragments. 
In order to investigate the stability we develop the functions of 
interest (surface area, volume, electrostatic energy) according to 
functions of the polar angle $\theta$. With departures from the 
spherical form, $r$ will depend on $\theta$, but we still assume that the drop 
is a figure of rotation. Instead of the angle $\theta$ itself, we use spherical 
harmonics, simple polynomials in $\cos\theta$ (see pp. 819-824). The advantage 
is that these functions represent solutions of the equation of 
vibration of the drop. Because of the symmetry of the vibrating drop, 
$f(\pi - \theta)$ must equal $f(\theta)$, and the uneven powers of $\cos\theta$ must be absent. 
Hence we may write 

$$r' = r\left(1 + a_0 P_0 + a_2 P_2(\theta) + \ldots\right), \tag{36}$$

where $P_0 = 1$ and $P_2 = \tfrac{1}{2}(3\cos^2\theta - 1)$. Multiplying by $\sin\theta\,d\theta$ 
and integrating from 0 to $\pi$ gives us the value of the surface of the 
deformed drop. The result, carried to the second order, is 

$$F' = 4\pi r^2\left(1 + 2a_0 + \tfrac{4}{5}a_2^2\right). \tag{37}$$

Then the surface energy amounts to 

$$E_0 = cA^{2/3}\left(1 + 2a_0 + \tfrac{4}{5}a_2^2\right). \tag{38}$$

In the same way, the volume of the drop comes out at 

$$V = \tfrac{4}{3}\pi r^3\left(1 + 3a_0 + \tfrac{3}{5}a_2^2\right). \tag{39}$$

Inasmuch as the nuclear “fluid” must be considered incompressible, 

$$a_0 = -\tfrac{1}{5}a_2^2. \tag{40}$$

Then, according to equation (38) the surface energy becomes 

$$E_0 = cA^{2/3}\left(1 + \tfrac{2}{5}a_2^2\right). \tag{41}$$
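
The second-order results can be tested by direct numerical integration. The sketch below assumes a drop of unit mean radius with $r' = 1 + a_0 + a_2 P_2(\cos\theta)$ and $a_0 = -a_2^2/5$, and evaluates the surface and the volume of the figure of rotation by the midpoint rule; the function names and the chosen amplitude are illustrative.

```python
import math


def legendre_p2(x):
    """Second spherical harmonic P2 as a polynomial in x = cos(theta)."""
    return 0.5 * (3.0 * x * x - 1.0)


def surface_and_volume(a2, n=20000):
    """Surface and volume of the drop r' = 1 + a0 + a2*P2, relative to the unit sphere.

    a0 is fixed by the incompressibility condition a0 = -a2**2 / 5, and the
    integrals over theta are evaluated with the midpoint rule.
    """
    a0 = -a2 * a2 / 5.0
    h = math.pi / n
    surface = 0.0
    volume = 0.0
    for i in range(n):
        theta = (i + 0.5) * h
        x, s = math.cos(theta), math.sin(theta)
        rho = 1.0 + a0 + a2 * legendre_p2(x)
        drho = a2 * (-3.0 * x * s)  # derivative of rho with respect to theta
        surface += rho * math.sqrt(rho * rho + drho * drho) * s * h
        volume += rho ** 3 * s * h
    # For the unit sphere both integrals reduce to the integral of sin(theta), i.e. 2.
    return surface / 2.0, volume / 2.0


a2 = 0.05
surface, volume = surface_and_volume(a2)
print(f"volume ratio               = {volume:.6f}")
print(f"(surface ratio - 1) / a2^2 = {(surface - 1.0) / a2 ** 2:.4f}")
```

For small amplitudes the volume remains that of the sphere, while the relative increase of the surface approaches two-fifths of $a_2^2$.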

In the expression for the electrostatic energy, the mean of the reciprocal 
of the distance between charges, $1/r$, is to be replaced by $1/r'$. After 
some computation, this is found to be 

$$\overline{\left(\frac{1}{r'}\right)} = \frac{1}{r}\left(1 - \tfrac{1}{5}a_2^2\right). \tag{42}$$

We now have for the total change in energy 

$$\Delta E = \Delta E_0 + \Delta E_{st} = \frac{2c}{5}A^{2/3}a_2^2 - \frac{3}{25}\frac{Z^2 e^2}{r_0 A^{1/3}}a_2^2. \tag{43}$$


If the second term exceeds the first in magnitude, the slightest dis- 
turbance will succeed in splitting the nucleus. The critical condition, 
determined by equality of the two terms, is given by 

$$\left(\frac{Z^2}{A}\right)_{\text{crit}} = \frac{2c}{3e^2/5r_0}, \tag{44}$$

and, using the numerical values from (34) and (35), the resulting value
of the above fraction is about 50. Since $Z$ is equal to about $2A/5$ for
the heaviest nuclei, the values of the critical atomic mass and critical
atomic number turn out to be

$$A_{\max} = 312 \quad\text{and}\quad Z_{\max} = 125.$$

It appears, then, that actual nuclei having $Z < Z_{\max}$ will require
excitation in order that oscillation and consequent necking-in take 
place. The amplitude of this oscillation will determine whether the 
energy state so attained is stable or not. It seems reasonable to express 
this energy of excitation as the product of the surface energy by a 
function of the ratio $(Z^2/A)/(Z^2/A)_{\text{crit}}$. The function may be specified
for the simple special cases of (a) weak charge and (b) proximity to the
critical charge; and by means of skilful interpolation Alvarez and
Bohr were able to trace the entire course of the function to some degree.
The results show that the nucleus ₉₂U²³⁶ requires an excitation energy
of 6.8 Mev, while ₉₂U²³⁹ requires 7.1 Mev. According to the formula
for binding energy, the formation of U²³⁶ by the addition of a neutron
to U²³⁵ liberates 6.81 Mev; on the other hand, formation of U²³⁹ by
the combination of a neutron with U²³⁸ releases only 5.37 Mev. The
considerable difference results from the fact that U²³⁶ is of ee type,
while U²³⁹ is ue. Thus the critical factor here is the γ-term in the nuclear-
energy formula, which determines the splitting of the energy surface.
It turns out that U²³⁵ can be split even by thermal neutrons, whose
energy is practically zero, while splitting of U²³⁸ requires fast neutrons
with an energy of about 1.5 Mev. Since the fission products have too
many neutrons compared with their nuclear charge, free neutrons 
appear in the process and these are able to propagate a chain reaction. 
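The arithmetic behind the critical values and the two uranium cases can be checked with a short sketch. The function names are illustrative, and treating the neutron energy needed as the simple difference between the excitation energy required and the binding energy liberated is an assumption made here for the sake of the example; the numbers are those quoted above.

```python
def critical_mass_and_charge(z2_over_a_crit=50.0, z_over_a=0.4):
    """Solve Z**2 / A = z2_over_a_crit together with Z = z_over_a * A."""
    a_max = z2_over_a_crit / z_over_a ** 2
    return a_max, z_over_a * a_max


def extra_neutron_energy(threshold_mev, binding_release_mev):
    """Kinetic energy (Mev) the neutron must bring beyond its binding energy.

    Assumption: the requirement is the plain difference, floored at zero.
    """
    return max(0.0, threshold_mev - binding_release_mev)


if __name__ == "__main__":
    a_max, z_max = critical_mass_and_charge()
    print(f"A_max ~ {a_max:.1f}, Z_max ~ {z_max:.1f}")
    print(f"U235 + n: {extra_neutron_energy(6.8, 6.81):.2f} Mev needed")
    print(f"U238 + n: {extra_neutron_energy(7.1, 5.37):.2f} Mev needed")
```

For U²³⁸ the plain difference comes to about 1.7 Mev, which is of the same order as the figure of about 1.5 Mev quoted in the text.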


II. Liquid Drop Treated as a Fermi Gas. 


At the end of § 9 it was pointed out that $u_0$ is the difference between
the potential energy and the kinetic energy which is also present in 
the nuclear fluid. As with the electron shells, we get some idea of the 
distribution of the kinetic energy by treating the nucleus simply as 
a Fermi gas. It is true that the masses are considerably greater, so 
that according to p. 632 it might be doubted that the conditions for a 
Fermi gas obtain here. On the other hand, the density is so much 
greater that this circumstance is more than compensated. Each nucleon, 
and indeed every true elementary particle, must be assumed to have 
spin 1/2, which can appear in either of two orientations. If we differen- 
tiate between protons and neutrons, each cell may be occupied by two 
particles. For a first approximation the difference between proton and
neutron may be overlooked and each cell of phase space may be occupied
by four particles.*

We make a very simple assumption about the shape of the potential
energy curve, corresponding to the short range of the nuclear forces:
Let a nucleon have zero potential energy when beyond the nuclear
radius $r_0A^{1/3}$ and the constant value $-V_0$ when inside (fig. 6). This
situation is called a square well potential. The assumption of this shape
is better justified than appears to be the case at first glance.

* In a more refined calculation, the existence of electrostatic forces requires that
protons and neutrons be assigned to separate cells, each occupied by only two particles.

The spatial part of the phase space is

$$V = \frac{4\pi}{3}\,r_0^3 A, \tag{45}$$

and the number of nucleons in this region having momentum between
$p$ and $p + dp$ is

$$dn = 4V\cdot\frac{4\pi p^2\,dp}{h^3}. \tag{46}$$

[Fig. 6: square-well potential; depth $V_0 = 34 \times 10^{-3}$ mass units, energy of removal $8 \times 10^{-3}$ mass units.]

The total number of nucleons is given by

$$A = \int dn = \frac{16\pi V}{h^3}\int_0^{p_{\max}} p^2\,dp = \frac{16\pi V}{3h^3}\,p_{\max}^3 = \frac{64\pi^2}{9}\,\frac{r_0^3 A}{h^3}\,p_{\max}^3, \tag{47}$$

from which it follows that

$$p_{\max} = \left(\frac{9}{64\pi^2}\right)^{1/3}\frac{h}{r_0}. \tag{47a}$$

Since $A$ cancels out, the maximum momentum is independent of the
number of particles, as long as the number is not so small that statistical
methods do not apply. The maximum kinetic energy, $T_{\max} = p_{\max}^2/2M$, is,
according to (47a), 26.1 when expressed in thousandths of a mass unit.
Taken in connexion with the experimental fact that the energy of
removal of a particle is invariably about 8 (in the same units), it follows
that the depth of the potential well is 34 units, independent of the
number of particles. But the width of the well, $r_0A^{1/3}$, changes with
increasing atomic mass.
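The size of the maximum kinetic energy can be checked numerically from the Fermi-gas relation between $p_{\max}$ and $r_0$. The sketch below is only illustrative: the value $r_0 = 1.4 \times 10^{-13}$ cm is an assumption (the text takes its value from earlier equations not reproduced here), the constants are rounded CGS values, and the function names are hypothetical.

```python
import math

H = 6.626e-27                       # Planck's constant, erg sec
M_NUCLEON = 1.673e-24               # nucleon mass, g
ERG_PER_MILLI_MASS_UNIT = 1.492e-6  # one thousandth of a mass unit, in erg
R0_ASSUMED = 1.4e-13                # cm; assumed nuclear radius constant


def p_max(r0=R0_ASSUMED):
    """Maximum momentum of the Fermi gas with four particles per cell."""
    return (9.0 / (64.0 * math.pi ** 2)) ** (1.0 / 3.0) * H / r0


def t_max_milli_mass_units(r0=R0_ASSUMED):
    """Maximum kinetic energy p_max**2 / 2M in thousandths of a mass unit."""
    return p_max(r0) ** 2 / (2.0 * M_NUCLEON) / ERG_PER_MILLI_MASS_UNIT


def well_depth(r0=R0_ASSUMED, removal_energy=8.0):
    """Depth of the square well: maximum kinetic energy plus removal energy."""
    return t_max_milli_mass_units(r0) + removal_energy


if __name__ == "__main__":
    print(f"T_max ~ {t_max_milli_mass_units():.1f} (thousandths of a mass unit)")
    print(f"V_0   ~ {well_depth():.1f} (same units)")
```

With the assumed $r_0$ the result lies close to the values 26.1 and 34 given in the text; a slightly different $r_0$ shifts it as $1/r_0^2$.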

The total kinetic energy is obtained by integrating over all values
of the momentum:

$$E_{\text{kin}} = \int_0^{p_{\max}} \frac{p^2}{2M}\,dn = \int_0^{p_{\max}} \frac{16\pi V p^4\,dp}{2Mh^3} = \frac{8\pi V p_{\max}^5}{5Mh^3} = \frac{3}{10}\,\frac{A\,p_{\max}^2}{M}. \tag{48}$$

The numerical value is but slightly different from the value of $u_0$, but
with positive sign. Here, then, as in the case of the hydrogen atom,
the kinetic energy turns out to be about half the potential, but in this
instance the relation is more a matter of coincidence.

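In the more refined computation which takes into account the differing
values of potential for charged and uncharged particles, the number
of neutrons comes out to be

$$N = \frac{8\pi V}{3h^3}\,p_{n,\max}^3, \tag{49}$$

and the number of protons

$$Z = \frac{8\pi V}{3h^3}\,p_{p,\max}^3. \tag{50}$$

The sum must be $A$. From (47), (49) and (50),

$$p_{n,\max} = \left(\frac{2N}{A}\right)^{1/3} p_{\max}, \qquad p_{p,\max} = \left(\frac{2Z}{A}\right)^{1/3} p_{\max}, \tag{51}$$

and the total kinetic energy becomes

$$E_{\text{kin}} = \frac{3}{10M}\left(N\,p_{n,\max}^2 + Z\,p_{p,\max}^2\right) = \frac{3}{10}\,\frac{A\,p_{\max}^2}{M}\cdot\frac{1}{2}\left[\left(\frac{2N}{A}\right)^{5/3} + \left(\frac{2Z}{A}\right)^{5/3}\right]. \tag{52}$$

Taking

$$\frac{2N}{A} = 1 + \frac{N - Z}{N + Z}, \qquad \frac{2Z}{A} = 1 - \frac{N - Z}{N + Z} \tag{53}$$

and expanding in powers of the fraction, we obtain the kinetic energy
per particle:

$$\frac{E_{\text{kin}}}{A} = 15.7 + 8.6\left(\frac{N - Z}{N + Z}\right)^2 + \dots \tag{54}$$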


Terms appear here of the same form as those in the formula for the
binding energy. The first term shows that the kinetic energy is exactly
equal to the negative of the total energy, as already remarked. [...]
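The expansion of the Fermi-gas kinetic energy in powers of $(N - Z)/(N + Z)$ can be compared with the unexpanded expression. The sketch below assumes that the energy per particle is $15.7 \cdot \tfrac{1}{2}[(2N/A)^{5/3} + (2Z/A)^{5/3}]$, with 15.7 the value for $N = Z$ quoted in the text; the function names are hypothetical. The binomial expansion gives a second-order coefficient of $\tfrac{5}{9} \cdot 15.7 \approx 8.7$, close to the 8.6 quoted.

```python
def kinetic_energy_per_particle(n, z, symmetric_value=15.7):
    """Unexpanded Fermi-gas kinetic energy per particle (thousandths of a mass unit)."""
    a = n + z
    return symmetric_value * 0.5 * (
        (2.0 * n / a) ** (5.0 / 3.0) + (2.0 * z / a) ** (5.0 / 3.0)
    )


def second_order_expansion(n, z, symmetric_value=15.7):
    """Same quantity, expanded to second order in (N - Z) / (N + Z)."""
    x = (n - z) / (n + z)
    return symmetric_value * (1.0 + 5.0 / 9.0 * x ** 2)


if __name__ == "__main__":
    for n, z in [(20, 20), (126, 82), (146, 92)]:
        exact = kinetic_energy_per_particle(n, z)
        approx = second_order_expansion(n, z)
        print(n, z, round(exact, 3), round(approx, 3))
```

Even for the heaviest nuclei the neutron excess is small enough that the second-order form differs from the full expression only in the third decimal place.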
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12. Independent Particle and Shell Models of the Nucleus; Nuclear 
Moments. 

The liquid drop model treated as a Fermi gas accounts in a concise 
way for many properties of nuclei, just as the treatment of the electron 
shells as a Fermi gas accounts for many features of atomic structure. 
However, details such as the mechanical and magnetic moments of 
nuclei fall outside the province of the drop model. In many cases, the 
finer details can be deduced from a model in which a single particle 
is considered and its motion in the schematized potential field of the 
others is studied. The question of nuclear moments is one that comes 
particularly within the scope of such considerations. 

The experimental fact that the ee-nuclei have zero moment (§2) 
indicates that for pairs of like nucleons (proton-proton or neutron- 
neutron) the angular momenta cancel by setting themselves in the 
antiparallel position. At the same time, the integral values found for 
the angular momenta of the four stable uu-nuclei at the beginning of
the periodic table suggest that the parallel orientation occurs for a 
proton-neutron pair. Apart from these four instances, it seems natural 
to ascribe the mechanical as well as the magnetic moment to the un- 
paired nucleon which is added last. The values of the mechanical 
moment of the ue- or eu-nuclei are relatively easy to determine from 
the hyperfine structure, but it is more difficult to measure the associated 
magnetic moment. Inasmuch as the nuclear fluid must possess kinetic 
energy, part of the moment must be the result of its rotation and the 
rest must come from the spin of the elementary particles themselves. 
The magnetic moment of a nucleus belonging to a diamagnetic atom 
or molecule can be determined by a refinement of the atomic beam 
method (p. 665) developed by Rabi. However, this method cannot be 
applied to neutrons. Bloch and Alvarez were able to determine the 
value for the neutron in the following way: A magnetized block of 
iron is used to align (“ polarize ”) the spins in a beam of neutrons, 
while a second magnetized block serves as analyser. An alternating 
magnetic field, applied in the intervening space, disturbs the alignment. 
This disturbance is greatest when the frequency of the field coincides 
with the Larmor frequency of the precessing moments.* The most
appropriate unit for expressing nuclear magnetic moments is that
corresponding to the revolution of a proton with angular momentum
$h/2\pi$. Owing to the fact that the proton is 1836 times as massive as the
electron, equation (38), p. 464, yields for this quantity the value

$$\mu_N = \mu_B/1836 = (5.048 \pm 0.006) \times 10^{-24}\ \text{erg/oersted}.$$

$\mu_N$ is called one nuclear magneton. The experiments referred to above
yield the following values for the magnetic moments of the proton and
the neutron:

$$\mu_p = +2.7927\,\mu_N, \qquad \mu_n = -1.9130\,\mu_N.$$

In each instance the gyromagnetic ratio (p. 671) is double the number
above, since each of these particles has spin 1/2. With regard to the
signs, the + sign for the proton indicates that the spin and the magnetic
moment must have the same direction (+ charge rotating), while the
opposite must be true for the neutron. The sign is determined experimentally
from the hyperfine structure.

* In quantum language, this would occur when the value of $h\nu$ is equal to the
difference in energy of the two orientations in the field.



[Fig. 7, panel (b): Proton]


If there is but one bound nucleon, the resulting magnetic moment
may be calculated according to the method of p. 670 by considering
the resultant of the orbital moment $l$ and the spin moment $s$, both in
units of $h/2\pi$, precessing about their resultant, $j$. Starting with an
arbitrary angle between $l$ and $s$, the cosine is expressed in terms of $j$,
$l$ and $s$, with each quantity $x^2$ replaced by the quantum-mechanical
square, $x(x + 1)$. Finally, the angle is taken to be either $0$ or $\pi$, the
only possible values. Calling the gyromagnetic ratio for the spin $g_s$,
and leaving the value $g_l$ for the orbital ratio open, the result is

$$\text{for } j = l + \tfrac{1}{2}: \quad \mu = g_l\,l + \tfrac{1}{2}g_s;$$

$$\text{for } j = l - \tfrac{1}{2}: \quad \mu = \frac{j}{j + 1}\left[g_l\,(l + 1) - \tfrac{1}{2}g_s\right].$$

The orbital $g$-factor is set equal to unity for protons; but neutrons,
because of the absence of charge, can have no orbital moment and so
$g_l$ is taken to be zero.

Using the above experimental values of $g_s$ (5.58 for protons, −3.82
for neutrons), and plotting the values of $\mu$ as functions of $j$, we get
the so-called Schmidt-Schüler curves on which the observed moments
should fall. Actually, only a few points fall near the curves themselves,
most of them scattering over the space between the “Schmidt limits”.
Taking account of errors of measurement, no point falls definitely outside
this region. The reason for the scatter is to be sought in the fact that
the unpaired nucleon is not completely independent of the others, as
was assumed.
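The two limiting curves can be tabulated directly. The sketch below uses the $g$-values quoted in the text; the expression used for $j = l - \tfrac{1}{2}$ is the standard Schmidt form, $\mu = \frac{j}{j+1}[g_l(l+1) - \tfrac{1}{2}g_s]$, stated here as an assumption, and the function and dictionary names are hypothetical.

```python
G_SPIN = {"proton": 5.58, "neutron": -3.82}   # spin g-factors from the text
G_ORBIT = {"proton": 1.0, "neutron": 0.0}     # orbital g-factors from the text


def schmidt_moment(l, j, nucleon):
    """Single-particle magnetic moment in nuclear magnetons.

    l is the orbital quantum number, j the total angular momentum
    (a half-integer given as a float), nucleon is "proton" or "neutron".
    """
    g_l, g_s = G_ORBIT[nucleon], G_SPIN[nucleon]
    if j == l + 0.5:
        return g_l * l + 0.5 * g_s
    if l > 0 and j == l - 0.5:
        return j / (j + 1.0) * (g_l * (l + 1) - 0.5 * g_s)
    raise ValueError("j must equal l + 1/2 or l - 1/2")


if __name__ == "__main__":
    for nucleon in ("proton", "neutron"):
        for l in range(0, 4):
            upper = schmidt_moment(l, l + 0.5, nucleon)
            lower = schmidt_moment(l, l - 0.5, nucleon) if l > 0 else None
            print(nucleon, l, upper, lower)
```

For $l = 0$ only the case $j = \tfrac{1}{2}$ exists, and the function then returns the free-nucleon moment $\tfrac{1}{2}g_s$.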

If we wish to compute the energy of excitation or of complete re- 
moval of a nucleon following the method used for the electron shells, 
we must make some sort of assumption regarding the trend of the short- 
range nuclear forces. A possibility that takes into account the short 
range of nuclear forces is the square-well potential function, mentioned 
on p. 770, for which V is zero beyond a certain distance a and equal 
to a constant value — V 0 within this distance. Using this assumption, 
the simple case of the deuteron will be treated. The Schrodinger equa- 
tion is 

$$\Delta\psi + \frac{8\pi^2 M}{h^2}\,[E - V(r)]\,\psi = 0. \tag{55}$$

For the deuteron, $M$ is equal to the “reduced mass” (p. 651), $\tfrac{1}{2}M_p$.
For heavy nuclei, $M$ can be taken to be that of the last bound nucleon,
or $M_p$. Since the magnetic moment of the deuteron turns out to be
very nearly equal to the sum of the magnetic spin moments of proton
and neutron, there is no orbital moment. This means that the quantum
number corresponding to orbital motion is zero and the $\psi$ function is
spherically symmetric. If we set

$$\psi = \frac{u(r)}{r},$$

we get a simple equation for $u$:

$$\frac{d^2u}{dr^2} + \frac{8\pi^2 M}{h^2}\,[E - V(r)]\,u = 0.$$

In order to avoid difficulties with the algebraic sign, we set the negative
binding energy $E$ equal to $-W$ and obtain

$$\frac{d^2u}{dr^2} + \frac{8\pi^2 M}{h^2}\,(V_0 - W)\,u = 0 \quad \text{for } r < a;$$

$$\frac{d^2u}{dr^2} - \frac{8\pi^2 M}{h^2}\,W\,u = 0 \quad \text{for } r > a.$$

The integrals are

$$\begin{aligned}
\text{for } r < a:&\quad u = A\sin k_1 r, &\quad k_1 &= \frac{2\pi}{h}\sqrt{2M(V_0 - W)};\\
\text{for } r > a:&\quad u = Be^{-k_2 r}, &\quad k_2 &= \frac{2\pi}{h}\sqrt{2MW}.
\end{aligned} \tag{59}$$


These two solutions must be so fitted that for $r = a$ both $\psi$ and $d\psi/dr$
join continuously (cf. p. 723). The requirement of continuity of $\psi$,
and hence of $u$, furnishes a relation between the constants of integration
$A$ and $B$. This still leaves one of them at our disposal; but the normalization
condition

$$\int \psi^2(r)\,d\tau = 1, \tag{60}$$

which expresses the fact that the probability of finding the particle
somewhere in the whole of space is unity, fixes the value.

Continuity of the derivative $d\psi/dr$ and thus of $du/dr$ leads to an
additional condition that does not involve $A$ or $B$. This condition may
be evaluated by the following artifice:

If two functions $f_1(r)$ and $f_2(r)$ have the same value at $r = a$ (i.e.
$f_1(a) = f_2(a)$), and also if their first derivatives with respect to $r$ have
identical values at this point ($f_1'(a) = f_2'(a)$), then

$$\left.\frac{d}{dr}\log f_1(r)\right|_{r=a} = \frac{f_1'(a)}{f_1(a)} = \left.\frac{d}{dr}\log f_2(r)\right|_{r=a} = \frac{f_2'(a)}{f_2(a)}. \tag{61}$$

Applied to the two solutions (59), this yields

$$k_1\cot k_1 a = -k_2. \tag{62}$$

The depth and breadth of the potential well are unknown, but the binding
energy of the deuteron is known to be $W = 2.19$ Mev. Equation (62)
does not yield $V_0$ and $a$ separately, but only a relation between the two.
Inasmuch as $W$ is small in comparison with $V_0$ (cf. p. 770), we have,
from (59), $k_2/k_1 \approx \sqrt{W/V_0}$. According to (62), this means $\cot k_1 a \approx 0$,
or $k_1 a = \pi/2$. From (59), $k_1 a = 2\pi\sqrt{2MV_0a^2}/h$, so that

$$V_0 = \frac{h^2}{32Ma^2}. \tag{63}$$

Determination of the value of $A$ from the normalizing condition and of
$B$ from the boundary condition on $u(a)$ is of no further interest at this
point. Using the value of $r_0$ in place of $a$ in equation (63), we obtain
about 60 (in thousandths of a mass unit) for $V_0$, which is of
the right order of magnitude.*
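The estimate of the well depth can be reproduced, and the approximation $\cot k_1 a \approx 0$ tested, by solving the matching condition $k_1 \cot k_1 a = -k_2$ numerically. The sketch below is illustrative only: the range $a = 1.4 \times 10^{-13}$ cm is an assumed value, the constants are rounded CGS values, the function names are hypothetical, and bisection is merely one convenient way of locating the root.

```python
import math

H = 6.626e-27                       # Planck's constant, erg sec
M_REDUCED = 0.5 * 1.673e-24         # reduced mass of the deuteron, g
ERG_PER_MEV = 1.602e-6
ERG_PER_MILLI_MASS_UNIT = 1.492e-6
A_ASSUMED = 1.4e-13                 # cm; assumed width of the well


def wave_number(energy_erg):
    """(2 pi / h) * sqrt(2 M E) for the reduced mass."""
    return 2.0 * math.pi / H * math.sqrt(2.0 * M_REDUCED * energy_erg)


def depth_estimate(a=A_ASSUMED):
    """Well depth from the approximation k1 * a = pi / 2."""
    return H ** 2 / (32.0 * M_REDUCED * a ** 2)


def matching_residual(v0, w, a=A_ASSUMED):
    """k1 cot(k1 a) + k2, which vanishes when the two solutions join smoothly."""
    k1 = wave_number(v0 - w)
    k2 = wave_number(w)
    return k1 / math.tan(k1 * a) + k2


def depth_from_matching(w, a=A_ASSUMED, iterations=200):
    """Bisection on the well depth with k1 * a kept between pi/2 and pi."""
    low = depth_estimate(a) + w          # here k1 * a = pi / 2, residual > 0
    high = 3.9 * depth_estimate(a) + w   # here k1 * a is just below pi, residual < 0
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if matching_residual(mid, w, a) > 0.0:
            low = mid
        else:
            high = mid
    return 0.5 * (low + high)


if __name__ == "__main__":
    w = 2.19 * ERG_PER_MEV
    print(depth_estimate() / ERG_PER_MILLI_MASS_UNIT)
    print(depth_from_matching(w) / ERG_PER_MILLI_MASS_UNIT)
```

The root of the full condition lies somewhat above the simple estimate, since a finite binding energy requires $k_1 a$ to exceed $\pi/2$.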
For higher quantum states the expression for $\psi$ also involves surface
harmonics multiplied by a function of r. As for atoms, an orbital 
quantum number l is introduced in order to describe the energy states. 
A lengthy calculation yields the following term sequence for the case of 
the square-well potential function:

    1s  2p  3d  2s  4f  3p  5g  4d  6h  3s
     2   6  10   2  14   6  18  10  22   2

The numbers in the lower line are the maximum numbers of nucleons 
that can reside in each state, taking account of the Pauli Principle as 
applied to the two possible orientations for the spin. These numbers 
hold for protons as well as for neutrons, since these are really different 
particles which are distinguishable by their charge. 

The assumption of a sharp-edged square well is obviously a bold
idealization of the actual situation. A more realistic assumption which
is at the same time amenable to calculation is the oscillatory potential

$$V = -V_0 + \tfrac{1}{2}M\omega_0^2 r^2. \tag{64}$$

The depth of the well is characterized by the constant $V_0$, while the
width is the opening in the parabola, which depends on the value of
$\omega_0$ (fig. 8). While this representation is far from the
actual situation for large values of $r$, this is of little consequence because
the particle is seldom found at these large distances. In this case the
term sequence and particle numbers turn out to be:

    0s  1p  2d  2s  3f  3p  4g  4d  4s  5h  5f  5p
     2   6  10   2  14   6  18  10   2  22  14   6

A somewhat better approach to the truth is obtained by averaging both 
potentials by taking the mean of each pair of closest-lying levels, which 
are given by the same value of l. 
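The particle numbers in the lower lines of both term sequences follow from the rule that a term of orbital quantum number $l$ holds $2(2l + 1)$ nucleons of one kind. A minimal sketch, with hypothetical names, that reproduces the lower lines and the running totals:

```python
from itertools import accumulate

ORBITAL_NUMBER = {"s": 0, "p": 1, "d": 2, "f": 3, "g": 4, "h": 5}

SQUARE_WELL = "1s 2p 3d 2s 4f 3p 5g 4d 6h 3s".split()
OSCILLATOR = "0s 1p 2d 2s 3f 3p 4g 4d 4s 5h 5f 5p".split()


def occupancy(term):
    """Maximum number of like nucleons in a term such as '3d': 2 * (2l + 1)."""
    l = ORBITAL_NUMBER[term[-1]]
    return 2 * (2 * l + 1)


def running_totals(terms):
    """Cumulative particle numbers as the terms are filled in order."""
    return list(accumulate(occupancy(t) for t in terms))


if __name__ == "__main__":
    print([occupancy(t) for t in SQUARE_WELL])
    print(running_totals(SQUARE_WELL))
    print([occupancy(t) for t in OSCILLATOR])
    print(running_totals(OSCILLATOR))
```

In the oscillator sequence the running totals at the ends of the groups of equal leading number are 2, 8, 20, 40, 70 and 112.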

The spin, which has the effect of bringing about a further splitting
of the energy levels, has not been taken into account up to this point.
The spin-orbital interaction is greatest here, so that the situation is
analogous to the jj-coupling of the electron shells rather than to the LS
coupling; the coupling to the other nucleons was taken into account in the
foregoing treatment by means of the nuclear potential. As in the case of
the electron shells, the splitting must increase with increasing $l$. If a
direct proportionality between the splitting and $l$ is assumed, the partial
levels of one term occasionally fall between those of another and a new
grouping results which corresponds neither to that of the square well
nor the oscillating potential function. There are gaps of considerable
size in the energy levels following these numbers of protons or neutrons:

    2   8   20   (28)   (40)   50   82   126

A number in parentheses means that the gap following it is present
but not as marked as for the others.
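The numbers in this list can be generated from the oscillator grouping by a simple counting rule. The sketch below rests on an assumption not spelled out in the text: that in each oscillator group from the third onward, the partial level of highest $l$ with $j = l + \tfrac{1}{2}$ of the next group (holding $2j + 1$ nucleons) is depressed far enough to join the group below. The function names and the parameter `first_shell` are hypothetical.

```python
def oscillator_closures(n_max):
    """Running totals after each complete oscillator group 0 .. n_max."""
    totals, running = [], 0
    for n in range(n_max + 1):
        running += (n + 1) * (n + 2)   # like nucleons held by group n
        totals.append(running)
    return totals


def closures_with_depressed_level(n_max, first_shell=2):
    """Totals when the j = l + 1/2 level of group n + 1 joins group n.

    The depressed level has l = n + 1, hence 2j + 1 = 2(n + 1) + 2 states.
    """
    osc = oscillator_closures(n_max)
    return [osc[n] + 2 * (n + 1) + 2 for n in range(first_shell, n_max + 1)]


if __name__ == "__main__":
    plain = oscillator_closures(3)
    shifted = closures_with_depressed_level(5)
    print(sorted(set(plain + shifted)))
```

The plain oscillator totals supply 2, 8, 20 and 40, while the depressed levels supply 28, 50, 82 and 126; which of these gaps are strongly marked is not decided by the counting alone.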

What do these gaps signify ? In the theory of electron shell structure, 
the closed shell of the noble gases is characterized by the fact that 
the energy levels nearest the ground state are especially far above this 
state; i.e. the electron is particularly tightly bound. For the alkali
metal immediately following, however, the electron is especially loosely 
bound. In exactly the same way, a nucleus whose number of neutrons 
or protons corresponds to one of the above so-called “magic numbers”
is particularly stable and symmetric. This affects all nuclear properties. 
For example, we note the large number of isotopes of Sn (Z = 50). 
The quadrupole moment of such nuclei is very small, increasing with 
the addition of each nucleon (cf. fig. 5). Thus the magic numbers 
correspond to closed shells, just as do the numbers of electrons in the 
outer shells of the noble gases. It must be remarked that 50 neutrons 
as well as 50 protons gives a particularly stable nucleus. This makes 
plausible the unusual stability of “ doubly magic ” nuclei, of which 
₂He⁴ is the first.


13. Beta Transformations and the Neutrino Hypothesis. 

The established fact that the energy distribution of the electrons 
emitted in β-transformations is continuous, is in sharp contradiction
to all other observations which show that the nucleus, like the elec- 
tronic shell, has a number of definite quantum states of energy. If a 
number of nuclei were to disintegrate with the emission of electrons 
only, the new nuclei would have a continuously varying content of 
energy. How can such nuclei, which occur in the radioactive series, 
emit α-rays all having the same sharp range? Here we have the choice
between only two courses: give up the conservation of energy for
β-transformations, or assume that a particle is emitted which escapes
observation. So long as there is no other evidence for the latter assumption,
it amounts of course to no more than a plausible way of getting
around the former disagreeable assumption. There is, however, other
evidence that another particle is emitted. We saw (p. 744) that, according
to observation, nuclei of odd atomic weight have moments which are
multiples of 1/2. In β-disintegration the atomic weight is not altered,
yet the moment would change by a multiple of 1/2, since the electron spin
is 1/2 and only a whole-number orbital moment can be added to this.
This difficulty is surmounted by giving the hypothetical particle, like
most of the stable particles, a spin of 1/2. Its mass must be much
smaller than that of the electron, presumably less than 1/5000 the
electron mass. In nuclear reactions where such particles must be 
assumed to be produced (e.g. the decay of the neutron) the charges 
of the remaining products are found to balance. This, together with the 
fact that the new particle produced no observable track, indicates that 
it has zero charge. This particle, whose existence had to be assumed in 
order to retain the conservation of energy in a number of observed 
nuclear reactions, was named the neutrino. 

Recent cloud chamber observations of β-decay, undertaken for this
purpose, show that the conservation of momentum is not fulfilled for 
the emitted electron and recoil nucleus alone. More recent experiments 
for detecting possible neutrino capture by protons now have yielded 
direct evidence for the existence of the neutrino. 

A further circumstance to be considered is the absence of any annihilation
radiation in K-capture (§7). Meitner performed an experiment
which showed that neutrinos are not simply γ-radiation. The
heat developed in a β-reaction was measured in a calorimeter which
was arranged to trap any γ-radiation which might have been produced.
Had the heat been produced by γ-rays, its amount would have corresponded
to the maximum energy of the electrons. It was found, however,
to correspond to the average energy of the electrons.

Neutrinos have such small interaction with matter that they appear 
to pass right through the solid earth without appreciable absorption. 
The question of what happens eventually to their energy is a very disturbing
one, particularly since it is far from trivial in amount. It is
estimated, for example, that of the total energy production of a uranium 
reactor, some 2000 kilowatts is carried off by neutrinos. 


14. Mesons. 

Besides the new particles already mentioned — neutron, positron 
and neutrino — there exists a group of particles whose masses lie be- 
tween that of the electron and that of the proton. These particles, 
called mesons , occur both with positive and with negative charge, and 
certain ones in the neutral state as well. The first indication of the 
existence of mesons was obtained in cloud-chamber work with cosmic 
rays. Tracks were found that were definitely thinner than those of 
protons and they were formerly ascribed to electrons. But according 
to the photographs the loss of momentum of these particles in passing 
through a lead plate is only one-tenth of that to be expected from an 
electron. This difficulty can be avoided by assuming that the particles 
have a mass of about 220 times that of the electron. Yukawa had 
already deduced the existence of such a particle from considerations 
which at first sight seem more formal. For the electrostatic field we
have Laplace’s equation $\Delta\phi = 0$. Since nuclear forces must fall away
very rapidly with distance, he wrote down an analogous differential
equation

$$\Delta\phi - k^2\phi = 0, \tag{65}$$

which, with $k = 1/r_0$, yields the solution $\phi = e^{-r/r_0}/r$ and also gives
the required short range of nuclear forces $r_0$, approximately equal to
10⁻¹³ cm. If, now, by analogy with the wave equation for electrical
forces we write down the wave equation

$$\Delta\phi - \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} - k^2\phi = 0 \tag{66}$$

for the field of the nuclear forces, comparison with the theory of de
Broglie and Schrödinger shows that this is the wave equation for
particles with rest mass $kh/2\pi c$, which corresponds in value with the
mass which we have to assume for the meson.* An essential feature
of Yukawa’s theory is that there is an interaction between mesons and
light particles which gives rise to transformations of one into the other,
just as there are transformations of neutrons into protons and vice
versa.
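The connexion between the range of the force and the rest mass $kh/2\pi c$ with $k = 1/r_0$ is easily evaluated. In the sketch below the constants are rounded CGS values, the function name is hypothetical, and the second range value, $1.4 \times 10^{-13}$ cm, is an assumption introduced only to show how sensitive the result is to $r_0$.

```python
import math

H = 6.626e-27     # Planck's constant, erg sec
C = 2.998e10      # velocity of light, cm/sec
M_E = 9.109e-28   # electron rest mass, g


def meson_mass_in_electron_masses(range_cm):
    """Rest mass k h / (2 pi c), with k = 1 / range, in units of the electron mass."""
    k = 1.0 / range_cm
    return k * H / (2.0 * math.pi * C) / M_E


if __name__ == "__main__":
    for r0 in (1.0e-13, 1.4e-13):
        print(r0, round(meson_mass_in_electron_masses(r0)))
```

A range of about $1.4 \times 10^{-13}$ cm gives a mass of some 270 to 280 electron masses, while $10^{-13}$ cm gives a correspondingly larger value, the mass being inversely proportional to the range.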

Experiments with mesons produced artificially by impact of high
energy (several hundred Mev) protons or α-particles on heavy nuclei
have shown that the mesons involved in nuclear forces are more massive
than the ones originally found in cosmic rays. The heavier mesons are
called π-mesons. They occur with charges of +e, −e and 0. The π⁺ and
π⁻ have a mass of 273 $m_0$, while the π⁰ has a mass of 264 $m_0$. The lighter
ones, called μ-mesons, do not produce nuclear disintegrations. They
occur as μ⁺ and μ⁻ and have a mass of 207 $m_0$. It will be remembered
that X-ray “Bremsstrahlung” is caused by the acceleration of electrons.
In an analogous way, π-mesons are conceived to be formed when
nuclear particles are accelerated. Both the π- and the μ-mesons are
unstable. The half-life of a π-meson amounts to about 2 × 10⁻⁸ sec;
that of a μ-meson is about 100 times longer.† The current theories
seek to account for these and other possible mesons as excited mass-
states of a single kind of particle.


* In the non-relativistic wave mechanics, which we have always used thus
far, [...]. The relativistic wave equation for a free particle, which was first
developed by de Broglie, has the form of equation (66) for the nuclear forces,
$k^2$ being replaced by $4\pi^2m_0^2c^2/h^2$. This gives by comparison
$m_0 = kh/2\pi c$. For mesons, being particles of very high velocity, the
relativistic equation is called for.

† The values of the half-life are referred to a system in which the particle is
at rest. For velocities approaching that of light, the lifetime referred to a
stationary observer may be as much as 60 times these values.
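The factor by which the lifetime is lengthened for a stationary observer is the relativistic factor $1/\sqrt{1 - v^2/c^2}$. The sketch below shows what velocity corresponds to a factor of 60 and how far a particle would travel in one (lengthened) lifetime; the function names are hypothetical, and using the rest lifetimes quoted in the text as if they were mean lives is a simplifying assumption.

```python
import math

C = 2.998e10   # velocity of light, cm/sec


def lorentz_factor(v_over_c):
    """Lengthening factor of the lifetime for a particle of velocity v."""
    return 1.0 / math.sqrt(1.0 - v_over_c ** 2)


def speed_for_factor(gamma):
    """Velocity, as a fraction of c, that gives the lengthening factor gamma."""
    return math.sqrt(1.0 - 1.0 / gamma ** 2)


def observed_lifetime(rest_lifetime_s, gamma):
    """Lifetime referred to a stationary observer."""
    return gamma * rest_lifetime_s


def path_length_km(rest_lifetime_s, gamma):
    """Distance covered during one observed lifetime, in kilometres."""
    v = speed_for_factor(gamma) * C
    return v * observed_lifetime(rest_lifetime_s, gamma) / 1.0e5


if __name__ == "__main__":
    print(speed_for_factor(60.0))
    print(path_length_km(2.0e-8, 60.0))   # pi-meson
    print(path_length_km(2.0e-6, 60.0))   # mu-meson
```

The hundredfold longer lifetime of the μ-meson translates directly into a hundredfold longer path at the same velocity.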
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15. Brief Survey of Our Knowledge concerning Cosmic Rays. 

As far as the amount of converted energy is concerned, chemical 
processes may be characterized as the “physics of electron volts”,
nuclear transformations as “Mev physics” and cosmic-radiation
phenomena as “million Mev physics”.* It is to be expected that an
increase of many orders of magnitude in the energy involved brings
with it entirely new phenomena. The first thing that had to be done
in order to explain the nature of cosmic radiation was to distinguish the 
effects of the primary rays from those of the secondary. The effects of 
the primaries must increase monotonically with altitude, reaching a 
constant value beyond the atmosphere. The effects of the secondaries, 
on the other hand, must attain a maximum at a certain height; for 
they originate in the air itself and the primary radiation which causes 
them will be progressively absorbed. 

Apart from a small contribution of secondary particles scattered 
from below, the primary rays can be studied in almost pure form at 
great altitudes. Experiments have been carried out in aircraft up to 
about 10 miles, recording apparatus and photographic plates have been 
taken to heights of about 20 miles in unmanned balloons, and high- 
altitude rockets have carried instruments to over a hundred miles. 
The principal effect of the primary particles is the production of single, 
very energetic collisions which show up on a photographic plate as 
“stars” in which many nucleon and π-meson tracks radiate from the
point of impact. The number of ion pairs simultaneously produced by 
these particles is sufficient to actuate an electrometer of high sensitivity. 

At the present time, the most important means of investigating 
the primaries is the photographic-plate technique. Here the tracks are
produced by fragments of silver or bromine atoms which have been 
disintegrated by the intrusion of a primary particle. What is the 
nature of these particles? They must be charged, since their distri- 
bution is found to depend on the geomagnetic latitude of the place of 
observation. To-day it can be considered well established that the 
majority of the primary particles are protons. There are also heavier 
particles present, ranging up as far as Fe nuclei, α-particles being
especially numerous. Because of their greater range, the protons pre- 
dominate at greater depths. 

At the earth’s surface and at moderate altitudes only few primary 
particles are observed, most of the rays being a mixture of quite diverse 
secondary particles. Although very energetic secondary nucleons and 
π⁻-mesons are able to produce small stars, these particles constitute


* In American publications the abbreviation Bev is sometimes used to denote
a unit of 10⁹ electron volts. It must be remembered that the term “billion” stands
for 10⁹ in America and for 10¹² in Britain.
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only a small part of the secondary radiation.* It has been known for 
a long time that the radiations detected at lower levels in the atmos- 
phere consist of two components — a penetrating (hard) component and 
a softer one. The separation of the components and the determination 
of their properties have been the object of an extremely large amount 
of experimental and theoretical work. It has been shown that the hard 
component consists of μ-mesons while the soft component is made up
of less energetic electrons and γ-photons occurring in the form of
extended showers stretching over distances often exceeding hundreds
of metres.

The μ-mesons, formed by the disintegration of π-mesons, have a
lifetime long enough to permit them to penetrate deep into the earth.
Their ionizing power is so slight compared with their energy that a
particle having an energy of 10¹¹ ev is able to penetrate 500 metres of
water. The electrons and γ-photons begin to appear even at the very
top of the atmosphere. Here the short-lived neutral π-mesons decay into
two γ-photons. By pair production and Compton processes these give
rise to fast electrons. The electrons, in turn, produce γ-photons when
retarded, and so the process continues. The result is a cascade shower
in which the available energy is rapidly shared among a large number 
of particles of low penetrating power. 
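The depth of penetration quoted for a fast μ-meson can be made plausible by a rough estimate. The sketch below assumes a constant ionization loss of about 2 Mev per g/cm² of material traversed, a typical figure for fast singly charged particles that is not given in the text, and neglects all other losses; the function name and parameters are hypothetical.

```python
def range_in_metres(energy_ev, loss_ev_per_g_cm2=2.0e6, density_g_cm3=1.0):
    """Path length over which the whole energy is spent on ionization.

    Assumes a constant loss per g/cm^2; density 1.0 g/cm^3 corresponds to water.
    """
    thickness_g_cm2 = energy_ev / loss_ev_per_g_cm2
    return thickness_g_cm2 / density_g_cm3 / 100.0


if __name__ == "__main__":
    print(range_in_metres(1.0e11))                      # water
    print(range_in_metres(1.0e11, density_g_cm3=2.7))   # rock, assumed density
```

Under these assumptions an energy of 10¹¹ ev corresponds to a path of the order of 500 metres in water, in agreement with the figure given above.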

Special triggering arrangements are needed to catch the beginning 
of a large shower in a cloud chamber experiment. If the point of origin 
happens to be located in the material in a large chamber, the electro- 
meter will show an abrupt impulse as for an explosive shower (see 
above). Usually only tracks of individual electrons or positrons are 
seen in the cloud chamber, and sometimes μ-meson tracks as well.

The biggest question of all is that of the origin of cosmic radiation. 
At first it was thought that the rays were due to some mysterious 
instance of conversion of matter into energy. This is known to be false, 
if only for the reason that the annihilation of even the most massive 
atom would not be enough to account for the energies of up to perhaps 
10 1 ev that are encountered. In the light of present experience with 
particles accelerated in the laboratory, it seems plausible that the cosmic 
particles obtain their energy from stray magnetic fields in space. The 
particles may in part originate in the sun. The magnetic fields, because of 
their enormous extent in space, need not be strong, but might be more
concentrated in the neighbourhood of certain stars, especially those 
showing strong emission of radio waves. It is still very much an open 
question as to which of the possible mechanisms actually obtain. The 
fact that the observed intensity is the same in all directions is in accord
with such theories, for the particles are probably under way for thou- 
sands of years before reaching the earth. 

* The π⁺-mesons are repelled by the nuclear charge and so do not ordinarily lead
to [...]
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CHAPTER XLIII 


Further Matters connected with the Geometric Optics 

of Light and of Electrons 

1. Condition for Absence of Distortion and its Relation to the Sine
Condition. 

On p. 406 we deduced the condition governing the point-for-point 
imaging of an element of area normal to the axis with the provision 
that axial points be imaged without distortion by means of wide 
pencils of rays. Inasmuch as the squares of the linear dimensions of 
the area were neglected in this derivation, the resulting sine condition 
guarantees only the imaging of a small area, e.g. the field of view of a 
microscope. But if requirements in the quality of the image near the 
boundaries are less stringent, the range of validity of the result may 
be considerably extended. We wish now to consider the other extreme 
—the imaging of a large field of view by means of narrow pencils. The 
limiting case is that of the pinhole camera, which uses no optical unit 
(such as a lens) at all. The sharpness of the image depends solely on 
the size of the aperture. We need not use such narrow bundles of rays 
if we use an optical system, but they must still be fairly small if other 
errors such as astigmatism (p. 409) are to be minimized. It is a chal- 
lenge to the skill of the optical designer to keep these other defects 
small so that good images result even when a large field of view and 

wide pencils are employed. We shall not concern ourselves with these 
other errors. 

Let the cross-section of the entering bundle of rays be limited by 
a diaphragm AB, fig. 1, assumed to be placed in front of a lens system 
S, indicated by the two curved surfaces. In this case AB is also called 
the entrance pupil. If there happen to be lenses on the side toward the 
object, the entrance pupil is found by locating the image of the dia- 
phragm as formed by these lenses. The reason is that a ray passing 
through the edge of the entrance pupil must then pass through the 
edge of the diaphragm, since entrance pupil and aperture correspond 
to each other. In the same way, the boundaries of the pencil of rays 
on the image side are determined by locating the image of the dia- 
phragm as formed by any lenses lying on this side of it. This image of 

the diaphragm opening is called the exit pupil and is represented by 
A'B' in the figure. 
Further, the rays drawn from individual points of the object or of 
the image to the centre of the corresponding pupil (O or O' in the 
sketch) are called principal rays. These rays are taken to represent the 
pencils traversing the pupils from each of the various object points. 
Let P, Q, R represent three points located on a perpendicular to the 
optical axis, and let P', Q', R' be their respective images. What con- 
dition must hold in order that object and image be similar? Obviously 
we must have P'R' : Q'R' = PR : QR. In terms of the labelling used 
in the diagram, this means 


\[ \frac{\tan u_1'}{\tan u_2'} = \frac{\tan u_1}{\tan u_2}, \]


or the condition for similarity (distortionless imaging) becomes 


\[ \frac{\tan u_1}{\tan u_1'} = \text{const.} \]



If we compare this with the sine condition, the two requirements 
seem at first sight incompatible. However, at least within certain 



limits, this is not so. The sine condition refers to the angles between 
the axis and the various rays drawn from an axial point R through the 
several points of the entrance pupil, as indicated by v_1, v_2, etc., in fig. 1. 
In other words, the system S must be figured so that it fulfils the sine 
condition for the plane PQR as a whole and the tangent condition for 
the principal rays. The realization of good definition is the first con- 
sideration, while in many applications some distortion can be tolerated. 
For example, if the system is so corrected that the sine condition is 
fulfilled for the imaging of the entrance and exit pupils, there will still 
be distortion in the final image. It is easy to compute how a rect- 
angular network placed in the plane P'Q'R' will be imaged when, in 
place of the tangent condition, the sine condition is satisfied. Re- 
versing affairs now, and placing the distorted grid at PQR, its image 
at P'Q'R' will be rectangular. This was used by Abbe as a sensitive 
test of the sine correction of microscope objectives. He placed the 
entrance pupil AB — a small hole — in the object plane of the objective 
to be tested, and noted whether the image formed at P'Q'R', the plane 
for which the figure was computed, was free of distortion. Fig. 2 
shows one quadrant of such a test figure. 
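
The size of the distortion that remains when the sine condition, rather than the tangent condition, holds for the principal rays can be estimated numerically. The following Python sketch is only an illustration: the pupil distances `d` and `d_img`, the constant `k` and all function names are assumptions chosen for the example and are not taken from the text.

```python
import math

def image_height_tangent(y, d, d_img, k):
    """Image height when tan u' = k tan u holds (distortion-free imaging)."""
    return d_img * k * (y / d)

def image_height_sine(y, d, d_img, k):
    """Image height when sin u' = k sin u holds for the principal rays."""
    u = math.atan2(y, d)                 # inclination of the principal ray, object side
    u_img = math.asin(k * math.sin(u))   # inclination on the image side
    return d_img * math.tan(u_img)

def relative_distortion(y, d, d_img, k):
    """Fractional departure of the sine-condition image from the similar image."""
    ideal = image_height_tangent(y, d, d_img, k)
    actual = image_height_sine(y, d, d_img, k)
    return (actual - ideal) / ideal

if __name__ == "__main__":
    d, d_img, k = 100.0, 100.0, 0.5      # assumed geometry, arbitrary length units
    for y in (1.0, 10.0, 30.0, 60.0):
        print(y, relative_distortion(y, d, d_img, k))
```

Near the axis the two conditions agree; the departure grows with the field angle, which is why a rectangular grid comes out with curved lines under these assumptions.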



Fig. 2 


2. Focal Length of an Electron Lens. 

The parallelism between ray optics and the mechanics of particles 
explained on p. 693 shows that in representing the orbit of an electron 
in an electric field by a ray of light in a homogeneous medium we must 
think of the medium as having an index of refraction proportional to 
the speed of the electron. This follows from a comparison of equations 
(15) and (16). As long as we confine attention to geometric electron- 
optics, the factor of proportionality is immaterial and equation (23) 
is suitable for a wave-mechanical treatment. If the electrons owe their 
velocities only to electric fields, i.e. if their initial thermal motion can 
be neglected, then n may be set equal to the square root of the potential 
difference V that exists between the region considered and the cathode. 
As long as only rays near the axis are considered, a rotationally sym- 
metric electric field (fig. 3) corresponds to a series of centred spherical 
surfaces as already treated on p. 405. The sole difference between this 
case and ordinary optics is that here the index of refraction changes 
continuously, whereas in optics spherical surfaces usually separate 
regions in each of which the index is constant but different from one 
region to the next. This discontinuity does not introduce the slightest 
difficulty as long as the distance between shells can be neglected. This, 
of course, makes the result a fairly rough approximation. Successive 
application of equation (25), p. 405, to a series of k layers bounded by 
centred spheres gives, for the power, 



\[ \frac{1}{f'} = \frac{1}{n'} \sum_{i=1}^{k+1} n_i \phi_i . \]



On the other hand, from equation (21), p. 405, the power of a 
spherical surface of radius ρ_i and discontinuity of index amounting 
to Δn_i is 

\[ \phi_i = \frac{\Delta n_i}{n_i \rho_i}. \qquad (3) \]



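If n changes continuously along the axis, 

\[ \frac{1}{f'} = \frac{1}{n'} \int \frac{1}{\rho}\,\frac{dn}{dx}\,dx. \]

In the electrical case, n = √V, and so 

\[ \frac{1}{f'} = \frac{1}{\sqrt{V'}} \int_{-\infty}^{+\infty} \frac{1}{\rho}\,\frac{d\sqrt{V}}{dx}\,dx = \frac{1}{2\sqrt{V'}} \int_{-\infty}^{+\infty} \frac{1}{\rho}\,\frac{1}{\sqrt{V}}\,\frac{dV}{dx}\,dx. \qquad (6) \]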




The integration may be performed between the limits -∞ and +∞ 
because dV/dx vanishes in the field-free space and there is no con- 
tribution except where dV/dx differs from zero. The quantity V' de- 
notes the constant potential that exists at a great distance beyond the 
lens. 

The radii of curvature of the equipotential surfaces may be found 
in terms of the change of V measured along the axis by using dif- 
ferential geometry and the potential equation ΔV = 0. This fact 
brings out a fundamental difference between ordinary optics and elec- 
tron optics which complicates the process of computing electron lenses. 
In the case of light, the curvatures and indices of refraction of lenses 
can be selected independently, and so by skilful choice the lens errors 
can be compensated. In electron optics, however, the radii are already 
determined by the change of index along the axis. The result of the 
somewhat cumbersome calculation, which may be transformed by 
partial integration, takes the following forms, since dV/dx vanishes 
at great distances: 

\[ \frac{1}{f'} = \frac{1}{4\sqrt{V'}} \int_{-\infty}^{+\infty} \frac{d^2V}{dx^2}\,\frac{1}{\sqrt{V}}\,dx = \frac{1}{8\sqrt{V'}} \int_{-\infty}^{+\infty} V^{-3/2} \left(\frac{dV}{dx}\right)^2 dx. \qquad (7) \]
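
The partial integration that connects the two forms of the thin-shell result, one containing d²V/dx² divided by √V and the other containing V^(-3/2)(dV/dx)², can be checked numerically. The Python sketch below does this for an assumed axial potential that becomes constant far from the lens; the potential profile, the integration limits and the function names are illustrative assumptions, not data from the text.

```python
import math

def V(x):
    """Assumed axial potential (arbitrary units), constant far from the lens."""
    return 1.0 + 0.5 * math.exp(-x * x)

def dV(x):
    return -x * math.exp(-x * x)

def d2V(x):
    return (2.0 * x * x - 1.0) * math.exp(-x * x)

def integrate(f, a, b, n=4000):
    """Composite Simpson rule with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def power_first_form(V_out, lo=-8.0, hi=8.0):
    """(1 / 4 sqrt(V')) * integral of V''(x) / sqrt(V) dx."""
    return integrate(lambda x: d2V(x) / math.sqrt(V(x)), lo, hi) / (4.0 * math.sqrt(V_out))

def power_second_form(V_out, lo=-8.0, hi=8.0):
    """(1 / 8 sqrt(V')) * integral of V^(-3/2) (dV/dx)^2 dx."""
    return integrate(lambda x: V(x) ** -1.5 * dV(x) ** 2, lo, hi) / (8.0 * math.sqrt(V_out))

if __name__ == "__main__":
    print(power_first_form(1.0), power_second_form(1.0))
```

The second form makes it evident that the power is positive whatever the sign of the field, since its integrand is a square.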





As we can see by considering fig. 3, the repeated neglect of the 
distance between lenses must result in a considerable total error, but 
the optical computation taking these distances into account becomes 
very troublesome. Generally, in electron optics, the calculation of 
electron paths leads more quickly to the result than does the formal 
comparison with ordinary optics. Moreover, the former has the ad- 
vantage that all the required equations are already at hand. For this 
reason we shall compute the focal length only in this way. As an 
approximation for narrow pencils we equate the projection of the 
orbital velocity on the axis to this velocity itself. Then 

\[ \frac{dx}{dt} = v = \sqrt{\frac{2eV}{m}}, \qquad (8) \]

where V is the potential referred to the cathode. 

Let the radial component of the field at a distance r from the axis 
be denoted by E_r. Then, since 

\[ \frac{dr}{dt} = \frac{dr}{dx}\,\frac{dx}{dt} = v\,\frac{dr}{dx}, \]

we have 

\[ m\,\frac{d^2r}{dt^2} = m v\,\frac{d}{dx}\!\left(v\,\frac{dr}{dx}\right) = -eE_r. \qquad (9) \]

In a charge-free space, E_r can be expressed in terms of E_x, the com- 
ponent of the field along the axis. Imagine a cylinder of radius r and 
short length l described about the axis, and calculate the surface 
integral of the field over this cylinder. It may usually be assumed that 
there is no charge within the cylinder since the space charge due to the 
electrons is negligible. Then the surface integral is 


\[ 2\pi r l E_r + \pi r^2 l\,\frac{dE_x}{dx} = 0 \quad\text{or}\quad E_r = -\frac{r}{2}\,\frac{dE_x}{dx} = \frac{r}{2}\,\frac{d^2V}{dx^2}. \qquad (10) \]


The differential equation of the electron orbit then becomes 

\[ \sqrt{V}\,\frac{d}{dx}\!\left(\sqrt{V}\,\frac{dr}{dx}\right) = -\frac{r}{4}\,\frac{d^2V}{dx^2}. \qquad (11) \]


Examination of this equation shows that: 

1. The specific charge e/m does not appear, so that the result holds 

for charged particles of any kind. 

2. The equation is homogeneous in V, so that changing the poten- 

tials of all electrodes by the same factor leaves everything 
the same. This shows why an electrostatic electron micro- 
scope is not sensitive to variations in voltage. 

3. The equation is homogeneous in r and x. Thus, if all the dimen- 

sions of an electron microscope are changed by a given factor, 
the magnification remains unchanged. 
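
The second and third observations can be illustrated by integrating the orbit equation numerically, written as r'' = -(V'/2V) r' - (V''/4V) r, and comparing trajectories. The Python sketch below is a minimal check under assumed conditions: the Gaussian-shaped axial potential, the step count and the function names are hypothetical choices made for the example.

```python
import math

def make_potential(scale=1.0, length=1.0):
    """Assumed axial potential V = scale * (1 + 0.5 exp(-(x/length)^2)) and its derivatives."""
    def V(x):
        return scale * (1.0 + 0.5 * math.exp(-(x / length) ** 2))
    def dV(x):
        return scale * (-x / length ** 2) * math.exp(-(x / length) ** 2)
    def d2V(x):
        return scale * (2.0 * x * x / length ** 4 - 1.0 / length ** 2) * math.exp(-(x / length) ** 2)
    return V, dV, d2V

def trace(V, dV, d2V, x0, x1, r0, steps=4000):
    """Runge-Kutta (RK4) integration of the paraxial orbit, starting parallel to the axis."""
    def acc(x, r, s):
        return -(dV(x) / (2.0 * V(x))) * s - (d2V(x) / (4.0 * V(x))) * r
    h = (x1 - x0) / steps
    x, r, s = x0, r0, 0.0
    for _ in range(steps):
        k1r, k1s = s, acc(x, r, s)
        k2r, k2s = s + 0.5 * h * k1s, acc(x + 0.5 * h, r + 0.5 * h * k1r, s + 0.5 * h * k1s)
        k3r, k3s = s + 0.5 * h * k2s, acc(x + 0.5 * h, r + 0.5 * h * k2r, s + 0.5 * h * k2s)
        k4r, k4s = s + h * k3s, acc(x + h, r + h * k3r, s + h * k3s)
        r += h * (k1r + 2.0 * k2r + 2.0 * k3r + k4r) / 6.0
        s += h * (k1s + 2.0 * k2s + 2.0 * k3s + k4s) / 6.0
        x += h
    return r, s   # height and slope at x1

if __name__ == "__main__":
    print(trace(*make_potential(1.0, 1.0), -6.0, 6.0, 1.0))    # reference lens
    print(trace(*make_potential(50.0, 1.0), -6.0, 6.0, 1.0))   # all potentials times 50
    print(trace(*make_potential(1.0, 2.0), -12.0, 12.0, 2.0))  # all lengths doubled
```

Multiplying the potential by a constant leaves height and slope unchanged, and doubling all lengths doubles the height while leaving the slope unchanged, in line with the homogeneity of the equation.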

In order to integrate the equation of the orbit, write it in the form 

\[ \frac{d^2r}{dx^2} + \frac{1}{2V(x)}\,\frac{dV(x)}{dx}\,\frac{dr}{dx} + \frac{r}{4V(x)}\,\frac{d^2V(x)}{dx^2} = 0. \qquad (12) \]




This shows that we are dealing with a second-order linear differential 
equation whose coefficients are functions of x and may be determined 
by measurement or by calculation. We can bring the equation into 
simpler form by putting 

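\[ r = R(x)\,f(V), \qquad (13) \]

determining f(V) so that the term in dR/dx vanishes. The calculation 
readily gives 

\[ f(V) = V^{-1/4} \qquad (14) \]

and 

\[ \frac{d^2R}{dx^2} = -\frac{3}{16}\,\frac{1}{V^2}\left(\frac{dV}{dx}\right)^2 R. \qquad (15) \]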





The integration may be carried out by successive approximations, as 

was done in the case of equation (36), p. 101. 

In order to compute the focal length, consider a ray in the object 
space, parallel to the axis and located a distance a from the axis. Its 
intersection with the axis in the image space determines the principal 
focal point F'. From the definition of focal length (p. 399), 

\[ \frac{1}{f'} = -\frac{1}{a}\left(\frac{dr}{dx}\right)_{F'}. \qquad (16) \]

Since the focus lies in a region of constant potential V', (13) yields 

\[ \left(\frac{dr}{dx}\right)_{F'} = V'^{-1/4}\left(\frac{dR}{dx}\right)_{F'}. \qquad (17) \]

In front of the lens we have the constant potential V_0, and so 

\[ a = R_0 V_0^{-1/4}. \qquad (18) \]

If we now wish to compute (dR/dx)_{F'} from equation (15), we can, as a 
first approximation, take the value R = aV_0^{1/4} which holds to the left 
of the lens. By integration we then get 

\[ \left(\frac{dR}{dx}\right)_{F'} = -\frac{3}{16}\,a V_0^{1/4} \int_{-\infty}^{+\infty} \frac{1}{V^2}\left(\frac{dV}{dx}\right)^2 dx, \qquad (19) \]

where the upper limit may be taken to be infinity because the focus is 
in a field-free region. The power of the system then becomes 



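\[ \frac{1}{f'} = \frac{3}{16}\left(\frac{V_0}{V'}\right)^{1/4} \int_{-\infty}^{+\infty} \left(\frac{dV}{dx}\right)^2 \frac{1}{V^2}\,dx. \qquad (20) \]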



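For a single lens the potential has the same value on both sides, and 
a better expression for the power of a single lens is obtained by re- 
placing dV/dx by E: 

\[ \frac{1}{f} = \frac{3}{16} \int_{-\infty}^{+\infty} \left(\frac{E}{V}\right)^2 dx. \qquad (21) \]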
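
How good the first approximation is can be judged by comparing it with a direct numerical integration of the orbit equation. The Python sketch below traces a ray that enters parallel to the axis, takes the power as minus the exit slope divided by the entrance height, and compares this with (3/16) times the integral of (E/V)² along the axis. The axial potential, the integration range and all names are assumptions made for the illustration.

```python
import math

def V(x):
    """Assumed axial potential of a weak single lens (same value on both sides)."""
    return 1.0 + 0.5 * math.exp(-x * x)

def dV(x):
    return -x * math.exp(-x * x)

def d2V(x):
    return (2.0 * x * x - 1.0) * math.exp(-x * x)

def exit_slope(a, x0=-6.0, x1=6.0, steps=4000):
    """RK4 integration of r'' = -(V'/2V) r' - (V''/4V) r for a ray entering at height a."""
    def acc(x, r, s):
        return -(dV(x) / (2.0 * V(x))) * s - (d2V(x) / (4.0 * V(x))) * r
    h = (x1 - x0) / steps
    x, r, s = x0, a, 0.0
    for _ in range(steps):
        k1r, k1s = s, acc(x, r, s)
        k2r, k2s = s + 0.5 * h * k1s, acc(x + 0.5 * h, r + 0.5 * h * k1r, s + 0.5 * h * k1s)
        k3r, k3s = s + 0.5 * h * k2s, acc(x + 0.5 * h, r + 0.5 * h * k2r, s + 0.5 * h * k2s)
        k4r, k4s = s + h * k3s, acc(x + h, r + h * k3r, s + h * k3s)
        r += h * (k1r + 2.0 * k2r + 2.0 * k3r + k4r) / 6.0
        s += h * (k1s + 2.0 * k2s + 2.0 * k3s + k4s) / 6.0
        x += h
    return s

def power_traced(a=1.0):
    """1/f from the traced ray: minus the exit slope over the entrance height."""
    return -exit_slope(a) / a

def power_first_approximation(x0=-6.0, x1=6.0, n=4000):
    """(3/16) * integral of (E/V)^2 dx, with E = -dV/dx, by Simpson's rule."""
    f = lambda x: (dV(x) / V(x)) ** 2
    h = (x1 - x0) / n
    total = f(x0) + f(x1)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(x0 + i * h)
    return 3.0 / 16.0 * total * h / 3.0

if __name__ == "__main__":
    print(power_traced(), power_first_approximation())
```

For a weak lens such as the assumed one the two values should agree to within a few per cent; a stronger field would call for the further steps of the successive approximation.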



Similarly, the power of a lens formed by a short, rotationally symmetric 
magnetic field H is found to be 

\[ \frac{1}{f} = \frac{e}{8mc^2V} \int_{-\infty}^{+\infty} H^2\,dx, \qquad (22) \]

V being the potential through which the electrons have fallen. 



CHAPTER XLIV 


Piezoelectricity and its Applications 


1. The Phenomenon of Piezoelectricity. 

When an ionic crystal is deformed an electrical volume-polariza- 
tion results, and this, according to the discussion on p. 281, makes itself 
known by the appearance of charges on the surface. In a rough way, 
the occurrence of this piezoelectric polarization may be pictured by 
thinking of a regular tetrahedron whose centre is occupied by an ion 

carrying four positive charges, while each 
corner is occupied by a singly charged 
negative ion (fig. 1). Inasmuch as the 
centre of gravity of the negative charge 
coincides with the position of the positive 
charge, the structure as a whole has no 
dipole moment. If, now, the central ion 
is held in position while the top point 
is pushed downward, a dipole moment 
will obviously arise. In simple crystal 
lattices such as NaCl there is no resul- 
tant dipole moment per unit volume, since the symmetry of the lattice 
insures a mutual compensation. The fundamental requirement for 
the appearance of piezoelectricity is the presence of what is called a 
polar axis — an axis such that the plane normal to it is not a plane of 
symmetry of the crystal. The relation between the deformation and 
the resulting polarization is linear. We can therefore write for the 
components of the polarization vector: 



Fig. 1 


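\[
\begin{aligned}
P_x &= \gamma_{11}e_{11} + \gamma_{12}e_{22} + \gamma_{13}e_{33} + \gamma_{14}e_{23} + \gamma_{15}e_{31} + \gamma_{16}e_{12},\\
P_y &= \gamma_{21}e_{11} + \gamma_{22}e_{22} + \gamma_{23}e_{33} + \gamma_{24}e_{23} + \gamma_{25}e_{31} + \gamma_{26}e_{12},\\
P_z &= \gamma_{31}e_{11} + \gamma_{32}e_{22} + \gamma_{33}e_{33} + \gamma_{34}e_{23} + \gamma_{35}e_{31} + \gamma_{36}e_{12}.
\end{aligned}
\qquad (1)
\]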


The e_ik are the components of the strain tensor as defined on p. 164; 
the coefficients γ_ik are called the piezoelectric constants. Depending 
upon the degree of symmetry of the crystal, the actual number of 
constants is reduced. For quartz, the most important of such crystals 
in practice, the scheme of coefficients referred to a co-ordinate system 
conforming to the axes of symmetry (fig. 2) has the following ap- 
pearance: 


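\[
\begin{pmatrix}
\gamma_{11} & -\gamma_{11} & 0 & \gamma_{14} & 0 & 0\\
0 & 0 & 0 & 0 & -\gamma_{14} & -\gamma_{11}\\
0 & 0 & 0 & 0 & 0 & 0
\end{pmatrix}
\qquad (2)
\]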
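
As a small worked illustration, the linear relation between strain and polarization can be evaluated with the quartz scheme of coefficients. The Python sketch below assumes that the scheme has the rows (γ_11, -γ_11, 0, γ_14, 0, 0), (0, 0, 0, 0, -γ_14, -γ_11) and (0, 0, 0, 0, 0, 0), and uses the C.G.S. values of γ_11 and γ_14 quoted in this section; the function name and the sample strains are hypothetical.

```python
GAMMA_11 = 4.77e4   # quartz, C.G.S. units, as quoted in the text
GAMMA_14 = 1.23e4

# Assumed quartz scheme; columns correspond to e11, e22, e33, e23, e31, e12.
QUARTZ_GAMMA = [
    [GAMMA_11, -GAMMA_11, 0.0, GAMMA_14, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, -GAMMA_14, -GAMMA_11],
    [0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
]

def polarization(strain, gamma=QUARTZ_GAMMA):
    """Return (Px, Py, Pz) for strain = (e11, e22, e33, e23, e31, e12)."""
    return tuple(sum(g * e for g, e in zip(row, strain)) for row in gamma)

if __name__ == "__main__":
    print(polarization((1e-6, 0.0, 0.0, 0.0, 0.0, 0.0)))    # stretch along x
    print(polarization((1e-6, 1e-6, 1e-6, 0.0, 0.0, 0.0)))  # uniform dilatation
```

With this scheme a uniform dilatation gives no polarization, and no strain whatever produces a component along the z axis.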



In the place of the components of the strain tensor one may intro- 
duce those of the stress tensor.* In a crystal, however, the relation 
between the two tensors is much more complex than in isotropic 
bodies, where two elastic moduli (for example, E and σ) give the com- 
plete connexion. In the general case, the symmetric scheme of co- 
efficients relating the six components of the stress tensor with the 
six components of the strain tensor contains 21 different constants. 
Expressing the e_ik in terms of the P_ik by means of these relations, we 
get: 


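\[
\begin{aligned}
-P_x &= \delta_{11}P_{11} + \delta_{12}P_{22} + \delta_{13}P_{33} + \delta_{14}P_{23} + \delta_{15}P_{31} + \delta_{16}P_{12},\\
-P_y &= \delta_{21}P_{11} + \delta_{22}P_{22} + \delta_{23}P_{33} + \delta_{24}P_{23} + \delta_{25}P_{31} + \delta_{26}P_{12},\\
-P_z &= \delta_{31}P_{11} + \delta_{32}P_{22} + \delta_{33}P_{33} + \delta_{34}P_{23} + \delta_{35}P_{31} + \delta_{36}P_{12}.
\end{aligned}
\qquad (3)
\]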



Reciprocally, if an electric field is applied to a piezocrystal, the 
resulting polarization causes elastic stresses and deformation. When 
the components of the strain tensor are expressed in terms of the com- 
ponents of the field strength, the coefficients δ_ik appear. The deduction 
is similar to that of equation (56), p. 176. We calculate the expression 
for the energy of the crystal in the electric field. The results for the 
components of the strain tensor are 


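\[
\begin{aligned}
e_{11} &= \delta_{11}E_x + \delta_{21}E_y + \delta_{31}E_z,\\
e_{22} &= \delta_{12}E_x + \delta_{22}E_y + \delta_{32}E_z,\\
e_{33} &= \delta_{13}E_x + \delta_{23}E_y + \delta_{33}E_z,\\
e_{23} &= \delta_{14}E_x + \delta_{24}E_y + \delta_{34}E_z,\\
e_{31} &= \delta_{15}E_x + \delta_{25}E_y + \delta_{35}E_z,\\
e_{12} &= \delta_{16}E_x + \delta_{26}E_y + \delta_{36}E_z.
\end{aligned}
\qquad (4)
\]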



For the negative pressure components a scheme analogous to (4) is 
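obtained. The quantities γ_ik appear as coefficients of the E_i; for 
quartz we have: 

\[
\begin{aligned}
-P_{11} &= \gamma_{11}E_x, & e_{11} &= \delta_{11}E_x,\\
-P_{22} &= -\gamma_{11}E_x, & e_{22} &= -\delta_{11}E_x,\\
-P_{33} &= 0, & e_{33} &= 0,\\
-P_{23} &= \gamma_{14}E_x, & e_{23} &= \delta_{14}E_x,\\
-P_{31} &= -\gamma_{14}E_y, & e_{31} &= -\delta_{14}E_y,\\
-P_{12} &= -\gamma_{11}E_y, & e_{12} &= -2\delta_{11}E_y.
\end{aligned}
\]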



The position of the co-ordinate system underlying these relations is 
that of fig. 2. For quartz, the numerical values in C.G.S. units (alge- 
braic signs for right-handed quartz) are : 

\[
\begin{aligned}
\gamma_{11} &= 4.77 \times 10^{4}, & \delta_{11} &= 6.36 \times 10^{-8},\\
\gamma_{14} &= 1.23 \times 10^{4}, & \delta_{14} &= -1.69 \times 10^{-8}.
\end{aligned}
\]
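
To give a feeling for the magnitudes involved, the following Python sketch evaluates the strains produced in quartz by a field along the x axis. It assumes that the quartz relations read e_11 = δ_11 E_x, e_22 = -δ_11 E_x, e_23 = δ_14 E_x, e_31 = -δ_14 E_y, e_12 = -2δ_11 E_y, and that the C.G.S. values above are electrostatic units, so that a field given in volts per centimetre must first be converted to statvolts per centimetre. The applied voltage, the plate thickness and the function names are illustrative assumptions.

```python
STATVOLT_IN_VOLTS = 299.792458   # 1 statvolt expressed in volts
DELTA_11 = 6.36e-8               # quartz, C.G.S. units, as quoted in the text
DELTA_14 = -1.69e-8

def field_esu(volts, thickness_cm):
    """Field strength in statvolts per centimetre for a voltage across a plate."""
    return volts / STATVOLT_IN_VOLTS / thickness_cm

def quartz_strain(Ex, Ey):
    """Strain components (e11, e22, e33, e23, e31, e12) under the assumed quartz relations."""
    return (
        DELTA_11 * Ex,
        -DELTA_11 * Ex,
        0.0,
        DELTA_14 * Ex,
        -DELTA_14 * Ey,
        -2.0 * DELTA_11 * Ey,
    )

if __name__ == "__main__":
    Ex = field_esu(3000.0, 0.1)        # assumed: 3000 volts across a plate 1 mm thick
    strain = quartz_strain(Ex, 0.0)
    print("E_x in e.s.u.:", Ex)
    print("strains:", strain)
    print("change of length of a 2 cm bar along y:", strain[1] * 2.0, "cm")
```

Even with several thousand volts the static strains are only of the order of a few parts in a million, which is why the resonance treated in the next section matters so much in practice.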


2. Application of Piezoelectricity to the Stabilization of Oscillating 
Circuits. 

The most important application of piezoelectricity is based on the 
resonance between the frequency of an oscillating circuit and the 
mechanical vibration of a bar or plate cut from a piezocrystal. Let us 
consider the simplest case of a bar-like piece cut from a crystal as shown 
in fig. 2. Suppose that the electric potential is applied to the long sides 
of the bar, so that the field is in the direction of the x axis. As a result 
there will be changes in length in the y direction and at resonance the 
bar will experience strong longitudinal vibrations. We must first 
compute the natural vibration of a bar of length l. This can be done 
in a way quite analogous to the calculation of the vibration of a string 
carried out on p. 183. Here again we can treat the natural vibration 
as a standing wave resulting from the superposition of two waves 
travelling in opposite directions. The speed of propagation is given 
(p. 183) by the square root of the ratio of the elastic constant to the 
density. For the crystal, we must use the elastic modulus correspond- 
ing to the direction of the length of the bar. The boundary conditions 
here are just the opposite of those obtaining for the string. The greatest 
amplitude occurs at the free ends of the bar. This result may be de- 
duced formally, but is also intuitively evident, since at the ends the 
motion of the bar is completely unhindered, whereas in the case of the 
vibrating string the opposite is true. Thus the fundamental vibration 
of the bar must have a loop at each end and a node at the centre, so 
that — as for the string — the length of the bar is half the wave-length 
and the frequency of the fundamental mode is 


\[ \nu_0 = \frac{v}{\lambda} = \frac{\sqrt{E/\rho}}{2l}. \]
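
A short numerical example shows the order of magnitude of this frequency. The Python sketch below evaluates √(E/ρ)/(2l); the elastic modulus and density are assumed round values of the order of those of quartz in C.G.S. units, and the bar length is arbitrary, so the result is illustrative only.

```python
import math

def fundamental_frequency(E, rho, l):
    """Fundamental longitudinal frequency of a free bar: sqrt(E/rho) / (2 l)."""
    return math.sqrt(E / rho) / (2.0 * l)

if __name__ == "__main__":
    E_ASSUMED = 7.8e11     # dyn/cm^2, assumed elastic modulus along the bar
    RHO_ASSUMED = 2.65     # g/cm^3, assumed density
    for length_cm in (0.5, 1.0, 2.0):
        nu = fundamental_frequency(E_ASSUMED, RHO_ASSUMED, length_cm)
        print(length_cm, "cm ->", nu, "vibrations per second")
```

With these assumed values a bar a few centimetres long vibrates at roughly a hundred thousand cycles per second, and halving the length doubles the frequency.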



An important characteristic of any standing wave is that all par- 
ticles belonging to a given loop vibrate in the same phase, so that in 
what follows the system may be replaced by two masses, located at the 
ends of the bar, each of half the bar’s mass. These must be imagined 
to vibrate in opposite phase and be subject to a restoring force of such 
magnitude as to reproduce the natural frequency of the bar. We can 
then apply the much simpler relations for the forced vibrations of a 
particle (p. 96) to the motion of either half of the bar. If there were 
no damping at all, the amplitude at resonance would become infinite. 
In reality, however, there is always at least a small amount of damping 
present, due both to the resistance of the air at the surface of the bar 
and to a certain amount of internal friction. The latter is not well 
understood in terms of atomic theory even for single crystals. It is 
quite possible that it might vanish altogether for a perfect crystal 
lattice at absolute zero. 

For what follows, the important thing is the phase relation of the 
driving force to the vibrations of the bar, or of the vibrations of the 
two particles which replace it in the above sense. The phase relation 
is given by fig. 7, p. 99. A very graphic representation is obtained by 
combining both curves of the figure into a single one by the expedient 
of expressing the amplitude a of either particle as a function of the 
phase angle φ. For this purpose, eliminate the quantity ω_0² - ω² from 
equations (33") and (33"'), p. 98, by using the relation 

\[ \sin^2\phi = \frac{\tan^2\phi}{1 + \tan^2\phi}. \qquad (7) \]

In the notation of p. 98, the result is the polar equation 

\[ a = \frac{F_0}{\rho\omega}\,\sin\phi. \qquad (8) \]


If the damping constant is small, as it is for quartz, the region 
in which the amplitude differs markedly from zero is exceedingly small. 
As a result, ω may be taken to be constant in this region, and so equa- 
tion (8) will represent a circle. Since the displacement lags behind the 
driving force, the circle must be drawn below the polar axis if the 
driving force F is drawn along this axis, as shown in fig. 3. This circle 
then represents the index diagram for the periodic quantities F and u, 
where the latter represents the displacement of one of the particles. 
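
The polar relation between amplitude and phase can be verified directly from the steady-state solution of a damped, driven particle. The Python sketch below assumes the equation of motion m u'' + ρ u' + m ω_0² u = F_0 cos ωt, with ρ denoting the damping constant; this form, the sample values and the function names are assumptions standing in for the notation of p. 98, which is not reproduced here.

```python
import math

def amplitude_and_phase(omega, omega0, m, rho, F0):
    """Steady-state amplitude a and phase lag phi of the assumed driven oscillator."""
    stiffness_term = m * (omega0 ** 2 - omega ** 2)
    damping_term = rho * omega
    a = F0 / math.hypot(stiffness_term, damping_term)
    phi = math.atan2(damping_term, stiffness_term)   # between 0 and pi
    return a, phi

def polar_amplitude(omega, phi, rho, F0):
    """Amplitude from the polar equation a = F0 sin(phi) / (rho * omega)."""
    return F0 / (rho * omega) * math.sin(phi)

if __name__ == "__main__":
    m, rho, F0, omega0 = 1.0, 0.01, 1.0, 1.0          # assumed, arbitrary units
    for omega in (0.98, 0.995, 1.0, 1.005, 1.02):
        a, phi = amplitude_and_phase(omega, omega0, m, rho, F0)
        print(omega, a, polar_amplitude(omega, phi, rho, F0), phi)
```

Because ω changes by only a small fraction across the resonance when the damping is small, the factor F_0/(ρω) is nearly constant there and the points (a, φ) fall very nearly on a circle of diameter F_0/(ρω_0).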



If an electrical potential of the form V = V_0 e^{iωt} is applied, the driving 
force is given by 



where b is the width and c the thickness of the bar (refer to fig. 2). 
The resonance amplitude becomes 


®max 




Since the damping depends on the electrodes as well as on small in- 
homogeneities of the crystal, ρ is not to be regarded as a material con- 
stant. Rather, the width of the resonance curve is used to characterize 
the damping of a given crystal. 

The polarization P follows the deformation u without lag, so that 
P has the same direction as u in the index diagram. The displacement 
current is given by dP/dt = iωP, so that it leads u by π/2. If the bar 
is allowed to pass along the resonance curve by increasing the driving 
frequency from zero, then u will have the same direction as V at low 
frequencies, but the index of the displacement current will be in the 
direction of iωV. Thus the crystal behaves like a pure capacitance, 
where the current leads the voltage by π/2 (p. 318). As the resonance 
point is approached, the magnitude of u increases greatly while its 
index changes to a position π/2 behind V as the resonance point is 
traversed. At the same time, the direction of the index of the displace- 
ment current comes into coincidence with that of V. The essential 
result, then, is that at resonance the system behaves like a small ohmic 
resistance, small because the index of the displacement current 
becomes very large. The capacitance is effectively short-circuited, i.e. 
it has become infinitely large. When the resonance point is passed, 
u falls even farther behind V, the displacement current falls behind V, 
and the system behaves like an inductance or, what is formally the 
same thing according to equation (23), p. 318, like a negative capaci- 
tance. 

In these considerations we have neglected the usually small capacity 
of the quartz condenser and the displacement current determined by 
it, since at the critical condition the latter is small compared with the 
piezoelectric displacement current. Imagine a quartz vibrator con- 
nected in parallel with the condenser of an oscillating circuit, the latter 
being variable so that the circuit can be tuned to any impressed fre- 
quency (fig. 4). In the absence of the quartz crystal, the capacitance C 
in the relation 

\[ \omega_0^2 = \frac{1}{LC} \]

would have to be made smaller and smaller as the frequency is increased, 
and the curve connecting C and ω_0² will be an ordinary hyperbola. 



Fig. 4 



However, as the resonance point of the crystal is approached, the 
capacitance of the condenser is increased by that of the quartz, so 
that C must be lower than it would be in the absence of the crystal. 
At resonance the capacitance of the crystal would become infinite, 
making it necessary to give C a negative value large enough to make 
the sum correspond to ω_0. 

In practice the ideal conditions are not fulfilled and the capacitance 
of the crystal, while large, remains finite. The further course of the 
curve follows from the same considerations (see fig. 5). The stabilizing 
effect is explained in this way: In order that ω_0² change but little in 
the neighbourhood of the quartz resonance, C must change consider- 
ably. In other words, a small change in the value of C — as a result of, 
say, a change in temperature — will have practically no effect on the 
natural frequency of the system. 
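
The stabilizing effect can be made quantitative with a simple model. The Python sketch below treats the crystal, as an assumption not made in the text, as a lossless series-resonant branch whose effective capacitance is C_1/(1 - ω²/ω_q²): it grows without limit as the crystal resonance ω_q is approached from below and is negative above it. The condenser setting C that makes ω the natural frequency then follows from ω² L (C + C_crystal) = 1. All numerical values and names are hypothetical.

```python
import math

L_COIL = 1.0e-3                     # inductance in henries (assumed)
OMEGA_Q = 2.0 * math.pi * 1.0e5     # crystal resonance in rad/s (assumed)
C_MOTIONAL = 1.0e-12                # strength of the crystal branch in farads (assumed)

def crystal_capacitance(omega):
    """Effective capacitance of the assumed lossless series-resonant branch."""
    return C_MOTIONAL / (1.0 - (omega / OMEGA_Q) ** 2)

def tuning_capacitance(omega, with_crystal=True):
    """Condenser setting C for which omega is the natural frequency of the circuit."""
    c_total = 1.0 / (L_COIL * omega ** 2)
    return c_total - (crystal_capacitance(omega) if with_crystal else 0.0)

def dC_domega(omega, with_crystal=True):
    """Slope of the tuning curve by a central difference."""
    h = 1.0e-6 * omega
    upper = tuning_capacitance(omega + h, with_crystal)
    lower = tuning_capacitance(omega - h, with_crystal)
    return (upper - lower) / (2.0 * h)

def stabilization_factor(omega):
    """Factor by which C must change more, with the crystal, for the same frequency change."""
    return abs(dC_domega(omega, True) / dC_domega(omega, False))

if __name__ == "__main__":
    for fraction in (0.5, 0.9, 0.99, 0.999):
        print(fraction, stabilization_factor(fraction * OMEGA_Q))
```

Far from the crystal resonance the factor is practically unity; close to it the tuning curve becomes very steep, so that a given drift in C shifts the frequency far less than it would without the crystal.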



CHAPTER XLV 


Space-charge Effects in Gaseous Discharges 


1. Characteristic Curve of a Thermionic Tube. 

Let us consider an extended cathode, which emits electrons, placed 
opposite an extended anode, and let us inquire about the relation 
between current and voltage as qualitatively represented by fig. 1 on 
p. 433. Because of the indefinite extent of the electrodes, the problem 
becomes one-dimensional and the results will depend only on a single 
co-ordinate x. Further, we may neglect the initial thermal motions of 
the electrons, whose speeds have approximately a Maxwell distribution 
(p. 595). With this assumption, the current should be expected to 
begin only when the second electrode is made positive, while in fact 
a minute current is found to flow even when there is a small opposing 
potential. 

Under the above assumptions, the values of the potential between 
the electrodes will be investigated. It will no longer be linear, but will 
be distorted by the presence of the space charge. We have the follow- 
ing equations at our disposal: The speed of the electrons at any place 
where the potential with respect to the cathode is V is given by 

\[ \tfrac{1}{2}mv^2 = eV. \qquad (1) \]


If ρ represents the charge density, the relation given on p. 418 yields 
for the current density* 

\[ i = \rho v. \qquad (2) \]


Space charge and potential are connected by the relation (p. 267) 

\[ \Delta V = 4\pi\rho. \qquad (3) \]

The reason for the positive sign of the right side of the equation will 
be understood from the footnote. 

The current density i is constant throughout the space because we 
assume the situation to be independent of any but the x co-ordinate 
and the current is constant at all points in the circuit. With the help 
of (1) and (2) we can eliminate ρ and v, and obtain a differential equation 
for V: 

\[ \frac{d^2V}{dx^2} = 4\pi i \left(\frac{m}{2e}\right)^{1/2} V^{-1/2}. \]

* We here take the positive direction of the current to be that of the electron motion. 



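Multiplying both sides by dV/dx allows us to integrate at once, and 
if the potential of the cathode is arbitrarily taken to be zero, the result 
is 

\[ \left(\frac{dV}{dx}\right)^2 - \left(\frac{dV}{dx}\right)_0^2 = 8\pi i \left(\frac{2m}{e}\right)^{1/2} V^{1/2}. \]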



Under the assumed condition of zero initial speed of the electrons 
there would be no current if the potential of the cathode were to de- 
crease, for then all the electrons would be driven back. On the other 
hand, if (dV/dx)_0 > 0, all the emitted electrons would arrive at the 
anode and this would constitute the saturation current. Hence the 
only possibility is to set (dV/dx)_0 = 0 for the region in question. Ac- 
tually, because of the initial speed of the electrons, the lowest point of 
the potential curve lies somewhat in front of the cathode, but the above 
assumption greatly simplifies the computation and does not alter the 
final result markedly. With this condition assumed, we get 

\[ \frac{dV}{dx} = (8\pi i)^{1/2} \left(\frac{2m}{e}\right)^{1/4} V^{1/4}, \]



and by separating the variables we can integrate a second time, obtain- 
ing 

\[ \tfrac{4}{3}\,V^{3/4} = (8\pi i)^{1/2} \left(\frac{2m}{e}\right)^{1/4} x \]

or 

\[ V \propto x^{4/3}. \]



The constant of integration is zero because the potential was assumed 
to be zero at the cathode. The relation gives the variation of potential 
between the electrodes, as influenced by the space charge. If V_a is the 
anode potential and a is the separation of the electrodes, this becomes 

\[ i = \frac{\sqrt{2}}{9\pi}\,\sqrt{\frac{e}{m}}\;\frac{V_a^{3/2}}{a^2}. \]
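
The space-charge law i = (√2/9π) √(e/m) V_a^(3/2)/a² is easily evaluated, and the x^(4/3) potential can be checked against the differential equation d²V/dx² = 4πi √(m/2e) V^(-1/2). The Python sketch below does both in electrostatic C.G.S. units; the physical constants are standard values, while the function names, the sample voltage and the electrode spacing are assumptions made for the illustration.

```python
import math

E_ESU = 4.80320e-10                   # electron charge in e.s.u.
M_GRAM = 9.10938e-28                  # electron mass in grams
STATVOLT_IN_VOLTS = 299.792458        # 1 statvolt in volts
ESU_PER_COULOMB = 2.99792458e9        # statcoulombs in one coulomb

def space_charge_current_density(V_a, a):
    """Current density in e.s.u./(s cm^2) for V_a in statvolts and separation a in cm."""
    return math.sqrt(2.0) / (9.0 * math.pi) * math.sqrt(E_ESU / M_GRAM) * V_a ** 1.5 / a ** 2

def potential(x, V_a, a):
    """Potential between the electrodes, V = V_a (x/a)^(4/3)."""
    return V_a * (x / a) ** (4.0 / 3.0)

def poisson_residual(x, V_a, a, h=1.0e-4):
    """Relative mismatch between V'' (central difference) and 4 pi i sqrt(m/2e) / sqrt(V)."""
    i = space_charge_current_density(V_a, a)
    lhs = (potential(x + h, V_a, a) - 2.0 * potential(x, V_a, a) + potential(x - h, V_a, a)) / h ** 2
    rhs = 4.0 * math.pi * i * math.sqrt(M_GRAM / (2.0 * E_ESU)) / math.sqrt(potential(x, V_a, a))
    return abs(lhs - rhs) / rhs

if __name__ == "__main__":
    V_a = 100.0 / STATVOLT_IN_VOLTS    # assumed: 100 volts on the anode
    i_esu = space_charge_current_density(V_a, 1.0)
    print("amperes per cm^2:", i_esu / ESU_PER_COULOMB)
    print("residual of the potential equation:", poisson_residual(0.5, V_a, 1.0))
```

For 100 volts across 1 cm this gives a few milliamperes per square centimetre, and doubling the voltage raises the current by the factor 2^(3/2).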




This well-known "three-halves power law" cannot hold, of course, 
for arbitrarily large currents. As soon as i reaches the saturation value 
there is no further increase, since electrons are being liberated from 
the cathode at their maximum rate. If the initial electron speed had 
not been neglected, the V^{3/2} curve would not appear to change sharply 
over to the saturation line but would have the rounded junction found 
in practice and pictured on p. 433. 

2. Oscillations of a Plasma. 

A plasma is a gas having no average space charge and located in an 
electric field. An example is the positive column of a discharge tube, 
where the linear variation of the potential (see fig. 3, p. 437) testifies 
to the absence of space charge. A plasma, then, is a mixture of elec- 
trons, ions, and normal and excited atoms. High-frequency oscillations 
can take place in a plasma, as will be shown for the one-dimensional case. 

Let the field strength be E 0 and the number of electrons per cubic 
centimetre n 0 in the non-oscillating plasma, and call these same quan- 
tities E and n, respectively, when oscillations are taking place. Since 
the current is practically all carried by the electrons (cf. p. 439), we 
need only consider the motion of these particles. The following re- 
lations are at hand: 


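1. The equation of motion of the electron: 

\[ m\,\frac{dv}{dt} = -eE. \qquad (9) \]

2. The space-charge equation: 

\[ \frac{\partial E}{\partial x} = -4\pi e\,(n - n_0). \qquad (10) \]

3. The equation of continuity for the electron flow, which corre- 
sponds to the hydrodynamical equation (cf. p. 196): 

\[ \frac{\partial (nv)}{\partial x} = -\frac{\partial n}{\partial t}. \qquad (11) \]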


4. The first field equation, which we write in the form given on 
p. 466, with the sum of the "displacement" and convection 
currents designated by S (scalar value S): 

\[ (c\,\mathrm{curl}\,\mathbf{H})_x = 4\pi S = \frac{\partial E}{\partial t} - 4\pi e n v. \qquad (12) \]

Actually, (11) is implicit in (12). Since the divergence of 
4πS = c curl H vanishes, we must have 

\[ 0 = \frac{\partial^2 E}{\partial t\,\partial x} - 4\pi e\,\frac{\partial (nv)}{\partial x}. \]

On the other hand, it follows by differentiating (10) with respect to t 
that 

\[ \frac{\partial^2 E}{\partial t\,\partial x} = -4\pi e\,\frac{\partial n}{\partial t}, \]

and by equating, we again obtain the equation of continuity. 

A vibrational differential equation is obtained for the electron 
velocity as follows: Multiply (10) by v and add the resulting equation 
to (12). Then, taking account of the meaning of the differential pro- 
cesses d/dt and ∂/∂t (cf. p. 195), there results 

\[ \frac{\partial E}{\partial t} + v\,\frac{\partial E}{\partial x} = \frac{dE}{dt} = 4\pi S + 4\pi e n_0 v. \]



Using (9), we get 

\[ \frac{d^2v}{dt^2} + \frac{4\pi n_0 e^2}{m}\,v = -\frac{4\pi e S}{m}. \]



Now S is not only constant in space but in time as well. For since 
the applied field E - E_0 is due to charges, and not to changing magnetic 
fields, curl E = 0. But the field equations give 

\[ \mathrm{curl}\,\mathrm{curl}\,\mathbf{E} = -\frac{4\pi}{c^2}\,\frac{\partial \mathbf{S}}{\partial t}, \]


so that for curl E = 0, ∂S/∂t must also be zero. The particular integral 

\[ v = -\frac{S}{n_0 e} \]

represents the mean electron current on which is superposed the 
electron oscillations given by the general integral 

\[ v = A\sin(\omega t - \alpha), \quad\text{where}\quad \omega = \sqrt{\frac{4\pi n_0 e^2}{m}}. \qquad (16) \]


This simple and clear deduction of the oscillations of a plasma is due 
to Rudolf Seeliger. 
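
The frequency of these oscillations, ω = √(4π n_0 e²/m), depends only on the electron density. The Python sketch below evaluates it in C.G.S. units for a few densities; the constants are standard values, while the chosen densities and the function names are illustrative assumptions.

```python
import math

E_ESU = 4.80320e-10      # electron charge in e.s.u.
M_GRAM = 9.10938e-28     # electron mass in grams

def plasma_angular_frequency(n0):
    """omega = sqrt(4 pi n0 e^2 / m) for n0 electrons per cubic centimetre."""
    return math.sqrt(4.0 * math.pi * n0 * E_ESU ** 2 / M_GRAM)

def electron_velocity(t, S, n0, A, alpha):
    """Mean drift -S/(n0 e) with the oscillation A sin(omega t - alpha) superposed."""
    omega = plasma_angular_frequency(n0)
    return -S / (n0 * E_ESU) + A * math.sin(omega * t - alpha)

if __name__ == "__main__":
    for n0 in (1.0e8, 1.0e10, 1.0e12):      # assumed electron densities per cm^3
        omega = plasma_angular_frequency(n0)
        print(n0, omega, omega / (2.0 * math.pi))
```

Since the frequency varies as the square root of the density, a hundredfold increase in n_0 raises it tenfold; for 10^10 electrons per cubic centimetre it is of the order of 10^9 cycles per second.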



CHAPTER XLVI 


Theory of Elastomers 

1. Behaviour of High-polymer Chain Molecules. 

As a class, the rubber-like solid substances differ greatly from other 
solids in their mechanical and thermodynamic behaviour. In the 
phenomenon of expansion, for instance, one must look to quite different 
causes for an explanation. Apart from the fact that 
rubber-like substances are more resilient by many orders of magnitude 
than are other solids, their isobaric coefficient of expansion is negative.* 
Characteristic of all rubber-like materials is the fact that chemically 
they are high polymers consisting of long chain molecules made up of 
thousands of identical units. An example is polyacrylic acid, a portion 
of whose structural formula is shown in fig. 1. 

Fig. 1 (portion of the polyacrylic acid chain: ... -CH2-CH(COOH)-CH2-CH(COOH)- ...) 

The molecule is so long 
that probability considerations can be applied even to the parts of a 
single molecule. Because of the rotational freedom allowed by the 
simple C-C bond, two adjoining units can, within limits, turn freely 
with respect to each other. The limiting angle is determined by the 
structure of the units themselves. After a large number n of units, the 
net effect of the many arbitrarily positioned units is such as to make all 
orientations of the nth member equally probable. The corresponding 
length is called the statistical chain element A. Thus the giant molecule 
is schematized as a chain of N such segments whose members may be 
oriented arbitrarily with respect to one another. The internal energy 
of the molecule is consequently independent of the form of the chain, 
which is governed only by the laws of probability. If a perfectly flexible 
chain is thrown on to a table, it will usually end up in an irregular 
coiled configuration; the chance of its assuming the fully-extended 
position is extremely small. We shall characterize the form solely by 
specifying the straight-line distance between the two ends, which we 
designate h (fig. 2). If the beginning of the chain is held fast, a proba- 
bility calculation gives the following expression for the chance of 
finding the end of a chain consisting of N segments somewhere within a 
spherical shell whose radii are h and h + dh : 


$$W(h)\,dh = K\,e^{-3h^2/(2NA^2)}\,h^2\,dh \qquad (1)$$

The approximate nature of this expression will be realized when it is 
seen that values greater than NA, the fully-extended length, are not 
excluded, although their probability is given as very small. However, 
this expression is adequate for our purposes. Suppose a force F is 
applied to the end of the chain in the direction AB (fig. 2). If this 

force succeeds in increasing h by an amount dh, the work done will be F dh. According to the first and second laws of thermodynamics, we have for a reversible, isothermal change of state

$$T\,dS + F\,dh = dU. \qquad (2)$$

However, since the internal energy U is independent of h, the right member of the equation is zero and the force is given by

$$F = -T\left(\frac{\partial S}{\partial h}\right)_T. \qquad (3)$$

In order to compute the entropy, we need only apply the Boltzmann 
equation to the probability formula. The probability that the length of 
the line AB, drawn in the direction of F, shall be between h and h + dh 
is proportional to 

$$e^{-3h^2/(2NA^2)}\,dh.$$

The factor $4\pi h^2$ does not enter, since the end of the chain is no longer considered to move on the surface of a sphere, but is restricted to the direction of F. We then get

$$S = -\frac{3kh^2}{2NA^2} + \text{const} \qquad (4)$$

and

$$F = \frac{3kTh}{NA^2}.$$



Thus the stretching force F corresponding to a given h increases with 
the temperature, or the equilibrium length h corresponding to a given 
tension F decreases with increasing temperature. We saw (p. 575) that
for a crystalline solid the elastic forces are determined by electrical
forces between the molecules. For an elastomer, on the other hand, 
the elasticity is caused only by thermal motion which makes a coiled 
configuration of the molecule more probable than an extended one. 
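
The entropy-spring result can be checked numerically. The following is only a minimal sketch: the function names and the sample values of N, A and T are illustrative assumptions, not taken from the text. It evaluates the entropy (4) with the constant set to zero, the force $F = 3kTh/(NA^2)$, and the most probable end-to-end distance, which follows from maximizing (1): $h^2 = 2NA^2/3$.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def entropy(h, n_seg, seg_len):
    """Entropy of the chain from equation (4), additive constant set to zero."""
    return -3.0 * K_B * h**2 / (2.0 * n_seg * seg_len**2)


def force(h, n_seg, seg_len, temp):
    """Stretching force F = 3 k T h / (N A^2) obtained from equations (3) and (4)."""
    return 3.0 * K_B * temp * h / (n_seg * seg_len**2)


def most_probable_h(n_seg, seg_len):
    """Maximum of W(h) ~ h^2 exp(-3 h^2 / (2 N A^2)) from equation (1)."""
    return seg_len * math.sqrt(2.0 * n_seg / 3.0)


if __name__ == "__main__":
    # Hypothetical chain: 1000 segments of 1 nm each at two temperatures.
    n_seg, seg_len = 1000, 1.0e-9
    for temp in (300.0, 350.0):
        print(temp, force(5.0e-9, n_seg, seg_len, temp))
    print(most_probable_h(n_seg, seg_len) / (n_seg * seg_len))
```

The force at fixed h scales linearly with T, which is the statement that the tension of a stretched elastomer increases on heating.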

Now it is obviously not possible to experiment with an individual 
molecule. A collection of such giant molecules is not a piece of solid 
rubber but a viscous liquid, since the individual molecules are free to 
move relative to each other. This is true for the latex and for the 
sticky raw rubber precipitated from it. Only by the introduction 
of sulphur atoms (vulcanization) are the individual molecules tied 
together, permitting us to apply— at least qualitatively— the results 
obtained for an individual molecule to finite pieces of technically-pro- 
duced rubber. 


2. A Model of Muscular Action. 

In the contraction of a muscle, nature is able to do something that 
we have succeeded in doing only indirectly — the immediate conversion 
of chemical into mechanical energy. Since the amount of free energy 
thus convertible differs but little from the total energy (cf. p. 553), 
the efficiency of the process is very high. In the laboratory we can 
convert the free energy of a galvanic cell into electrical energy which, 
in principle, could be converted in turn with no loss into mechanical 
energy. Unfortunately, however, we have not been able to find an 
electrochemical process for using our greatest source of energy — the 
oxidation of carbon— for the direct generation of electricity. We are 
forced to use the roundabout means of heat engines, with their notoriously low efficiencies. In the light of such considerations, the action of animal muscle, which up to now has been quite mysterious, is all the more remarkable. The growing understanding of the nature of giant molecules (proteins belong to this category) offers the possibility of explaining the situation. It has made possible the description of a model for the process whereby chemical energy is converted into mechanical energy.

If the molecular segments mentioned above are now considered to carry electric charges, the internal energy will no longer be independent of h, and the distribution of lengths will be fundamentally changed. The presence of the electrostatic forces acts to favour the extended configurations in which the charges are as widely separated as possible. Such electrically-charged groups occur when the sodium salt of polyacrylic acid (COONa) is dissolved in water. The sodium ion Na+ dissociates from the COONa group to some extent; and instead of the COOH groups occurring in the acid, charged COO- groups are present.

A computation yields the following results: For uncharged molecules the commonest value of h is about 0.08 of the fully-extended length; for charged molecules this increases to 0.7 of the maximum length. Suppose the molecule is in
a solution whose concentration of H+ ions (or of OH- ions) can be changed. 
If such a change occurs, the law of mass action will bring about a change 
in the degree of dissociation of the COOH group, and the molecule will 
thus be either shortened or extended — the former if there is an excess 
of H + . In the process, work can be performed by lifting a weight, the 
energy coming from the chemical energy of neutralization. This model 
of the muscular process is found to correspond with experience at many 
points. For example, it correctly predicts that each cubic centimetre of 
the dry substance should be able to lift about 70,000 times its own 
weight. 



MATHEMATICAL ADDENDUM 


Some Functions Occurring in Connexion with the 

Wave Equation 


1. Generalization of x! for any Value of x.

In the following development we set ourselves the task of finding a function of x which shall represent the value of a factorial for any kind of x, including complex values, although this function was originally defined only for x a positive integer. This function is called $\Pi(x)$; it is often designated also as $\Gamma(x + 1)$. It is characterized by the relation

$$\Pi(x) = x\,\Pi(x - 1). \qquad (1)$$

For positive, real values of x this condition is satisfied by the integral

$$\Pi(x) = \Gamma(x + 1) = \int_0^\infty t^x e^{-t}\,dt; \qquad (2)$$

for, integrating by parts,

$$\int_0^\infty t^x e^{-t}\,dt = -\left[t^x e^{-t}\right]_0^\infty + x\int_0^\infty t^{x-1} e^{-t}\,dt, \qquad (3)$$

and this is equal to $x\,\Pi(x - 1)$. Now $\Pi(0) = 1$, as may be seen by setting x equal to one in equation (1). Hence repeated application of (1) gives, for integral values of the variable,

$$\Pi(x) = \Gamma(x + 1) = 1\cdot 2\cdot 3\cdot 4 \cdots x = x!$$


The integral does not converge when x is real and not greater than $-1$, or when it is complex with real part not greater than $-1$. For this reason Gauss supplied a generally valid definition of the function $\Pi(x)$ in the form

$$\Pi(x) = \lim_{n\to\infty} \frac{n^x\, n!}{(1 + x)(2 + x)(3 + x)\cdots(n + x)}. \qquad (4)$$

Obviously, from this definition, $\Pi(x)$ will be infinite for any integral negative value of x. In order to show the identity of both definitions for positive x, consider the integral

$$J_s(x) = \int_0^1 t^{x+s}(1 - t)^{n-s}\,dt. \qquad (5)$$

When s = n this becomes

$$J_n(x) = \int_0^1 t^{x+n}\,dt = \frac{1}{x + n + 1}. \qquad (6)$$

Integrating (5) by parts,

$$J_s = \frac{n - s}{s + x + 1}\,J_{s+1}. \qquad (7)$$

Repeated application of this recurrence formula yields

$$J_0 = \int_0^1 t^x (1 - t)^n\,dt = \frac{J_0}{J_1}\cdot\frac{J_1}{J_2}\cdot\frac{J_2}{J_3}\cdots\frac{J_{n-1}}{J_n}\,J_n = \frac{n!}{(1 + x)(2 + x)\cdots(n + x)}\cdot\frac{1}{n + x + 1},$$

whence

$$\frac{n^x\, n!}{(1 + x)(2 + x)\cdots(n + x)} = n^x (n + x + 1)\int_0^1 t^x (1 - t)^n\,dt.$$

Setting t = u/n in the integral,

$$\frac{n^x\, n!}{(1 + x)(2 + x)\cdots(n + x)} = \frac{n + x + 1}{n}\int_0^n u^x\left(1 - \frac{u}{n}\right)^n du.$$

According to the definition of e as the limit of $[1 + (1/n)]^n$ for n approaching infinity, we have in the limit

$$\lim_{n\to\infty} \frac{n^x\, n!}{(1 + x)(2 + x)\cdots(n + x)} = \int_0^\infty u^x e^{-u}\,du,$$

whereby the agreement of the two definitions for positive values of x is proved.
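
The agreement of the two definitions can also be seen numerically. The sketch below is illustrative only (the name `gauss_product` is an assumption of this example): it evaluates the finite-n expression of definition (4) in logarithms, for real x > -1, and compares it with the standard-library gamma function, which corresponds to $\Gamma(x + 1) = \Pi(x)$.

```python
import math


def gauss_product(x, n):
    """Finite-n value of n^x n! / ((1+x)(2+x)...(n+x)) for real x > -1.

    The product is accumulated in logarithms to avoid overflow of n!.
    """
    log_value = x * math.log(n)
    for j in range(1, n + 1):
        log_value += math.log(j) - math.log(j + x)
    return math.exp(log_value)


if __name__ == "__main__":
    for x in (-0.5, 0.5, 3.0):
        for n in (10, 1000, 100000):
            print(x, n, gauss_product(x, n), math.gamma(x + 1.0))
```

The convergence is slow (the error falls off roughly as 1/n), so the limit form is of theoretical rather than computational interest.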

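It follows from (4) that

$$\Pi(-x)\,\Pi(x - 1) = \lim_{n\to\infty} \frac{n^{-1}(n!)^2}{x\,(1 - x^2)(2^2 - x^2)\cdots\left((n - 1)^2 - x^2\right)(n - x)} = \lim_{n\to\infty} \frac{1}{x\left(1 - \dfrac{x^2}{1^2}\right)\left(1 - \dfrac{x^2}{2^2}\right)\left(1 - \dfrac{x^2}{3^2}\right)\cdots\left(1 - \dfrac{x^2}{(n - 1)^2}\right)}.$$

The denominator, however, is merely the familiar infinite product expression for $\sin \pi x/\pi$, whence

$$\Pi(-x)\,\Pi(x - 1) = \frac{\pi}{\sin \pi x}.$$

In particular, putting x = 1/2,

$$\Pi(-\tfrac{1}{2}) = \sqrt{\pi}. \qquad (9)$$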


The expression $\Psi(x)$ is conventionally used to represent the logarithmic derivative of the $\Pi$ function. It follows from equation (1) that

$$\Psi(x) = \frac{1}{x} + \Psi(x - 1). \qquad (10)$$

Values of this function may be found in tables. According to the definition (4),

$$\Psi(x) = \frac{d \log \Pi(x)}{dx} = \lim_{n\to\infty}\left(\log n - \frac{1}{1 + x} - \frac{1}{2 + x} - \cdots - \frac{1}{n + x}\right). \qquad (11)$$

In particular,

$$\Psi(0) = \lim_{n\to\infty}\left(\log n - 1 - \frac{1}{2} - \frac{1}{3} - \cdots - \frac{1}{n}\right). \qquad (12)$$

The negative of the limiting value of the parentheses is called the Euler-Mascheroni constant C; its numerical value is 0.5772 .... If an integer m is substituted for x in (11), the subtracted terms from 1 to m will be missing in (12) with the result that

$$\Psi(m) = -C + 1 + \frac{1}{2} + \frac{1}{3} + \cdots + \frac{1}{m}. \qquad (13)$$

In the neighbourhood of a negative integral value of x the function $\Pi(x)$ goes to infinity as $1/\epsilon$ when the distance $\epsilon$ of x from the integer goes to zero, and so its derivative tends to infinity as $1/\epsilon^2$. Hence $\Psi(x)/\Pi(x)$ will be finite, and its value may be deduced as follows: We have, from (1) and (10),

$$\frac{\Psi(x)}{\Pi(x)} = \frac{(x + 1)\left[\Psi(x + 1) - \dfrac{1}{x + 1}\right]}{\Pi(x + 1)},$$

from which we get, by repeated application,

$$\frac{\Psi(x)}{\Pi(x)} = \frac{(x + 1)(x + 2)\cdots(x + m)\left[\Psi(x + m) - \dfrac{1}{x + 1} - \dfrac{1}{x + 2} - \cdots - \dfrac{1}{x + m}\right]}{\Pi(x + m)}.$$

When x approaches $-m$, all terms of the numerator vanish except the one multiplied by $1/(x + m)$. Thus, on account of the fact that $\Pi(0) = 1$,

$$\lim_{x\to -m} \frac{\Psi(x)}{\Pi(x)} = (-1)^m\,\Pi(m - 1). \qquad (14)$$
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2. Bessel Functions of the First Kind. 

(a) The Bessel differential equation

The wave equation

$$\Delta u = \frac{1}{c^2}\frac{\partial^2 u}{\partial t^2} \qquad (15)$$

is transformed in the usual way, by substitution of a periodic solution

$$u = v(x, y, z)\,e^{i\omega t} \qquad (16)$$

into the equation

$$\Delta v + k^2 v = 0 \qquad (17)$$

where $k = \omega/c = 2\pi/\lambda$. We now investigate the case of cylindrical waves, which are independent of the z co-ordinate. Then, according to Ex. 22, p. 43, the relation between the two remaining co-ordinates $\rho$ and $\phi$ will be

$$\frac{\partial^2 v}{\partial \rho^2} + \frac{1}{\rho}\frac{\partial v}{\partial \rho} + \frac{1}{\rho^2}\frac{\partial^2 v}{\partial \phi^2} + k^2 v = 0. \qquad (18)$$

Assume the solution $v(\rho, \phi)$ to have the form of a product of $R(\rho)$ and $\Phi(\phi)$. In general, only those solutions which are single-valued functions of position are physically valid, so $\Phi$ must be periodic with a period of $2\pi$. Therefore we set

$$\Phi(\phi) = e^{ip\phi} \qquad (19)$$

where generally, in physical problems, p is an integer. In some cases, however, non-integral values of p come into consideration, so that in what follows we shall assume p to be an arbitrary positive or negative number. Writing for convenience $x = k\rho$ and y(x) in place of R, we get the equation

$$x^2\frac{d^2 y}{dx^2} + x\frac{dy}{dx} + (x^2 - p^2)\,y = 0. \qquad (20)$$

This is the Bessel differential equation. Apart from certain special values of p, its solution is not expressible in terms of elementary functions.

(b) Solution by power series

We substitute the power series

$$y = a_1 x^{m_1} + a_2 x^{m_2} + a_3 x^{m_3} + \cdots + a_r x^{m_r} + \cdots \qquad (21)$$

into the differential equation, getting

$$(m_1^2 - p^2)a_1 x^{m_1} + (m_2^2 - p^2)a_2 x^{m_2} + (m_3^2 - p^2)a_3 x^{m_3} + \cdots + a_1 x^{m_1+2} + a_2 x^{m_2+2} + a_3 x^{m_3+2} + \cdots = 0.$$

For this to vanish identically, terms with like powers of x must cancel in pairs, which requires

$$m_2 = m_1 + 2, \qquad m_3 = m_2 + 2, \qquad \ldots \qquad (22)$$

Since the term with the smallest exponent appears only once,

$$m_1^2 = p^2, \qquad m_1 = \pm p. \qquad (23)$$

Let us examine first the solution +p. From (22),

$$m_2 = p + 2, \qquad m_3 = p + 2\cdot 2, \qquad m_r = p + (r - 1)\,2.$$

We then obtain the coefficients

$$a_2 = -\frac{a_1}{m_2^2 - p^2} = -\frac{a_1}{p^2 + 4p + 4 - p^2} = -\frac{a_1}{2^2(p + 1)},$$

$$a_3 = -\frac{a_2}{m_3^2 - p^2} = -\frac{a_2}{2^2(p + 2)\cdot 2} = \frac{a_1}{2^4\cdot 2\,(p + 1)(p + 2)},$$

$$a_4 = -\frac{a_3}{m_4^2 - p^2} = -\frac{a_3}{2^2(p + 3)\cdot 3} = -\frac{a_1}{2^6\cdot 2\cdot 3\,(p + 1)(p + 2)(p + 3)}.$$

From this we generalize to

$$a_{k+1} = -\frac{a_k}{2^2 k\,(p + k)} = \frac{(-1)^k a_1}{2^{2k}\,\Pi(k)\,(p + 1)(p + 2)\cdots(p + k)} = \frac{(-1)^k a_1\,\Pi(p)}{2^{2k}\,\Pi(k)\,\Pi(p + k)}. \qquad (24)$$

The coefficient $a_1$ is arbitrary, since a multiplicative constant may be inserted into any homogeneous linear differential equation. The normalization of what are called Bessel functions (sometimes cylinder functions) of the first kind is attained by setting $a_1 = 1/2^p\,\Pi(p)$. The result is

$$J_p(x) = \frac{x^p}{2^p\,\Pi(p)}\left[1 - \frac{x^2}{2^2(p + 1)} + \frac{x^4}{2^4\,\Pi(2)(p + 1)(p + 2)} - \frac{x^6}{2^6\,\Pi(3)(p + 1)(p + 2)(p + 3)} + \cdots\right] = \sum_{k=0}^{\infty} \frac{(-1)^k}{\Pi(k)\,\Pi(p + k)}\left(\frac{x}{2}\right)^{p+2k}. \qquad (25)$$


Comparison with the series for sin x and cos x shows that the convergence of these series is excellent. The character of $J_0$ and of $J_1$ is to be seen from fig. 1.
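
The series (25) is easy to evaluate directly. The following sketch is an illustration, not part of the original derivation: the name `bessel_j` and the fixed number of terms are assumptions of this example, and the standard-library `math.gamma(z + 1)` is used for $\Pi(z)$. It sums the series for real order p (not a negative integer) and x ≥ 0, and it can be checked against the closed form $\sqrt{2/(\pi x)}\,\sin x$ for p = 1/2 and against the three-term recurrence (31).

```python
import math


def bessel_j(p, x, terms=40):
    """Partial sum of the power series (25) for J_p(x).

    Assumes x >= 0 and that p is not a negative integer, so that every
    Pi(p + k) = gamma(p + k + 1) is finite. Forty terms are ample for x
    up to about 10; this is a sketch, not a general-purpose routine.
    """
    total = 0.0
    for k in range(terms):
        coeff = (-1) ** k / (math.gamma(k + 1) * math.gamma(p + k + 1))
        total += coeff * (x / 2.0) ** (p + 2 * k)
    return total


if __name__ == "__main__":
    for x in (0.0, 1.0, 2.0, 5.0):
        print(x, bessel_j(0, x), bessel_j(1, x))
```

Because the terms alternate in sign and grow before they decay, the partial sums lose accuracy for large x; there the asymptotic forms of § 3 are the more practical tool.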

In the above derivation there is nothing which restricts the order 
of the Bessel functions to whole numbers. Even for non-integral values 
of p we are able to find at once a second independent solution which 
supplies the second constant of integration (the general solution of the 
second-order equation (20) must contain two independent constants). 
Examining now the value m 1 = —p, we get an entirely different power 
series. For instance, for p = 1/3, the exponents 1/3, 7/3, 13/3 . . . occur, 



while for p = -1/3 we get -1/3, 5/3, 11/3 .... Thus for non-integral p, the general solution of the Bessel equation is given by

$$y = A\,J_p + B\,J_{-p}. \qquad (26)$$

The power series behaves quite differently for negative integral values -n of p. Since $1/\Pi(-n)$ is zero according to equation (4), p. 807, there is no contribution to the sum until k = n. In order to begin the summation with zero again, we set k = l + n and obtain the expression

$$J_{-n}(x) = \sum_{l=0}^{\infty} \frac{(-1)^{l+n}}{\Pi(l + n)\,\Pi(l)}\left(\frac{x}{2}\right)^{n+2l}. \qquad (27)$$

But this is identical with $(-1)^n J_n$. Hence for integral values n of the order number p,

$$J_{-n}(x) = (-1)^n J_n(x). \qquad (28)$$

In the important instance, then, where p is integral, $m_1 = -p$ does not supply a second independent solution. This solution must be determined separately (see § 3, below).
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(c) Relations between the Bessel functions and their derivatives 

The Bessel functions have certain similarities with the trigonometric functions; in particular, $J_0(x)$ is comparable with cos x, $J_1(x)$ with sin x. Like the trigonometric functions, the Bessel functions have infinitely many zeroes. This similarity also exhibits itself in the relations between the derivatives of the functions and the functions themselves, comparable with the relation d(sin x)/dx = cos x.

Let us show, first, that

$$J_p'(x) = \tfrac{1}{2}\left[J_{p-1}(x) - J_{p+1}(x)\right]. \qquad (29)$$

By differentiation of the series expression for $J_p$ we get

$$J_p'(x) = \frac{1}{2}\sum_{k=0}^{\infty} \frac{(-1)^k (p + 2k)}{\Pi(k)\,\Pi(p + k)}\left(\frac{x}{2}\right)^{p+2k-1}.$$

Writing the factor p + 2k in two parts, p + k and k, we can obtain two summations. The first yields the series

$$\frac{1}{2}\sum_{k=0}^{\infty} \frac{(p + k)(-1)^k}{\Pi(k)\,\Pi(p + k)}\left(\frac{x}{2}\right)^{p+2k-1} = \frac{1}{2}\sum_{k=0}^{\infty} \frac{(-1)^k}{\Pi(k)\,\Pi(p + k - 1)}\left(\frac{x}{2}\right)^{p-1+2k} = \tfrac{1}{2}J_{p-1}(x).$$

The series formed with the second factor will be

$$\frac{1}{2}\sum_{k=0}^{\infty} \frac{k\,(-1)^k}{\Pi(k)\,\Pi(p + k)}\left(\frac{x}{2}\right)^{p+2k-1} = \frac{1}{2}\sum_{k=0}^{\infty} \frac{(-1)^k}{\Pi(k - 1)\,\Pi(p + k)}\left(\frac{x}{2}\right)^{p+2k-1}.$$

Since $1/\Pi(-1) = 0$, the term corresponding to k = 0 will vanish and the enumeration will really begin with k = 1. In order to start counting with zero, we put k = l + 1 and obtain

$$-\frac{1}{2}\sum_{l=0}^{\infty} \frac{(-1)^l}{\Pi(l)\,\Pi(p + l + 1)}\left(\frac{x}{2}\right)^{p+2l+1} = -\tfrac{1}{2}J_{p+1}(x),$$

which proves the relation (29). In particular, since $J_{-1}(x) = -J_1(x)$, we get

$$J_0'(x) = -J_1(x). \qquad (29a)$$

In the same way, we find that

$$J_p'(x) = -\frac{p}{x}J_p(x) + J_{p-1}(x). \qquad (30)$$

From (29) and (30) it follows that

$$\frac{2p}{x}J_p(x) = J_{p-1}(x) + J_{p+1}(x). \qquad (31)$$
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A variety of further relations may be deduced from the recurrence 
formulae (29) to (31), but none of these will be given here. An impor- 
tant integral relation, however, will be derived. Multiplication of (30) 
by x p yields 

$$x^p J_p'(x) + p\,x^{p-1} J_p(x) = x^p J_{p-1}(x).$$

The left side of the equation is the exact derivative of $x^p J_p(x)$, so that the result of integration is

$$\int x^p J_{p-1}(x)\,dx = x^p J_p(x). \qquad (32)$$


(d) Representation of Bessel functions of integral order by definite integrals

It will be shown that for integer values n of p, the fundamental differential equation (20), p. 810, is satisfied by the function

$$J_n(x) = \frac{1}{\pi}\int_0^{\pi} \cos(x\sin\phi - n\phi)\,d\phi. \qquad (33)$$

Differentiating,

$$\frac{dJ_n(x)}{dx} = -\frac{1}{\pi}\int_0^{\pi} \sin(x\sin\phi - n\phi)\,\sin\phi\,d\phi;$$

$$\frac{d^2 J_n(x)}{dx^2} = -\frac{1}{\pi}\int_0^{\pi} \cos(x\sin\phi - n\phi)\,\sin^2\phi\,d\phi.$$

Substituting in the differential equation, we must have

$$\int_0^{\pi} \cos(x\sin\phi - n\phi)\,\sin^2\phi\,d\phi + \frac{1}{x}\int_0^{\pi} \sin(x\sin\phi - n\phi)\,\sin\phi\,d\phi - \left(1 - \frac{n^2}{x^2}\right)\int_0^{\pi} \cos(x\sin\phi - n\phi)\,d\phi = 0.$$

To prove the validity of this equation, transform the second integral by integrating by parts, obtaining

$$\int_0^{\pi} \sin(x\sin\phi - n\phi)\,\sin\phi\,d\phi = -\Big[\sin(x\sin\phi - n\phi)\cos\phi\Big]_0^{\pi} + \int_0^{\pi} \cos\phi\,\cos(x\sin\phi - n\phi)\,(x\cos\phi - n)\,d\phi.$$

The bracketed term vanishes at both limits. Combining with the other integrals, we get simply

$$-\frac{n}{x^2}\int_0^{\pi} \cos(x\sin\phi - n\phi)\,(x\cos\phi - n)\,d\phi = -\frac{n}{x^2}\Big[\sin(x\sin\phi - n\phi)\Big]_0^{\pi},$$

which turns out to be zero, thus proving that the function (33) satisfies the Bessel equation.



(e) Orthogonality of the Bessel functions

The computation of the coefficients of Fourier series is made possible by the vanishing of certain integrals of trigonometric functions. The Bessel functions have similar properties, so that series developments can be set up in terms of Bessel functions instead of in terms of trigonometric functions. This may be shown by starting with the following differential equations:

$$x^2 y_1'' + x y_1' + (\alpha^2 x^2 - p^2)\,y_1 = 0,$$
$$x^2 y_2'' + x y_2' + (\beta^2 x^2 - p^2)\,y_2 = 0.$$

By making the substitutions $z = \alpha x$, $z = \beta x$, it is easy to see that the solution of the first equation is $y_1 = J_p(\alpha x)$, and that of the second is $y_2 = J_p(\beta x)$. Multiply the first equation by $y_2/x$, the second by $y_1/x$, and subtract:

$$x\,(y_1'' y_2 - y_2'' y_1) + (y_1' y_2 - y_2' y_1) + (\alpha^2 - \beta^2)\,x\,y_1 y_2 = 0.$$

Multiply by dx and integrate from 0 to 1. Then, since the first two parentheses taken together are exactly equal to the differential of $x\,(y_1' y_2 - y_2' y_1)$, we have

$$(\alpha^2 - \beta^2)\int_0^1 x\,y_1 y_2\,dx = -\Big[x\,(y_1' y_2 - y_2' y_1)\Big]_0^1.$$

Substituting the Bessel functions in place of y and remembering that $y_1' = \alpha J_p'(\alpha x)$, there results

$$(\alpha^2 - \beta^2)\int_0^1 x\,J_p(\alpha x)\,J_p(\beta x)\,dx = \Big[x\,\{\beta J_p(\alpha x)\,J_p'(\beta x) - \alpha J_p(\beta x)\,J_p'(\alpha x)\}\Big]_0^1.$$

If we exclude functions of fractional, negative orders p < -1, which become infinite of too high an order when x = 0, the expression in brackets will vanish at the lower limit and we get

$$\int_0^1 x\,J_p(\alpha x)\,J_p(\beta x)\,dx = \frac{\beta J_p(\alpha)\,J_p'(\beta) - \alpha J_p(\beta)\,J_p'(\alpha)}{\alpha^2 - \beta^2}.$$

If $\alpha$ and $\beta$ are two different zeroes of the Bessel function, we have the first orthogonality equation,

$$\int_0^1 x\,J_p(\alpha x)\,J_p(\beta x)\,dx = 0.$$

The expression becomes indeterminate when $\alpha = \beta$. Assuming $\beta$ to differ from $\alpha$ by a small quantity which is allowed to approach zero, the value for $\alpha = \beta$ may be obtained by differentiating numerator and denominator with respect to $\beta$, and then setting $\beta = \alpha$. The result is

$$\int_0^1 x\,J_p(\alpha x)\,J_p(\alpha x)\,dx = -\frac{J_p(\alpha)\,J_p'(\alpha) + \alpha J_p(\alpha)\,J_p''(\alpha) - \alpha\,[J_p'(\alpha)]^2}{2\alpha}.$$

Using the fact that $J_p(\alpha) = 0$, we get the second orthogonality equation

$$\int_0^1 x\,[J_p(\alpha x)]^2\,dx = \tfrac{1}{2}\,[J_p'(\alpha)]^2.$$
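
Both orthogonality relations can be verified numerically for p = 0. The sketch below is illustrative only: the helper names, the bracketing intervals for the first two zeroes of $J_0$, and the use of Simpson's rule are assumptions of this example. It uses the power series (25) for the Bessel functions and the relation $J_0' = -J_1$, so that the normalization integral should equal $\tfrac{1}{2}[J_1(\alpha)]^2$.

```python
import math


def bessel_j(p, x, terms=40):
    """Partial sum of the series (25); assumes x >= 0, p not a negative integer."""
    return sum(
        (-1) ** k / (math.gamma(k + 1) * math.gamma(p + k + 1)) * (x / 2.0) ** (p + 2 * k)
        for k in range(terms)
    )


def find_zero(f, lo, hi, steps=60):
    """Bisection; assumes f changes sign exactly once between lo and hi."""
    f_lo = f(lo)
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def simpson(f, a, b, n=400):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


# First two zeroes of J_0, bracketed by sign changes in [2, 3] and [5, 6].
alpha = find_zero(lambda x: bessel_j(0, x), 2.0, 3.0)
beta = find_zero(lambda x: bessel_j(0, x), 5.0, 6.0)

# First orthogonality equation: the cross integral should vanish.
cross = simpson(lambda x: x * bessel_j(0, alpha * x) * bessel_j(0, beta * x), 0.0, 1.0)

# Second orthogonality equation: the norm should equal (1/2) [J_0'(alpha)]^2.
norm = simpson(lambda x: x * bessel_j(0, alpha * x) ** 2, 0.0, 1.0)
expected_norm = 0.5 * bessel_j(1, alpha) ** 2

if __name__ == "__main__":
    print(alpha, beta, cross, norm, expected_norm)
```

These two integrals are exactly what is needed to compute the coefficients of an expansion in the functions $J_0(\alpha_i x)$, in the same way that the corresponding trigonometric integrals give Fourier coefficients.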



3. Bessel Functions of the Second and Third Kinds. Asymptotic Values 
of Bessel Functions. 


(a) Expansion of Bessel functions of the second kind in series 


When integers are substituted for p in equation (25) no new function is obtained. To find the second solution, we pass to the limit by letting an order number which differs slightly from a whole number approach the integer value. Consider the expression

$$N_p(x) = \frac{J_p(x)\cos p\pi - J_{-p}(x)}{\sin p\pi},$$

which certainly represents a solution of the Bessel equation. When p is an integer, both numerator and denominator vanish, and the limiting value is obtained by differentiating numerator and denominator. The result is

$$N_n(x) = \left[\frac{-\pi\sin p\pi\,J_p(x) + \cos p\pi\,\dfrac{\partial J_p(x)}{\partial p} - \dfrac{\partial J_{-p}(x)}{\partial p}}{\pi\cos p\pi}\right]_{p=n} = \frac{1}{\pi}\left[\frac{\partial J_p(x)}{\partial p} - (-1)^n\frac{\partial J_{-p}(x)}{\partial p}\right]_{p=n}.$$

Using the notation of § 1, we get, further,

$$\left[\frac{\partial J_p(x)}{\partial p}\right]_{p=n} = \sum_{k=0}^{\infty} \frac{(-1)^k}{\Pi(k)\,\Pi(n + k)}\left(\frac{x}{2}\right)^{n+2k}\left[\log\frac{x}{2} - \Psi(n + k)\right],$$

$$\left[\frac{\partial J_{-p}(x)}{\partial p}\right]_{p=n} = \sum_{k=0}^{\infty} \frac{(-1)^k}{\Pi(k)\,\Pi(-n + k)}\left(\frac{x}{2}\right)^{-n+2k}\left[-\log\frac{x}{2} + \Psi(-n + k)\right].$$

Multiplying the second by $(-1)^n$ and subtracting, we have, by (14),

$$\pi N_n(x) = 2 J_n(x)\log\frac{x}{2} - \sum_{k=0}^{\infty} \frac{(-1)^k\,\Psi(n + k)}{\Pi(k)\,\Pi(n + k)}\left(\frac{x}{2}\right)^{n+2k} - \sum_{k=0}^{\infty} \frac{(-1)^{k+n}\,\Psi(-n + k)}{\Pi(k)\,\Pi(-n + k)}\left(\frac{x}{2}\right)^{-n+2k}. \qquad (38)$$

As long as $k \geq n$, the same powers of x occur in the first summation in connexion with n + k as do in the second one in connexion with k. The coefficients are $\Psi(k)/\Pi(k + n)\,\Pi(k)$, and if these are combined with the first summation, all that remains of the second one is the part from k = 0 to k = n - 1. Here the coefficients are [cf. equation (14) of § 1]:

$$\frac{\Psi(-n + k)}{\Pi(k)\,\Pi(-n + k)} = (-1)^{n-k}\,\frac{\Pi(n - k - 1)}{\Pi(k)}.$$

Finally,

$$N_n(x) = \frac{2}{\pi}J_n(x)\log\frac{x}{2} - \frac{1}{\pi}\sum_{k=0}^{\infty} (-1)^k\,\frac{\Psi(k) + \Psi(n + k)}{\Pi(k)\,\Pi(n + k)}\left(\frac{x}{2}\right)^{n+2k} - \frac{1}{\pi}\sum_{k=0}^{n-1} \frac{\Pi(n - k - 1)}{\Pi(k)}\left(\frac{x}{2}\right)^{-n+2k}. \qquad (39)$$

It follows from this development that the Bessel functions of the second kind (also called Neumann functions) become infinite as $x^{-n}$ when x approaches zero. For $N_0$ the sum involving negative powers drops out and the first term makes the entire function approach an infinite value logarithmically (see fig. 2).


(b) Bessel functions of the third kind (Hankel functions)

Bessel functions of the third kind are defined by

$$H_p^{(1)}(x) = J_p(x) + i\,N_p(x), \qquad H_p^{(2)}(x) = J_p(x) - i\,N_p(x). \qquad (40)$$

As will be shown immediately, these functions are related to $J_p$ and $N_p$ in the same way that $e^{ix}$ is related to sin x and cos x.

(c) Asymptotic forms of the Bessel functions 

The differential equation (20) may be reduced to a simpler form by setting

$$y = x^m z \qquad (41)$$

and determining the exponent m in such way as to make the term containing the first derivative of z vanish. As may be readily verified, this occurs when m = -1/2, leaving the following equation for z:

$$\frac{d^2 z}{dx^2} + \left(1 - \frac{p^2 - \tfrac{1}{4}}{x^2}\right)z = 0. \qquad (42)$$

[Fig. 2]

Obviously when $p = \pm 1/2$ this becomes the differential equation of simple harmonic vibration and $J_{1/2}$ as well as $J_{-1/2}$ must be of the form $(A\sin x + B\cos x)/\sqrt{x}$. Comparison of the series expressions yields the correct values of the constants:

$$J_{1/2}(x) = \sqrt{\frac{2}{\pi x}}\,\sin x, \qquad J_{-1/2}(x) = \sqrt{\frac{2}{\pi x}}\,\cos x.$$

However, for any value of p we again get the harmonic equation if x is made large enough. The asymptotic values of the Bessel functions become, after determining the constants,

$$H_p^{(1)}(x) = \sqrt{\frac{2}{\pi x}}\;e^{\,i\left(x - \frac{2p+1}{4}\pi\right)}, \qquad H_p^{(2)}(x) = \sqrt{\frac{2}{\pi x}}\;e^{-i\left(x - \frac{2p+1}{4}\pi\right)} \qquad \text{for x large.}$$

That is,

$$J_p(x) = \sqrt{\frac{2}{\pi x}}\,\cos\left(x - \frac{2p + 1}{4}\pi\right) \qquad \text{for x large.}$$

This leads to a way of representing cylindrical waves. If, in accordance with equation (16), p. 810, we set

$$u = J_p(k\rho)\,e^{i\omega t},$$

all particles will vibrate in the same phase and we have a standing wave. A progressive wave will be characterized by the use of $\omega(t - x/v)$ as the argument of the periodic function. As the asymptotic form shows, this is accomplished by the Hankel functions $H_p^{(2)}$ in conjunction with $e^{i\omega t}$.
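
How quickly the cosine form takes over can be seen by comparing it with the power series for $J_p$. The sketch below is only illustrative: the function names and the choice x = 20 are assumptions of this example. For p = 1/2 the asymptotic form is exact, since $\cos(x - \pi/2) = \sin x$; for other orders the difference falls off roughly as 1/x relative to the amplitude.

```python
import math


def bessel_j(p, x, terms=80):
    """Partial sum of the power series for J_p(x); assumes x >= 0 and
    that p is not a negative integer. Eighty terms suffice up to x of about 25."""
    return sum(
        (-1) ** k / (math.gamma(k + 1) * math.gamma(p + k + 1)) * (x / 2.0) ** (p + 2 * k)
        for k in range(terms)
    )


def bessel_j_asymptotic(p, x):
    """Large-x form sqrt(2 / (pi x)) cos(x - (2p + 1) pi / 4)."""
    return math.sqrt(2.0 / (math.pi * x)) * math.cos(x - (2.0 * p + 1.0) * math.pi / 4.0)


if __name__ == "__main__":
    for p in (0, 0.5, 1):
        for x in (2.0, 5.0, 10.0, 20.0):
            print(p, x, bessel_j(p, x), bessel_j_asymptotic(p, x))
```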

Ex. 135. Represent $J_{3/2}$ by means of elementary functions.


4. Spherical Harmonics. 


(a) Differential equation of spherical harmonics

The Laplacian potential equation $\Delta u = 0$ leads at once to another class of functions. We seek such solutions as are homogeneous and of degree n in the co-ordinates x, y, z. These solutions, expressed in three-dimensional polar co-ordinates, are of the form

$$u = r^n F_n(\theta, \phi). \qquad (44)$$

We write the potential equation in polar form and substitute (44). Then, according to p. 268, the equation for $F_n$ becomes

$$\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial F_n}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2 F_n}{\partial\phi^2} + n(n + 1)\,F_n = 0. \qquad (45)$$

Inasmuch as the equation remains unchanged if -(n + 1) replaces n, $r^{-(n+1)}F_n$ is also a solution if $r^n F_n$ is a solution.

Assuming independence of $\phi$, a simpler equation results by setting $\cos\theta = x$ and $F_n = y$:

$$\frac{d}{dx}\left[(1 - x^2)\frac{dy}{dx}\right] + n(n + 1)\,y = 0. \qquad (46)$$

(b) Legendre polynomials

As in the case of Bessel functions, we can find a power-series solution for the differential equation. It is found that for one of the two solutions the power series terminates, i.e. the solution is given by a polynomial. However, a more vivid way of arriving at our goal is the following: Consider a unit electrostatic charge located on the z axis at unit distance from the origin. At a point P whose distance from the charge is r', the electrostatic potential will be

$$u = \frac{1}{r'} = (1 + r^2 - 2r\cos\theta)^{-1/2} = [1 + r(r - 2x)]^{-1/2}. \qquad (47)$$

If r is small, the radical may be expanded by the binomial theorem, yielding a series in powers of r. Since each term is a solution of the Laplace equation, the coefficients will be the Legendre polynomials (or surface zonal harmonics) $P_n$ that we seek. Denoting the largest integer which is less than or equal to n/2 by [n/2], the general expression for these functions may be written

$$P_n(x) = \sum_{k=0}^{[n/2]} (-1)^k\,\frac{1\cdot 3\cdot 5\cdots(2n - 2k - 1)}{2^k\,k!\,(n - 2k)!}\,x^{n-2k}. \qquad (48)$$

The polynomials for n = 0 to n = 5, inclusive, were given on p. 699. Since $x = \cos\theta$, we obtain $F_n$ in powers of $\cos\theta$. Using the well-known formulae, these functions may be expressed in terms of multiples of the angle. For example, for $P_2$,

$$P_2(x) = \tfrac{1}{2}(3x^2 - 1) = \tfrac{1}{4}(3\cos 2\theta + 1).$$


(c) Relations between spherical harmonics of various orders 

The development of (47), arranged in ascending powers of r, has the form

$$(1 - 2rx + r^2)^{-1/2} = P_0(x) + rP_1(x) + r^2P_2(x) + \cdots + r^nP_n(x) + r^{n+1}P_{n+1}(x) + \cdots.$$

Differentiating throughout with respect to r,

$$(1 - 2rx + r^2)^{-3/2}(x - r) = P_1(x) + 2rP_2(x) + \cdots + nr^{n-1}P_n(x) + (n + 1)r^nP_{n+1}(x) + \cdots$$

or

$$(1 - 2rx + r^2)^{-1/2}(x - r) = \left(P_1(x) + 2rP_2(x) + \cdots\right)(1 - 2rx + r^2).$$

Developing the left side once more according to powers of r,

$$(x - r)\left(P_0(x) + rP_1(x) + \cdots + r^nP_n(x) + r^{n+1}P_{n+1}(x) + \cdots\right) = \left(P_1(x) + 2rP_2(x) + \cdots\right)(1 - 2rx + r^2).$$

Comparison of the same powers of r yields the relation

$$(2n + 1)\,xP_n(x) = (n + 1)P_{n+1}(x) + nP_{n-1}(x). \qquad (49)$$

In the same way we obtain, by differentiating with respect to x,

$$(2n + 1)\,P_n(x) = P'_{n+1}(x) - P'_{n-1}(x). \qquad (50)$$
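
The explicit sum (48), the generating function (47) and the recurrence (49) can be checked against one another numerically. The following is a minimal sketch; the function name `legendre_p` and the sample arguments are assumptions of this example, not part of the text.

```python
from math import factorial


def legendre_p(n, x):
    """Legendre polynomial P_n(x) from the finite sum (48)."""
    total = 0.0
    for k in range(n // 2 + 1):
        # Product of odd numbers 1 * 3 * 5 * ... * (2n - 2k - 1); empty product is 1.
        odd_product = 1
        for j in range(1, 2 * n - 2 * k, 2):
            odd_product *= j
        coeff = (-1) ** k * odd_product / (2**k * factorial(k) * factorial(n - 2 * k))
        total += coeff * x ** (n - 2 * k)
    return total


def generating_sum(r, x, n_max=30):
    """Partial sum P_0 + r P_1 + ... + r^n_max P_n_max of the development of (47)."""
    return sum(r**n * legendre_p(n, x) for n in range(n_max + 1))


if __name__ == "__main__":
    x = 0.3
    print([legendre_p(n, x) for n in range(6)])
    print(generating_sum(0.2, x), (1.0 - 2.0 * 0.2 * x + 0.2**2) ** -0.5)
```

For numerical work with large n, the recurrence (49) is preferable to the explicit sum, whose alternating terms cancel heavily.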


(d) The associated harmonics 

We now take up the consideration of a single-valued dependence of $F_n$ on the azimuth $\phi$, setting

$$F_n(\theta, \phi) = e^{im\phi} f_n(\theta). \qquad (51)$$

We then obtain as the differential equation for $f_n$

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\,\frac{df_n}{d\theta}\right) + \left[n(n + 1) - \frac{m^2}{\sin^2\theta}\right]f_n = 0. \qquad (52)$$

Taking $f_n(\cos\theta) = y(x)$, this gives

$$(1 - x^2)\frac{d^2 y}{dx^2} - 2x\frac{dy}{dx} + \left[n(n + 1) - \frac{m^2}{1 - x^2}\right]y = 0. \qquad (53)$$

In order to simplify this relation, we set $y = (1 - x^2)^p u_m$ and determine p in such a way as to make the coefficient of $u_m$ a constant. The result is

$$p = \tfrac{1}{2}m, \qquad y = (1 - x^2)^{m/2}\,u_m, \qquad (54)$$

and the differential equation for $u_m$ becomes

$$(1 - x^2)\frac{d^2 u_m}{dx^2} - 2(m + 1)\,x\,\frac{du_m}{dx} + [n(n + 1) - m(m + 1)]\,u_m = 0. \qquad (55)$$

Differentiating,

$$(1 - x^2)\frac{d^3 u_m}{dx^3} - 2(m + 2)\,x\,\frac{d^2 u_m}{dx^2} + [n(n + 1) - (m + 1)(m + 2)]\,\frac{du_m}{dx} = 0. \qquad (56)$$

Thus if $u_m$ satisfies (55) for m, then $du_m/dx$ satisfies (55) for m + 1. For m = 0, $u_0$ becomes identical with $P_n$, and so $d^m P_n(x)/dx^m$ satisfies (55) for m = m, and the solution of (53) for m = m is

$$P_n^m(x) = (1 - x^2)^{m/2}\,\frac{d^m P_n(x)}{dx^m}. \qquad (57)$$
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These functions are called the associated Legendre polynomials or 
associated harmonics. Since the nth derivative of a polynomial of nth degree is a constant, we have

$$P_n^n(x) = C\,(1 - x^2)^{n/2} = C\sin^n\theta$$

and $P_n^m(x) = 0$ for m > n.


If (49) is differentiated m times, the result is

$$(2n + 1)\,x\,\frac{d^m P_n(x)}{dx^m} + m(2n + 1)\,\frac{d^{m-1}P_n(x)}{dx^{m-1}} = (n + 1)\,\frac{d^m P_{n+1}(x)}{dx^m} + n\,\frac{d^m P_{n-1}(x)}{dx^m}.$$

Now if (50) is differentiated (m - 1) times, we get

$$(2n + 1)\,\frac{d^{m-1}P_n(x)}{dx^{m-1}} = \frac{d^m P_{n+1}(x)}{dx^m} - \frac{d^m P_{n-1}(x)}{dx^m}.$$

Multiplying the last equation by m and subtracting the result from the preceding equation, we get, after multiplying by $(1 - x^2)^{m/2}$,

$$(2n + 1)\,xP_n^m(x) = (n - m + 1)\,P_{n+1}^m(x) + (n + m)\,P_{n-1}^m(x). \qquad (58)$$


(e) Orthogonality of the Legendre polynomials 

Consider two Legendre polynomials having the same upper index but different lower indices n and n' and satisfying the differential equations

$$\frac{d}{dx}\left[(1 - x^2)\frac{dP_n^m(x)}{dx}\right] + \left[n(n + 1) - \frac{m^2}{1 - x^2}\right]P_n^m(x) = 0,$$

$$\frac{d}{dx}\left[(1 - x^2)\frac{dP_{n'}^m(x)}{dx}\right] + \left[n'(n' + 1) - \frac{m^2}{1 - x^2}\right]P_{n'}^m(x) = 0.$$

Multiply the first of these by $P_{n'}^m$, the second by $P_n^m$, and subtract the second from the first. Integrating the result between the limits -1 and +1, we get

$$[n(n + 1) - n'(n' + 1)]\int_{-1}^{+1} P_n^m(x)\,P_{n'}^m(x)\,dx = \int_{-1}^{+1} P_n^m(x)\,\frac{d}{dx}\left[(1 - x^2)\frac{dP_{n'}^m(x)}{dx}\right]dx - \int_{-1}^{+1} P_{n'}^m(x)\,\frac{d}{dx}\left[(1 - x^2)\frac{dP_n^m(x)}{dx}\right]dx. \qquad (59)$$

Integrating by parts, the first integral on the right becomes

$$\int_{-1}^{+1} P_n^m(x)\,\frac{d}{dx}\left[(1 - x^2)\frac{dP_{n'}^m(x)}{dx}\right]dx = \left[P_n^m(x)\,(1 - x^2)\,\frac{dP_{n'}^m(x)}{dx}\right]_{-1}^{+1} - \int_{-1}^{+1} (1 - x^2)\,\frac{dP_n^m(x)}{dx}\,\frac{dP_{n'}^m(x)}{dx}\,dx.$$

The bracketed expression vanishes at both limits, while the integral expression cancels the corresponding one that is obtained in a similar way from the second integral in (59). Hence

$$\int_{-1}^{+1} P_n^m(x)\,P_{n'}^m(x)\,dx = 0 \qquad \text{for } n' \neq n. \qquad (60)$$

The computation of $\int_{-1}^{+1} [P_n^m(x)]^2\,dx$ is somewhat more troublesome, and so we content ourselves with quoting the result:

$$\int_{-1}^{+1} [P_n^m(x)]^2\,dx = \frac{2}{2n + 1}\,\frac{(n + m)!}{(n - m)!}. \qquad (61)$$

Since in wave mechanics the proper functions (mostly Legendre polynomials) are assumed to be "normalised to unity", the values given in tables must be multiplied by the reciprocal of the square root of the preceding expression if numerical values are of concern in the problem.


(f) Application to the wave-mechanical calculation of the selection rules for the magnetic and azimuthal quantum numbers

In the older quantum theory, the selection rules were deduced by applying the correspondence principle to the atom model. In wave mechanics, these rules are consequences of the orthogonality properties of spherical harmonics. Imagine a unique direction chosen in an arbitrarily weak magnetic field, and let this direction be the z axis. Neglecting normalization, two proper functions of hydrogen may be written in the form (p. 702)

$$\psi_i = R_i(r)\,P_{l_i}^{m_i}(\cos\theta)\,\cos m_i\phi \qquad (i = 1 \text{ or } 2).$$

According to p. 705, the dipole moment governing the radiation accompanying the transition from 1 to 2 will be

$$\mathbf{p} = e\int \mathbf{r}\,\psi_1\psi_2\,d\tau,$$

where the integration is to be extended over the entire space. Since $z = r\cos\theta$, the z component of this moment may be written out in the form of a product of three integrals:

$$p_z = e\int_0^{\infty} r\,R_1(r)\,R_2(r)\,r^2\,dr \times \int_0^{\pi} P_{l_1}^{m_1}(\cos\theta)\,P_{l_2}^{m_2}(\cos\theta)\,\cos\theta\,\sin\theta\,d\theta \times \int_0^{2\pi} \cos m_1\phi\,\cos m_2\phi\,d\phi.$$

The last integral vanishes, except when $m_1 = m_2$. This is the selection rule applying to the magnetic quantum number for light vibrating parallel to the field direction. For $m_1 = m_2 = m$, the second integral, written in terms of the variable x, becomes

$$J = \int_{-1}^{+1} x\,P_{l_1}^m(x)\,P_{l_2}^m(x)\,dx. \qquad (63)$$

According to equation (58), this becomes

$$J = \frac{l_1 - m + 1}{2l_1 + 1}\int_{-1}^{+1} P_{l_1+1}^m(x)\,P_{l_2}^m(x)\,dx + \frac{l_1 + m}{2l_1 + 1}\int_{-1}^{+1} P_{l_1-1}^m(x)\,P_{l_2}^m(x)\,dx. \qquad (64)$$

The first integral is zero except when $l_2 = l_1 + 1$; the second is zero except when $l_2 = l_1 - 1$. This is nothing more nor less than the selection rule $l \to l \pm 1$.


(g) Legendre functions of the second kind

For the sake of completeness it should be added that the Legendre polynomials, containing as they do only one multiplicative constant, represent only one of the pair of solutions of the differential equation. A second solution, for m = 0 and n = 0, is

$$Q_0(x) = \tanh^{-1} x, \qquad (65)$$

and for m = 0 and n = 1,

$$Q_1(x) = x\tanh^{-1} x - 1. \qquad (66)$$

The succeeding Legendre functions of the second kind are obtained from the following recurrence formula, corresponding to equation (49), p. 821:

$$(n + 1)\,Q_{n+1}(x) = (2n + 1)\,x\,Q_n(x) - n\,Q_{n-1}(x). \qquad (67)$$

In the same way, the associated Legendre functions of the second kind are obtained from

$$Q_n^m(x) = (1 - x^2)^{m/2}\,\frac{d^m Q_n(x)}{dx^m}. \qquad (68)$$
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SOLUTIONS OF THE EXERCISES 


The more difficult exercises are marked with an asterisk. In such instances 
it is intended that the solution should be consulted as an aid in solving the problem. 


Ex. 1, p. 10. A + B -f C - 0. 

Ex. 2, p. 10. (a) B = aA; (6) C = aA + (3B, where a and (3 are scalars. 

Ex. 3, p. 12. This is the trigonometric Law of Cosines. The alteration in sign 
of the term 2AB cos (A, B) is due to the fact that here cos (A, B) refers to the 
exterior angle of the triangle. 

Ex. 4, p. 12. (A + B) and (A − B) are the diagonals of the parallelogram
formed by A and B. In the particular instance where $A^2 = B^2$ there results the
theorem that the diagonals of a rhombus are mutually perpendicular.

Ex. 5, p. 12. $\cos\theta = \mathbf{s}_1\cdot\mathbf{s}_2 = \cos\alpha_1\cos\alpha_2 + \cos\beta_1\cos\beta_2 + \cos\gamma_1\cos\gamma_2$.

Ex. 6, p. 15. Project the vectors B, C and (B + C) on the plane normal to
A, in which the product vectors [AB], &c., lie. The product vectors are obtained
from the projections by rotation through $\pi/2$ and multiplication by |A|, since
the magnitude of the projection of B is |B| sin(AB). Since the projection of the
sum of vectors is equal to the sum of the projections, and since the altered parallelo-
gram of the projections is similar to the original form, (16) is proved.

Ex. 7, p. 15. $A^2B^2\sin^2\theta + A^2B^2\cos^2\theta = A^2B^2$.

Ex. 8, p. 15:

(a) Since the projection upon n of a radius vector drawn to any point of the
plane must be equal to p, the equation of the plane must be

$$\mathbf{r}\cdot\mathbf{n} = p \quad\text{or}\quad x\cos\alpha + y\cos\beta + z\cos\gamma - p = 0.$$

(b) $p_0 = (\mathbf{r}_0 - \mathbf{r})\cdot\mathbf{n} = \mathbf{r}_0\cdot\mathbf{n} - p$.

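(c) The vectors $(\mathbf{r} - \mathbf{r}_1)$, $(\mathbf{r}_2 - \mathbf{r}_1)$ and $(\mathbf{r}_3 - \mathbf{r}_1)$ are to be coplanar, i.e. the
parallelepiped formed by them must have zero volume, whence the equation
of the plane is

$$(\mathbf{r} - \mathbf{r}_1)\cdot[(\mathbf{r}_2 - \mathbf{r}_1)(\mathbf{r}_3 - \mathbf{r}_1)] = 0,$$

or, in terms of the co-ordinates, we get the equation of the plane:

$$\begin{vmatrix} x - x_1 & y - y_1 & z - z_1\\ x_2 - x_1 & y_2 - y_1 & z_2 - z_1\\ x_3 - x_1 & y_3 - y_1 & z_3 - z_1 \end{vmatrix} = 0.$$
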

Ex. 9, p. 17. If we set [AB] = E and apply (24) we obtain

$$[\mathbf{E}[\mathbf{CD}]] = \mathbf{C}\,(\mathbf{E}\cdot\mathbf{D}) - \mathbf{D}\,(\mathbf{E}\cdot\mathbf{C}) = \mathbf{C}\,(\mathbf{ABD}) - \mathbf{D}\,(\mathbf{ABC}),$$

i.e. a vector in the plane of C and D. Similarly, putting [CD] = F yields

$$[[\mathbf{AB}]\mathbf{F}] = \mathbf{B}\,(\mathbf{CDA}) - \mathbf{A}\,(\mathbf{CDB}),$$

a vector in the plane of A and B. Hence the
product [[AB][CD]] is a vector along the line of intersection of the planes of AB
and CD; this is also evident geometrically.

Ex. 10, p. 20. When A has a constant direction. 

Ex. 11, p. 20. Since $\oint d\mathbf{S} = 0$ for any closed surface, the integral over the
curved surface is equal and opposite to the integral over the plane surface bounded 
by the curve. But this has the direction of the normal to the plane, which is also 
the direction of the binormals of the boundary curve. 

Ex. 12, p. 23. The level surfaces of the function $r(x, y, z)$ are spheres, so that
the gradient has the radial direction. Proceeding by $d\mathbf{r}$ in the direction normal
to the sphere, the function changes by $dr = d|\mathbf{r}|$; hence the magnitude of the
gradient is unity, and grad r represents a unit vector in the direction of r. We
designate it by $\mathbf{r}_0$.

Ex. 13, p. 23. $\operatorname{grad} f = \dfrac{df}{dr}\operatorname{grad} r = \dfrac{df}{dr}\,\mathbf{r}_0 = \dfrac{df}{dr}\,\dfrac{\mathbf{r}}{r}$.


Ex. 14, p. 25. For a geometric derivation choose as volume element a portion
of the spherical shell of radii $r$, $r + dr$, which subtends a solid angle $d\Omega$ at the
centre. Since r is radial, it is normal to the spherical surfaces and the surface
integral becomes

$$\oint \mathbf{r}\cdot d\mathbf{S} = d\Omega\,(r + dr)^3 - d\Omega\,r^3 = 3\,d\Omega\,r^2\,dr.$$

Since the contained volume is $d\tau = d\Omega\,r^2\,dr$, the limiting value becomes div r = 3.
According to the formula, this is obtained immediately from

$$\frac{\partial x}{\partial x} + \frac{\partial y}{\partial y} + \frac{\partial z}{\partial z} = 3.$$


Ex. 15, p. 25. The value of the exports is equivalent to the gold import; the 
value of goods imported is equal to the gold export. The “ sources ” of the out- 
wardly directed stream of gold are the places where imported goods are consumed, 
and a similar statement is true for the centres of production of exported wares, 
which are “ sinks ” for the corresponding flow of gold. The integral of the gold 
stream over the boundaries represents the balance of trade; according to Gauss’s 
Theorem, this is equal to the change of the gold supply within the country. By 
the term “ gold ” we mean simply any item of credit. These considerations do 
not include payments without recompense (interest, &c.).

Ex. 16, p. 29. If radius vectors are drawn from O to points on a curve,
$d\mathbf{s} = d\mathbf{r}$, i.e. $\oint \mathbf{r}\cdot d\mathbf{s} = \oint d(r^2)/2 = 0$, so that curl r = 0. This results also by direct
computation from $\partial z/\partial y = 0$, &c.

Ex. 17, p. 37. t grad • t = ”, t grad • b == —xn. 


Ex. 18, p. 37. If x, y, z are the co-ordinates of a point on an axis,

$$\begin{aligned} a_{11}x + a_{12}y + a_{13}z &= \lambda x,\\ a_{21}x + a_{22}y + a_{23}z &= \lambda y,\\ a_{31}x + a_{32}y + a_{33}z &= \lambda z. \end{aligned} \qquad (1)$$

These equations, homogeneous in x, y, z, are solvable only if the determinant
vanishes:

$$\begin{vmatrix} a_{11}-\lambda & a_{12} & a_{13}\\ a_{21} & a_{22}-\lambda & a_{23}\\ a_{31} & a_{32} & a_{33}-\lambda \end{vmatrix} = 0. \qquad (2)$$

This so-called secular equation is of third degree in $\lambda$. If one root is known, two
of the equations (1) then give values of $y/x$ and $z/x$, i.e. the directions of the prin-
cipal axes. If we imagine the co-ordinate system placed along the principal
axes, $\lambda$ will give the ratio of the magnitudes of the vectors $A\mathbf{r}$ and $\mathbf{r}$ for the axial
directions. Since this is true for any co-ordinate system, the coefficients of the
secular equation must be independent of the co-ordinates. Since, in the system
of the principal axes, they contain only the $a_{ii}$, which we denote by $a_{I}$, $a_{II}$,
$a_{III}$, we have

$$a_{11} + a_{22} + a_{33} = a_{I} + a_{II} + a_{III},$$

$$a_{11}a_{22} + a_{22}a_{33} + a_{33}a_{11} - (a_{12}^2 + a_{13}^2 + a_{23}^2) = a_{I}a_{II} + a_{II}a_{III} + a_{III}a_{I},$$

$$a_{11}a_{22}a_{33} + a_{12}a_{23}a_{31} + a_{13}a_{21}a_{32} - (a_{11}a_{23}^2 + a_{22}a_{13}^2 + a_{33}a_{12}^2) = a_{I}a_{II}a_{III}.$$

An application of this tensor invariant appears on p. 176. 
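The invariance of the three coefficients of the secular equation can be checked numerically. The sketch below is an illustration only, not part of the original solution: it assumes a symmetric tensor built by rotating a diagonal tensor with arbitrarily chosen principal values, and the helper names (`rotation`, `invariants`, and so on) are hypothetical.

```python
import math


def matmul(a, b):
    """Product of two 3 x 3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(a):
    return [[a[j][i] for j in range(3)] for i in range(3)]


def rotation(alpha, beta):
    """Orthogonal matrix: rotation about z by alpha times rotation about x by beta."""
    ca, sa, cb, sb = math.cos(alpha), math.sin(alpha), math.cos(beta), math.sin(beta)
    rz = [[ca, -sa, 0.0], [sa, ca, 0.0], [0.0, 0.0, 1.0]]
    rx = [[1.0, 0.0, 0.0], [0.0, cb, -sb], [0.0, sb, cb]]
    return matmul(rz, rx)


def invariants(a):
    """The three combinations of components written out in the solution of Ex. 18."""
    i1 = a[0][0] + a[1][1] + a[2][2]
    i2 = (a[0][0] * a[1][1] + a[1][1] * a[2][2] + a[2][2] * a[0][0]
          - (a[0][1] ** 2 + a[0][2] ** 2 + a[1][2] ** 2))
    i3 = (a[0][0] * a[1][1] * a[2][2] + a[0][1] * a[1][2] * a[2][0]
          + a[0][2] * a[1][0] * a[2][1]
          - (a[0][0] * a[1][2] ** 2 + a[1][1] * a[0][2] ** 2 + a[2][2] * a[0][1] ** 2))
    return i1, i2, i3


def rotated_tensor(principal, alpha, beta):
    """Components R D R^T of a tensor with the given principal values."""
    d = [[principal[0], 0.0, 0.0], [0.0, principal[1], 0.0], [0.0, 0.0, principal[2]]]
    r = rotation(alpha, beta)
    return matmul(matmul(r, d), transpose(r))


if __name__ == "__main__":
    a = rotated_tensor((1.0, 2.5, -0.7), 0.4, 1.1)
    print(invariants(a))   # compare with the sum, pair sum and product of the principal values
```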


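* Ex. 19, p. 40.

(a) For a finite volume, integration of equation (58) (p. 31) gives
$\oint u\,d\mathbf{S} = \int \operatorname{grad} u\,d\tau$. But $\operatorname{grad}(\mathbf{i}\cdot\mathbf{r}) = \mathbf{i}$, hence $\oint (\mathbf{i}\cdot\mathbf{r})\,d\mathbf{S} = \int \mathbf{i}\,d\tau = \mathbf{i}V$.

(b) We determine first the projection upon i of the vector v, which we seek,
by scalar multiplication by i:

$$\mathbf{i}\cdot\mathbf{v} = \oint (\mathbf{i}\cdot\mathbf{r})(\mathbf{i}\cdot d\mathbf{S}) = \mathbf{i}\cdot\oint (\mathbf{i}\cdot\mathbf{r})\,d\mathbf{S} = \mathbf{i}^2 V.$$

Divide the volume into cylinders parallel to i. The lateral surfaces contribute
nothing, since $\mathbf{i}\cdot d\mathbf{S} = 0$, while the two bases together yield a vector parallel
to i. The resultant vector is therefore parallel to i, hence v = iV.


Ex. 20, p. 40.

$$\Delta\frac{1}{r} = \operatorname{div}\operatorname{grad}\frac{1}{r} = \operatorname{div}\left(-\frac{\mathbf{r}}{r^3}\right) = \frac{3}{r^4}\,\mathbf{r}\cdot\operatorname{grad} r - \frac{1}{r^3}\operatorname{div}\mathbf{r}.$$

Now $\operatorname{grad} r = \mathbf{r}/r$, $\operatorname{div}\mathbf{r} = 3$, so that $\Delta(1/r) = 0$.

Ex. 21, p. 40. Cf. p. 270.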

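Ex. 22, p. 43. Cylindrical co-ordinates:

$$(\operatorname{grad}\psi)_\rho = \frac{\partial\psi}{\partial\rho},\qquad (\operatorname{grad}\psi)_\phi = \frac{1}{\rho}\frac{\partial\psi}{\partial\phi},\qquad (\operatorname{grad}\psi)_z = \frac{\partial\psi}{\partial z},$$

$$\operatorname{div}\mathbf{A} = \frac{1}{\rho}\frac{\partial(\rho A_\rho)}{\partial\rho} + \frac{1}{\rho}\frac{\partial A_\phi}{\partial\phi} + \frac{\partial A_z}{\partial z},$$

$$(\operatorname{curl}\mathbf{A})_z = \frac{1}{\rho}\frac{\partial(\rho A_\phi)}{\partial\rho} - \frac{1}{\rho}\frac{\partial A_\rho}{\partial\phi},$$

$$(\operatorname{curl}\mathbf{A})_\rho = \frac{1}{\rho}\frac{\partial A_z}{\partial\phi} - \frac{\partial A_\phi}{\partial z},$$

$$(\operatorname{curl}\mathbf{A})_\phi = \frac{\partial A_\rho}{\partial z} - \frac{\partial A_z}{\partial\rho},$$

$$\Delta\psi = \frac{\partial^2\psi}{\partial\rho^2} + \frac{1}{\rho}\frac{\partial\psi}{\partial\rho} + \frac{1}{\rho^2}\frac{\partial^2\psi}{\partial\phi^2} + \frac{\partial^2\psi}{\partial z^2}.$$

Spatial polar co-ordinates: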


(grad t|;) r = 


div A = 


2ib 1 d<b 

&r’ (H W* = rd’ (gTad V* 

^|(^r) + r -ri rg 4(8ine^ 8 ) 


= 1 a A 

r sin 0 d<J> 


+ 


sin 0 £0 
1 8A4 


r sin 0 24 > 


(curl A) r 


= 1 ( dA o _ d _ 

r sin 0 \ 2<J> 20 


(sin 


in 04^ 


(curl A), = - % _ i 


1 dA r 1 8{tAq) 

r 8Q r dr 


(curl A)e = 


A = 


1 / 8(rAt) _ J_ 8A,\ 
r \ dr sin 0 8$ ) 

8Pty 2 dip 1 8 2 ip I 

dr 2 + r dr + r* 8Q 2 + r 2 001 0 20 



+ 


1 8 ^ 


r 2 sin 2 0 8<p 2 


Ex. 22a, p. 46. The truth of (a) is already contained in the solution of Ex. 18.
Here, in order to show the application for arbitrarily many dimensions, we call
the co-ordinates $x_1, x_2, x_3, \ldots, x_n$ instead of x, y, z. In this notation, the system
of equations (1) appearing in the solution to Ex. 18, corresponding to the root $\lambda^{(r)}$
of the secular equation, becomes

$$\sum_k a_{ik}\,x_k^{(r)} = \lambda^{(r)} x_i^{(r)}, \qquad i = 1, 2, \ldots, n. \qquad (1a)$$

Likewise, for any other root $\lambda^{(s)}$,

$$\sum_k a_{ik}\,x_k^{(s)} = \lambda^{(s)} x_i^{(s)}, \qquad i = 1, 2, \ldots, n. \qquad (2a)$$

Multiply the first equation by $x_i^{(s)}$ and the second by $x_i^{(r)}$, perform a summation
on each and then take the difference:

$$\sum_i\sum_k a_{ik}\,x_k^{(r)}x_i^{(s)} - \sum_i\sum_k a_{ik}\,x_k^{(s)}x_i^{(r)} = (\lambda^{(r)} - \lambda^{(s)})\sum_i x_i^{(r)}x_i^{(s)}. \qquad (3)$$

But A is assumed to be a symmetric matrix ($a_{ik} = a_{ki}$), so that the double sums
on the left are equal and the components of the “characteristic vectors”,* which
correspond to various roots of the secular equation, have the property that their
scalar product vanishes. They are mutually orthogonal in n-dimensional space.

* German: Eigen(vectors).

Inasmuch as the equations (1) are homogeneous, they tell us only the ratios
of the $x_i^{(r)}$, i.e. the directions of the vectors. Their magnitudes are still at our
disposal, and we may set them equal to unity (“normalize” them). The $x_i^{(r)}$ then
give the direction cosines of the co-ordinate system associated with A (in three
dimensions, the axes of a second-degree surface) relative to an arbitrarily-chosen
system. If these $n^2$ quantities are used to form a matrix in such manner that the
components corresponding to a given $\lambda^{(r)}$ are placed in one column, then this
matrix S will be orthogonal and the matrix $S^{-1}$ will be its transpose. If the com-
ponents of any vector r are $x_k^*$ in the characteristic system, then the components
in any arbitrary system are obtained, according to p. 34, by taking


$$\mathbf{r} = S\mathbf{r}^*.$$

Multiply r by matrix A, giving a transformed vector

$$\mathbf{r}' = A\mathbf{r} = AS\mathbf{r}^*,$$

whose components are given in the arbitrary system. According to p. 34, to
obtain from this the components in the eigensystem we must take

$$\mathbf{r}'^* = S^{-1}\mathbf{r}' = S^{-1}AS\,\mathbf{r}^*.$$

When this is calculated out, $S^{-1}AS$ is found to be a diagonal matrix having $\lambda^{(1)}$,
$\lambda^{(2)}$, ... as diagonal elements. This must be the case, since in the eigensystem
we have simply $x_1'^* = \lambda^{(1)}x_1^*$, $x_2'^* = \lambda^{(2)}x_2^*$, etc.
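The statement that $S^{-1}AS$ is diagonal can be illustrated in the simplest case, n = 2. The following sketch is a hedged illustration rather than the book's procedure: for a symmetric matrix with elements a, b, b, c it takes the rotation angle from $\tan 2\theta = 2b/(a - c)$, places the two normalized characteristic vectors in the columns of S, and forms $S^{T}AS$ (S being orthogonal, its transpose is its inverse). The function name `diagonalize_2x2` is an assumption made for this example.

```python
import math


def diagonalize_2x2(a: float, b: float, c: float):
    """Return (S, D) with D = S^T A S for the symmetric matrix A = [[a, b], [b, c]]."""
    theta = 0.5 * math.atan2(2.0 * b, a - c)
    cs, sn = math.cos(theta), math.sin(theta)
    # Columns of S are the normalized characteristic vectors.
    s = [[cs, -sn], [sn, cs]]
    amat = [[a, b], [b, c]]
    # First form A S, then S^T (A S).
    a_s = [[sum(amat[i][k] * s[k][j] for k in range(2)) for j in range(2)]
           for i in range(2)]
    d = [[sum(s[k][i] * a_s[k][j] for k in range(2)) for j in range(2)]
         for i in range(2)]
    return s, d


if __name__ == "__main__":
    s, d = diagonalize_2x2(2.0, 1.0, 2.0)
    print(d)   # off-diagonal elements should vanish to rounding error
```

For the sample matrix the diagonal elements are the two roots of the secular equation $(2 - \lambda)^2 - 1 = 0$.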


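Ex. 23, p. 57. In the real form, f(t) = 1/2,

$$b_n = \frac{\omega}{\pi}\int_{\pi/\omega}^{2\pi/\omega} 1\cdot\cos n\omega t\,dt = 0, \qquad c_n = \frac{\omega}{\pi}\int_{\pi/\omega}^{2\pi/\omega} \sin n\omega t\,dt,$$

$$c_n = -\frac{1}{n\pi}\Big[\cos n\omega t\Big]_{\pi/\omega}^{2\pi/\omega} = -\frac{2}{n\pi} \text{ for } n \text{ odd},$$

$$\phantom{c_n} = 0 \text{ for } n \text{ even}.$$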


Ex. 24, p. 67. By equation (26) (p. 56) the coefficient A a for the interval from 
4 to s da is given by 

A. — f e-o* cos ox e-^tedxi 
Jo 


where the variable of integration is denoted by x, and /(x) is understood to be 0 
for negative values of x. This yields 





ax e imx-2mxa fa 


+ J f e~ox t ~Uox-inix* fa 
Jo 




+ 


* 


a - »{<*> - 2rw) a -f- »(m + 2ns)' 

In practical cases a <C o>, so that it is only when 2 tm is in the neighbourhood 

Aa - “ST® 7^ ue8 appreciably different from zero. The conjugate 
romplex value is obtained for As may be seen by combining the two forms 

^ amplitude of the vibration offrequenoy « turns 
out to be twice the modulus of A, (or The square of the amplitude, which 
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determines the intensity in the case of light vibrations, has the following value 
when the second integral is neglected: 


or for 
and 



1 

a 2 + 4 tt 2 (v - s)-' 



rnax. 


1 



J 1/2 = \ Im ax. for V — 5i /2 = - 


Ex. 25, p. 61. The extreme cases are a “figure eight” closed curve for $\delta = 0$
and a parabolic arc for $\delta = \pi/2$.


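Ex. 26, p. 64. If the polar angle $\phi$ does not enter,

$$\frac{\partial^2 u}{\partial r^2} + \frac{1}{r}\frac{\partial u}{\partial r} = \frac{1}{c^2}\frac{\partial^2 u}{\partial t^2}.$$

Setting $u = v(r)\,e^{i\omega t}$, we obtain

$$\frac{d^2 v}{dr^2} + \frac{1}{r}\frac{dv}{dr} + \frac{\omega^2}{c^2}\,v = 0.$$

If we put $x = r\omega/c$, we obtain the differential equation of cylinder functions of
zero order:

$$\frac{d^2 v}{dx^2} + \frac{1}{x}\frac{dv}{dx} + v = 0.$$

This equation has two distinct integrals, the Bessel Function $J_0(x) = J_0(r\omega/c)$,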
which remains finite at the origin, and the Neumann Function which becomes 
logarithmically infinite there. The general solution is the sum of each multiplied 
by an arbitrary constant. Tables of these functions may be found in the Jahnke- 
Emde collection (see reference on p. 868). 
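In place of tables, the finite solution can be evaluated directly from its power series, $J_0(x) = \sum_k (-1)^k (x/2)^{2k}/(k!)^2$. The sketch below is an illustration under that assumption; the names `bessel_j0` and `ode_residual` are hypothetical. It also checks by central differences that the series satisfies the zero-order cylinder-function equation of Ex. 26.

```python
def bessel_j0(x: float, terms: int = 40) -> float:
    """J_0(x) from its power series; adequate for moderate |x|."""
    total = 0.0
    term = 1.0                       # k = 0 term
    for k in range(terms):
        total += term
        # ratio of successive terms: -(x/2)^2 / (k + 1)^2
        term *= -((x / 2.0) ** 2) / ((k + 1) ** 2)
    return total


def ode_residual(x: float, h: float = 1e-3) -> float:
    """v'' + v'/x + v evaluated with central differences for v = J_0."""
    v_minus, v_zero, v_plus = bessel_j0(x - h), bessel_j0(x), bessel_j0(x + h)
    second = (v_plus - 2.0 * v_zero + v_minus) / (h * h)
    first = (v_plus - v_minus) / (2.0 * h)
    return second + first / x + v_zero


if __name__ == "__main__":
    print(bessel_j0(0.0), ode_residual(1.5))
```

The residual is not exactly zero because of the finite-difference step; it shrinks as `h` is reduced until rounding error takes over.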

Ex. 27, p. 66. Integration of the differential equation (54) (p. 66) under the 
condition l/V = kv yields the law of dispersion 

v 

v = . = . 

- C a j 


Ex. 28, p. 71. Cf. p. 202, formula (32) et seq.
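Ex. 29, p. 74.

$$\int_0^{2\pi}\frac{d\phi}{1 + \epsilon\cos\phi} = \int_0^{2\pi}\frac{d\phi}{1 + \dfrac{\epsilon}{2}\left(e^{i\phi} + e^{-i\phi}\right)} = \frac{2}{i\epsilon}\oint_{\text{unit circle}}\frac{dz}{z^2 + \dfrac{2}{\epsilon}z + 1}.$$

Assuming $\epsilon < 1$, the roots of the denominator are

$$z_1 = \frac{-1 + \sqrt{1-\epsilon^2}}{\epsilon}, \qquad z_2 = \frac{-1 - \sqrt{1-\epsilon^2}}{\epsilon}.$$

The point $z_1$ lies within the path of integration. Decomposing the denominator
into partial fractions,

$$\frac{1}{(z - z_1)(z - z_2)} = \frac{1}{z_1 - z_2}\left(\frac{1}{z - z_1} - \frac{1}{z - z_2}\right),$$

and the coefficient of $1/(z - z_1)$, the residue, becomes

$$\frac{1}{z_1 - z_2} = \frac{\epsilon}{2\sqrt{1-\epsilon^2}},$$

so that

$$\int_0^{2\pi}\frac{d\phi}{1 + \epsilon\cos\phi} = \frac{2}{i\epsilon}\cdot 2\pi i\cdot\frac{\epsilon}{2\sqrt{1-\epsilon^2}} = \frac{2\pi}{\sqrt{1-\epsilon^2}}.$$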
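The value $2\pi/\sqrt{1-\epsilon^2}$ obtained from the residue can be compared with a direct numerical quadrature. The following is a minimal sketch, assuming $0 \le \epsilon < 1$; the function names are illustrative. Because the integrand is smooth and periodic, the simple equal-step sum over one period converges very quickly.

```python
import math


def integral_numeric(eps: float, n: int = 2000) -> float:
    """Equal-step sum of 1 / (1 + eps cos(phi)) over one full period."""
    h = 2.0 * math.pi / n
    return h * sum(1.0 / (1.0 + eps * math.cos(k * h)) for k in range(n))


def integral_residue(eps: float) -> float:
    """Closed form obtained from the residue at z_1."""
    return 2.0 * math.pi / math.sqrt(1.0 - eps * eps)


if __name__ == "__main__":
    for eps in (0.1, 0.5, 0.9):
        print(eps, integral_numeric(eps), integral_residue(eps))
```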


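Ex. 30, p. 78. If $F = F(y, y')$, the Euler-Lagrange equation gives

$$\frac{\partial^2 F}{\partial y'^2}\,y'' + \frac{\partial^2 F}{\partial y\,\partial y'}\,y' - \frac{\partial F}{\partial y} = 0.$$

Multiplying by $y'$ and adding and subtracting the expression $y''(\partial F/\partial y')$, we
obtain

$$\frac{d}{dx}\left(y'\frac{\partial F}{\partial y'}\right) - \frac{dF}{dx} = 0,$$

from which the result follows.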


Ex. 31, p. 85. From the equation of motion

$$m\frac{d^2 r}{dt^2} = A\sin\omega t$$

it follows that

$$\frac{dr}{dt} = -\frac{A}{m\omega}\cos\omega t + C.$$

Let $(dr/dt)_0 = v_0$ for $t = 0$; then

$$C = v_0 + \frac{A}{m\omega}.$$

Integrating again,

$$r = -\frac{A}{m\omega^2}\sin\omega t + \left(v_0 + \frac{A}{m\omega}\right)t + r_0.$$

It is to be noticed that in general there is also a uniform translatory motion
superposed upon the oscillation.

Ex. 32, p. 85. Since $\mathbf{F} = -mg\mathbf{k}$, where k is the unit vector directed vertically
upward, we have from

$$m\frac{d^2\mathbf{r}}{dt^2} = -mg\mathbf{k}$$

the result that the motion is independent of the mass. Integrating once,

$$\frac{d\mathbf{r}}{dt} = -g\mathbf{k}\,t + \mathbf{v}_0.$$

Thus the hodograph is a vertical straight line. Taking the origin at the position
of the point when t = 0, a second integration gives

$$\mathbf{r} = -\frac{g}{2}\,\mathbf{k}\,t^2 + \mathbf{v}_0 t.$$

This shows that the orbit remains in the plane of k and $\mathbf{v}_0$. Taking this to be
the xz-plane and calling the angle between the initial velocity and the x-axis $\alpha$,
we have

$$x = v_0 t\cos\alpha, \qquad z = v_0 t\sin\alpha - \tfrac{1}{2}gt^2.$$

Elimination of t shows that the orbit is the parabola

$$z = x\tan\alpha - \frac{gx^2}{2v_0^2\cos^2\alpha}.$$

Besides x = 0, there is a point s where the trajectory crosses the x-axis. This is
given by

$$\tan\alpha = \frac{sg}{2v_0^2\cos^2\alpha} \qquad\text{or}\qquad s = \frac{v_0^2}{g}\sin 2\alpha.$$

This so-called range of the projectile is a maximum for $\alpha = \pi/4$, and has the
same value for $\alpha = \pi/4 + \beta$ as for $\alpha = \pi/4 - \beta$.
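The two properties of the range stated at the end of Ex. 32 are easily confirmed numerically. The sketch below is illustrative only; the function names and the sample values of $v_0$ and $g$ are assumptions chosen for the example.

```python
import math


def projectile_range(v0: float, alpha: float, g: float = 9.81) -> float:
    """Range s = v0^2 sin(2 alpha) / g on level ground."""
    return v0 * v0 * math.sin(2.0 * alpha) / g


def height(x: float, v0: float, alpha: float, g: float = 9.81) -> float:
    """Parabolic orbit z(x) obtained by eliminating t."""
    return x * math.tan(alpha) - g * x * x / (2.0 * v0 * v0 * math.cos(alpha) ** 2)


if __name__ == "__main__":
    v0, beta = 20.0, 0.2
    print(projectile_range(v0, math.pi / 4 + beta),
          projectile_range(v0, math.pi / 4 - beta),
          projectile_range(v0, math.pi / 4))
```

The first two printed ranges coincide and are smaller than the third, the range for $\alpha = \pi/4$.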


Ex. 33, p. 87. Motion is communicated to the coin by means of friction,
which depends in this instance only upon the normal pressure between the sur-
faces. The frictional force may be taken equal to $\mu mg$, where $\mu$ is the coefficient
of friction. Take the x-axis in the direction of motion, the origin being at the
rim of the glass. Assume the rear edge of the card to be at this point also. Then,
for the coin,

$$x = a + \mu g\,\frac{t^2}{2},$$

where a is the radius of the glass. If v is the velocity of the card, the rear edge
of the card has reached the opposite rim after $T = 2a/v$ seconds. If the coin is
to fall into the glass, it must not move farther than x = 2a in this time, so that

$$a + \frac{\mu g}{2}\left(\frac{2a}{v}\right)^2 \le 2a$$

is the condition to be satisfied.


Ex. 34, p. 87 . No work is done, hence the feeling of fatigue must be due to 
other causes. The system is analogous to an electromagnet carrying a weight. 
No work is done — all the energy consumed appears as heat in the windings. The 
fact that energy is dissipated at all has nothing to do with supporting the weight, 
and if the windings had zero resistance no energy would have to be supplied. 

Ex. 35, p. 88. $U = +kr^2/2$, since $\operatorname{grad} U = +kr\,\mathbf{r}_0 = k\mathbf{r}$.

Ex. 36, p. 89. Take the fixed axis as the z-axis. Then

F = [i(r + \k) = m 


Multiplying vectorially by r , 

% 


Xf i[rk] = m 
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The projection of the directed areas on the plane normal to the £-axis is ob- 
tained by taking the ^-components of the auxiliary vectors. Scalar multiplication 
by k yields 

which proves the theorem. 



Ex. 37, p. 89. If, in equation (9), p. 83, the derivatives with respect to t are 
replaced by those with respect to <j> according to the rule 

d d r 2c d 

dt~ db'V-'r* d$' 


a simple computation leads to 



4c 2 Fd 2 { 1/r) 

r 2 L d ( |> 2 



For r = p/( 1 — e cos <j>) we obtain b r 1/r*. 

Ex. 38, p. 94. In the position of equilibrium x 0 we have 

= 

* 0 i# x 2 

For a displacement e, we have, to a first approximation, 


( 1 ) 


i.e. the total force is 


F, = F«> + s ] 

x o 1 

*p • • • • 

... (2) 

'-(3-3)— S* • • • 

... (3) 

“ 

... (4) 


whence 


m being the mass of the particle. 

Ex. 39, p. 100. The maximum of the square of the amplitude involves the
maximum of the function

$$\frac{1}{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2},$$

which lies at $\omega^2 = \omega_0^2 - \beta^2/2m^2$.

On the other hand, the kinetic energy at the origin is concerned with the
maximum of the function

$$\frac{\omega^2}{m^2(\omega_0^2 - \omega^2)^2 + \beta^2\omega^2}$$

on account of the derivative $\dot{x}$. This maximum is at $\omega = \omega_0$. It is to be observed
that the vibrating system is non-conservative, for the friction dissipates energy;
for this reason the potential energy at the points of reversal, which is propor-
tional to the square of the amplitude, is not equal to the kinetic energy at the
origin. The mean energy is

$$\bar{E} = \frac{m}{4}\,a^2\,(\omega_0^2 + \omega^2),$$

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and this is a maximum if (neglecting small quantities) 


6) ^ <o 0 ^.m 2 10 ° 2 ’ damped* 
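The positions of the two maxima discussed in Ex. 39 can be located numerically. The sketch below is an illustration with arbitrarily assumed values of $m$, $\omega_0$ and $\beta$; the function names and the crude grid search are choices made for this example, not a method from the text.

```python
import math


def amplitude_sq(w: float, m: float, w0: float, beta: float) -> float:
    """Function whose maximum fixes the amplitude resonance."""
    return 1.0 / (m * m * (w0 * w0 - w * w) ** 2 + beta * beta * w * w)


def velocity_sq(w: float, m: float, w0: float, beta: float) -> float:
    """Same function multiplied by w^2, as required for the kinetic energy."""
    return w * w * amplitude_sq(w, m, w0, beta)


def argmax_on_grid(f, lo: float, hi: float, n: int = 20001) -> float:
    """Grid point in [lo, hi] at which f is largest."""
    step = (hi - lo) / (n - 1)
    best_x, best_f = lo, f(lo)
    for k in range(1, n):
        x = lo + k * step
        fx = f(x)
        if fx > best_f:
            best_x, best_f = x, fx
    return best_x


if __name__ == "__main__":
    m, w0, beta = 1.0, 2.0, 0.5
    w_amp = argmax_on_grid(lambda w: amplitude_sq(w, m, w0, beta), 1.0, 3.0)
    w_vel = argmax_on_grid(lambda w: velocity_sq(w, m, w0, beta), 1.0, 3.0)
    print(w_amp, math.sqrt(w0 * w0 - beta * beta / (2.0 * m * m)), w_vel)
```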


* Ex. 40, p. 103. If a quadratic term is added to the elastic restoring force $-kx$
the vibration becomes unsymmetric, since changing the sign of x does not change
that of the quadratic term, so that the force has different values at two points
symmetric with respect to the origin. The differential equation becomes

$$m\frac{d^2 x}{dt^2} + kx + \epsilon x^2 = F_0\cos\omega t. \qquad (1)$$

As a first approximation, we assume

$$x_0 = a\cos\omega t + b. \qquad (2)$$

The constant b allows for the dissymmetry. We insert this expression and de-
termine a and b in such way as to make the constant term and the fundamental
vibration satisfy the differential equation. Using $\cos^2\omega t = \tfrac{1}{2}(1 + \cos 2\omega t)$ we
obtain the two equations

$$b^2 + \frac{k}{\epsilon}\,b + \frac{a^2}{2} = 0 \qquad (3)$$

and

$$a(\omega^2 - \omega_0^2) - \frac{2\epsilon}{m}\,ab + \frac{F_0}{m} = 0. \qquad (4)$$

For $\epsilon$ small, it follows from (3) that

$$b = -\frac{\epsilon a^2}{2k}. \qquad (5)$$

The other solution has $\epsilon$ in the denominator and so is of no consequence, since
we consider only small correction terms. Putting the above value into equation
(4), we obtain a cubic equation quite similar to that for the symmetric case:

$$\frac{\epsilon^2}{mk}\,a^3 + a(\omega^2 - \omega_0^2) + \frac{F_0}{m} = 0. \qquad (6)$$

This may be solved graphically in the same way, and under certain conditions has
three real roots, so that the “skip” phenomenon occurs here also. For further
details consult G. Duffing, Erzwungene Schwingungen bei veränderlicher Eigen-
frequenz und ihre technische Bedeutung, Sammlung Vieweg Nr. 41, 42, Braun-
schweig (1918).

* Ex. 41, p. 107. Let $z_0$ be the height of the point of reversal above the lowest
point. Then at any height z the conservation of energy gives

$$\frac{1}{2}\left(\frac{ds}{dt}\right)^2 = g(z_0 - z) \qquad (1)$$

or

$$\frac{ds}{dt} = \sqrt{2g(z_0 - z)}. \qquad (1')$$

We seek to determine s as a function of z. Putting $s = f(z)$, we have $ds/dt =
f'(z)\,dz/dt$, and we obtain

$$dt = \frac{1}{\sqrt{2g}}\,\frac{f'(z)\,dz}{\sqrt{z_0 - z}}. \qquad (2)$$

Integrating from z = 0 to $z = z_0$ corresponds to a quarter period, so that

$$\frac{T}{4} = \frac{1}{\sqrt{2g}}\int_0^{z_0}\frac{f'(z)\,dz}{\sqrt{z_0 - z}}. \qquad (3)$$

The integral in the right member should be independent of $z_0$. The difficulty is
that this parameter occurs not only in the integrand but also in the upper limit.
To remove this difficulty we set

$$z = yz_0, \qquad (4)$$

whence

$$\frac{T}{4} = \frac{1}{\sqrt{2g}}\int_0^1\frac{\sqrt{z_0}\,f'(yz_0)\,dy}{\sqrt{1 - y}} = \frac{1}{\sqrt{2g}}\int_0^1\frac{\sqrt{yz_0}\,f'(yz_0)\,dy}{\sqrt{(1 - y)\,y}}. \qquad (5)$$

The result is independent of $z_0$ if

$$\sqrt{yz_0}\,f'(yz_0) = \sqrt{z}\,f'(z) = \text{const.} = a. \qquad (6)$$

Integration gives

$$f(z) = s = 2a\sqrt{z}. \qquad (7)$$

This curve is a cycloid, as may be shown readily: by equation (6),

$$\frac{dz}{ds} = \sin\phi = \frac{1}{a}\sqrt{z}; \qquad (8)$$

therefore

$$z = a^2\sin^2\phi = \frac{a^2}{2}(1 - \cos 2\phi); \qquad (9)$$

$$dx = dz\cot\phi = 2a^2\cos^2\phi\,d\phi, \qquad (10)$$

or

$$x = \frac{a^2}{2}(2\phi + \sin 2\phi). \qquad (11)$$

(9) and (11) are the familiar parametric equations of the cycloid. This method
of solution is given by Marcolongo (see reference, p. 869).
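The independence of the period from the starting height $z_0$ can be checked by evaluating the quarter-period integral numerically. The sketch below is an illustration only: it assumes the substitution $z = z_0\sin^2 u$ (not used in the text) to remove the singularity at the upper limit, and the function name `quarter_period` and the sample constants are hypothetical. For comparison it also treats a straight incline, $s = cz$, for which the period does depend on $z_0$.

```python
import math


def quarter_period(fprime, z0: float, g: float = 9.81, n: int = 20000) -> float:
    """T/4 = (1/sqrt(2g)) * integral_0^z0 f'(z) dz / sqrt(z0 - z).

    With z = z0 sin^2(u) the integral becomes
    integral_0^{pi/2} f'(z) * 2 sqrt(z0) sin(u) du, evaluated by the midpoint rule.
    """
    h = (math.pi / 2.0) / n
    total = 0.0
    for k in range(n):
        u = (k + 0.5) * h
        z = z0 * math.sin(u) ** 2
        total += fprime(z) * 2.0 * math.sqrt(z0) * math.sin(u)
    return total * h / math.sqrt(2.0 * g)


def cycloid_fprime(z: float, a: float = 1.0) -> float:
    """f'(z) = a / sqrt(z), from f(z) = 2 a sqrt(z)."""
    return a / math.sqrt(z)


def incline_fprime(z: float, c: float = 2.0) -> float:
    """f'(z) = c for a straight incline s = c z."""
    return c


if __name__ == "__main__":
    for z0 in (0.1, 0.4):
        print(z0, quarter_period(cycloid_fprime, z0), quarter_period(incline_fprime, z0))
```

For the cycloid the integral reduces to $\pi a/\sqrt{2g}$ for every $z_0$; for the incline it grows as $\sqrt{z_0}$.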

Ex. 42, p. 107. Denoting the coefficient of friction by $\mu$, the condition for
equilibrium on the horizontal plane is $mv_1^2/\rho_1 = \mu mg$, while on the cylindrical
surface $mg = \mu mv_2^2/\rho_2$. Elimination of $\mu$ yields

$$v_2 = \frac{g}{v_1}\sqrt{\rho_1\rho_2}.$$

Substituting the numbers, $v_2 = 45$ km./hr.

Ex. 43, p. 110. Since the motion is such that the common centre of gravity
remains at rest, we locate the origin there. Let $m_2$ be on the positive side, $m_1$ on the
negative. At any instant we have

$$m_1x_1 + m_2x_2 = 0, \qquad (1)$$

$$x_2 - x_1 = a + \xi. \qquad (2)$$

The equation of motion for $m_2$ is

$$m_2\frac{d^2x_2}{dt^2} + k(x_2 - x_1 - a) = 0. \qquad (3)$$

Eliminating $x_1$ by using equation (1), we obtain

$$\frac{d^2x_2}{dt^2} + k\left(\frac{1}{m_1} + \frac{1}{m_2}\right)x_2 = \frac{ka}{m_2}. \qquad (4)$$

If, now, we put

$$\frac{1}{\mu} = \frac{1}{m_1} + \frac{1}{m_2}, \qquad (5)$$

we obtain the ordinary equation of vibration

$$\frac{d^2x_2}{dt^2} + \frac{k}{\mu}\,x_2 = \frac{ka}{m_2}, \qquad (6)$$

but with a constant term which gives the rest position of $m_2$, since a particular
integral is

$$x_2^{(0)} = \frac{\mu a}{m_2} = \frac{m_1}{m_1 + m_2}\,a. \qquad (7)$$

The general integral is

$$x_2 = x_2^{(0)} + A\sin(\omega_0 t - \delta), \qquad \omega_0 = \sqrt{k/\mu}. \qquad (8)$$

Thus, in place of the mass m we have the “reduced mass” $\mu$.

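Ex. 44, p. 113. Since the kinetic energy of rotation about an axis is given by
$T_{rot} = \tfrac{1}{2}\dot{\phi}^2\sum m_i d_i^2$,
where $d_i$ is the distance of $m_i$ from the axis, the first quantity to be computed
is the moment of inertia $I = \sum m_i d_i^2$ for the sphere. In spherical co-ordinates,
$d = r\sin\theta$, so that

$$I = \rho\int_0^{2\pi}\!\!\int_0^{\pi}\!\!\int_0^{r} r^4\sin^3\theta\,dr\,d\theta\,d\phi = \frac{8\pi\rho r^5}{15} = \frac{2}{5}Mr^2 \qquad (1)$$

and

$$T_{rot} = \frac{1}{5}Mr^2\dot{\phi}^2; \qquad (2)$$

also,

$$T_{trans} = \frac{M}{2}\left(\frac{ds}{dt}\right)^2. \qquad (3)$$

For rolling without slipping, we have the relation

$$ds = r\,d\phi; \qquad (4)$$

then, by the conservation of energy,

$$T = \left(\frac{M}{5} + \frac{M}{2}\right)\left(\frac{ds}{dt}\right)^2 = Mgs\sin\alpha \qquad (5)$$

and so

$$\left(\frac{ds}{dt}\right)^2 = \frac{10}{7}\,gs\sin\alpha. \qquad (6)$$

If the body were to slide without rotation, the factor would be 2 in place of 10/7.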

Ex. 45, p. 113. Designating the velocities before impact by u, after impact
by v, we have for elastic collision

$$m_1u_1 + m_2u_2 = m_1v_1 + m_2v_2 \quad\text{(Conservation of Linear Momentum)} \qquad (1)$$

$$m_1u_1^2 + m_2u_2^2 = m_1v_1^2 + m_2v_2^2 \quad\text{(Conservation of Energy)} \qquad (2)$$

from which it follows that

$$v_1 = \frac{(m_1 - m_2)u_1 + 2m_2u_2}{m_1 + m_2}, \qquad v_2 = \frac{(m_2 - m_1)u_2 + 2m_1u_1}{m_1 + m_2}. \qquad (3)$$

For inelastic impact, the energy equation is

$$m_1u_1^2 + m_2u_2^2 = m_1v_1^2 + m_2v_2^2 + \epsilon; \qquad (2')$$

then

$$v_1 = \frac{u_1 + \dfrac{m_2}{m_1}u_2 - \dfrac{m_2}{m_1}\sqrt{(u_1 - u_2)^2 - \epsilon\left(\dfrac{1}{m_1} + \dfrac{1}{m_2}\right)}}{1 + \dfrac{m_2}{m_1}},$$

$$v_2 = \frac{u_1 + \dfrac{m_2}{m_1}u_2 + \sqrt{(u_1 - u_2)^2 - \epsilon\left(\dfrac{1}{m_1} + \dfrac{1}{m_2}\right)}}{1 + \dfrac{m_2}{m_1}}. \qquad (3')$$
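The collision formulas can be verified by substituting numbers into the two conservation equations. The sketch below is a hedged illustration: it writes the inelastic result in the standard form that follows from (1) and (2'), assumes $u_1 > u_2$ so that the first body overtakes the second, and uses function names and sample values that are not from the text. Note that, as in (2'), $\epsilon$ is measured in the same units as $mu^2$, i.e. it is twice the kinetic energy lost.

```python
import math


def elastic(m1, m2, u1, u2):
    """Velocities after a perfectly elastic central impact."""
    v1 = ((m1 - m2) * u1 + 2.0 * m2 * u2) / (m1 + m2)
    v2 = ((m2 - m1) * u2 + 2.0 * m1 * u1) / (m1 + m2)
    return v1, v2


def inelastic(m1, m2, u1, u2, eps):
    """Velocities when m1 u1^2 + m2 u2^2 exceeds the final value by eps (u1 > u2 assumed)."""
    ratio = m2 / m1
    root = math.sqrt((u1 - u2) ** 2 - eps * (1.0 / m1 + 1.0 / m2))
    v1 = (u1 + ratio * u2 - ratio * root) / (1.0 + ratio)
    v2 = (u1 + ratio * u2 + root) / (1.0 + ratio)
    return v1, v2


if __name__ == "__main__":
    m1, m2, u1, u2 = 2.0, 3.0, 5.0, 1.0
    print(elastic(m1, m2, u1, u2))
    print(inelastic(m1, m2, u1, u2, 6.0))
```

Setting `eps` to zero reproduces the elastic result, which is a useful consistency check on the signs of the square roots.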


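Ex. 46, p. 121. We have

$$ds^2 = R^2(d\theta^2 + \sin^2\theta\,d\phi^2),$$

hence

$$T = \frac{mR^2}{2}\left(\dot{\theta}^2 + \sin^2\theta\,\dot{\phi}^2\right),$$

and the Lagrange equations for conservative forces become

$$mR^2\ddot{\theta} - mR^2\sin\theta\cos\theta\,\dot{\phi}^2 + \frac{\partial U}{\partial\theta} = 0,$$

$$mR^2\sin^2\theta\,\ddot{\phi} + 2mR^2\sin\theta\cos\theta\,\dot{\theta}\dot{\phi} + \frac{\partial U}{\partial\phi} = 0.$$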

fhA^'^ 7 ’ ft,? 46 ;. The i!? 0 ° f aotion for each force and for the resultant R is in 

on A n^K 1C p ^ force P 01 ^ 011 degenerates into a single vertical line 

of length R From an arbitrary pole O draw lines to the ends of this line These 

to the F <u ^d F <.o in fig. 5, p. 146. Through any rtuft on 

P araUel Fo.i and F« i0 , cutting the verticals drawn through the 
ends of the beam at pomts A and B. Through O draw a line parallel to AJB out? 

resultant* and then pl^d^aboT' “ ^ tot ** thoir 

a sphere; "hen” reo“ ££?££ “ S 

«££ Z^ihe^T Stt 


r +aj2 _+a/2 + a /2 

t J-*L*LnP + *>***- 


Ma' 
6 


parallel 
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the centre of gravity, comparison of the period of the 
of the compound pendulum yields 


simple pendulum 


with that 


- = 

g Msg 
s 2 - U + v? = 0. 


This is a quadratic equation for s, for a fixed value of l. The sum of the roots is 

8i+ s 2 = l. 


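Ex. 50, p. 151. The period of a compound pendulum whose centre of gravity
is at a distance s from the point of suspension is

$$T = 2\pi\sqrt{\frac{I + Ms^2}{Msg}},$$

where I is the moment of inertia about the axis through the centre of gravity
and M is the mass. Putting $I/M = a$, we have

$$T = 2\pi\sqrt{\frac{1}{g}\left(\frac{a}{s} + s\right)},$$

whence

$$\frac{dT}{ds} = \frac{\pi}{\sqrt{g\left(\dfrac{a}{s} + s\right)}}\left(1 - \frac{a}{s^2}\right).$$

This expression vanishes when $a/s^2 = 1$, i.e. when $I = Ms^2$. Comparing the
value

$$T = 2\pi\sqrt{\frac{2s}{g}}$$

for this case with the formula for the simple pendulum

$$T = 2\pi\sqrt{\frac{l}{g}}$$

we see that the equivalent length l must equal 2s.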


Ex. 51, p. 151. 

T = o 3 / + p/v 2 (1) 

/ = a l 2 M 1 + a 2 z M z , (2) 


where a x and a 2 are the respective distances of M x and M 2 from the centre of 
gravity. Then 

M 2 M,a , 0 . 

Cl " M x + ~M 2 a ’ ° 2 ~ M x + M 2 (3) 

I = tm 2 , (4) 

I = ± . J_. 

p. M x ~ M 2 

Ex. 52, p. 160. According to fig. 8, p. 156, the angle between k and k' is equal
to $(\phi - \delta)$. Then, from the magnitudes of the product vectors of equation (54),
p. 160, we have $mga\sin\delta = P\omega\sin(\phi - \delta)$. Solved for $\delta$, this is

$$\tan\delta = \frac{\sin\phi}{\dfrac{mga}{P\omega} + \cos\phi}.$$


* Ex. 53, p. 160. The deviation of projectiles is a very complicated problem.
It is treated in detail by O. v. Eberhard in Auerbach-Hort, Handbuch der physik-
alischen und technischen Mechanik, Vol. 2. The principal effects are due to gyro-
scopic action. For projectiles of usual form, the effect of the air resistance is to
raise the front of the shell, i.e. there is a torque about a horizontal axis normal to
the axis of the projectile. Since the axis of symmetry is the principal axis of
inertia in this case, we see from the relation

$$\frac{d\mathbf{P}}{dt} = \frac{d_r\mathbf{P}}{dt} + [\boldsymbol{\omega}\mathbf{P}]$$

that the change in P, and hence the change in the position of the axis in space,
has the direction of the horizontal axis normal to the axis of the body. For
right-handed spin, the deviation is toward the right when looking in the direction
of advance of the projectile. At this stage the projectile is inclined to the trajec-
tory and the forces exerted on it by its motion through the air make it veer to
the right. However, after half a precessional cycle the body is inclined toward
the left. It is often stated that the precession is so slow that the projectile has
reached its goal before half a precession cycle is completed. This is not true.
Actually the path of the tip of the projectile is not a circle; because of the changing
inclination of its axis and the resulting shift in the point of application of the
air resistance force, the point describes a type of cycloid which is always to the
right of the tangent to the trajectory. The purely hydrodynamic Magnus Effect
(p. 211) operates in the contrary direction, but is small compared with the gyro-
scopic effect.


Ex. 54, p. 167.


Ar _ Vr " a - v>» 
r ~~ Vr* 


so that 

A r du m 

r 


+ + %) “ iai + ^** +i (% + 1)0 


Ex. 55, p. 179.



The moment of inertia of the cross-section is



The depression of the middle is then given by equation (66) (p. 178): 

h — Q » 3 




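Ex. 56, p. 179. Take the origin at the uppermost point of the surface. At this
point we have

$$\frac{\partial^2 z}{\partial x^2} = \frac{1}{\rho}, \qquad (1)$$

$$\frac{\partial^2 z}{\partial y^2} = -\frac{1}{\rho'} = -\frac{\sigma}{\rho}, \qquad (2)$$

$$\frac{\partial z}{\partial x} = \frac{\partial z}{\partial y} = z = 0. \qquad (3)$$

Integrating equations (1) and (2), and using equation (3) yields the equation of
the surface:

$$z = \frac{x^2}{2\rho} - \frac{\sigma y^2}{2\rho}. \qquad (4)$$

The contour lines z = const. are hyperbolas whose asymptotes form an angle $\alpha$
with the x-axis. This angle is given by

$$\tan^2\alpha = \frac{1}{\sigma}. \qquad (5)$$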

Ex. 57, p. 180. Of an elementary hollow cylinder of radii $r$ and $r + dr$, con-
sider an element included between the polar angles $\theta$ and $\theta + d\theta$. In the process
of torsion, the upper face of this element is displaced a distance $r\phi$ with respect
to the lower face, which is at a distance l from it. Then, by p. 173, there will
be a restoring force of amount $d^2F = S\,r\,dr\,d\theta\,r\phi/l$, whose moment is $d^2M =
r\,d^2F = Sr^3\phi\,dr\,d\theta/l$. Integrating over the entire ring, we have $dM = 2\pi Sr^3\,dr\,\phi/l$,
and integrating over all layers from r = 0 to r = a yields $M = (\pi Sa^4/2l)\,\phi = \tau\phi$.
$\tau$ is called the moment of torsion of the wire.

Ex. 58, p. 190. Since z must vanish at the boundary at all times, the par-
ticular integral is of the form

. mnx . nny „ • * 

z = c mn sm sm e 2 *wm»i* (1) 

The differential equation is satisfied if 

•-.-i B > 

where $\rho$ is the density of the material of which the membrane is made. Equation
(2) thus gives the possible natural frequencies. As in the case of the string, we 
write the solution, which must satisfy the initial conditions, as the sum (in this 
case, double sum) of particular integrals: 

00 . mnx . nny 

z = Z> X" A mn sm sm cos27rv nm f 

0 0 a o 

x V. mux . mvy . _ /n . 

+ B mn sin sm -jf- srn27rv nm < (3) 

ou a o 

The coefficients $A_{mn}$ are obtained by the method given in the text:

4 r a r h , , v . mnx . nny 

= ab £ — .m-j-** 

2 r a r b . mnx . nny , , 

L J 0 '•'<*■ y) 9m ^ 8m V 
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where f 0 (z, y) represents the initial form of the membrane, and f 0 '(x, y) gives the 
initial distribution of velocities. 


Ex. 59, p. 194. The force acting on an element of surface dS whose normal
is directed outward is $d\mathbf{F} = -p\,d\mathbf{S}$. The minus sign is due to our convention
regarding the sign of p. The resultant is

$$\mathbf{F} = -\oint p\,d\mathbf{S} = -\int \operatorname{grad} p\,d\tau. \qquad (1)$$

If $\rho\mathbf{g}$ is the vector gravitational force acting on 1 cm.³, we have by equation (1a)
(p. 192),

$$\mathbf{F} = -\int \rho\mathbf{g}\,d\tau = -\mathbf{g}\int\rho\,d\tau. \qquad (2)$$


Ex. 60, p. 197. Take the z-axis vertically downward. If the cross-section of
the vessel is large and the orifice small, the flow may be considered stationary.
Since the only velocity component is one in the direction of the z-axis, we have
by equation (10a) (p. 196),

$$v\frac{dv}{dz} = g - \frac{1}{\rho}\frac{dp}{dz},$$

whence, by integration,

$$\frac{v^2}{2} = gz - \frac{p}{\rho} + C.$$

According to our conditions, we may take v = 0 in the vessel itself and may
assume that p is equal to the hydrostatic pressure $\rho gz$. Then C = 0. At z = h,
i.e. at the orifice, p vanishes, so that

$$v_h = \sqrt{2gh}.$$

This is Torricelli’s Theorem, according to which the fluid emerges with a velocity
equal to that which it would attain by free fall through a distance h.


Ex. 61, p. 197. Taking the x-axis in the direction of flow, we have

$$v\frac{dv}{dx} = -\frac{1}{\rho}\frac{dp}{dx} \qquad (1)$$

and

$$\frac{p}{\rho} = \frac{p_1}{\rho_1}. \qquad (2)$$

Hence

$$v\frac{dv}{dx} = -\frac{p_1}{\rho_1}\,\frac{1}{p}\,\frac{dp}{dx}. \qquad (3)$$

Integrating,

$$\frac{v^2}{2} = -\frac{p_1}{\rho_1}\log p + C. \qquad (4)$$

We may set v = 0 within the vessel; this determines

$$C = \frac{p_1}{\rho_1}\log p_1.$$

Then the velocity of efflux becomes

$$v = \sqrt{\frac{2p_1}{\rho_1}\log\frac{p_1}{p_0}}. \qquad (5)$$

For another gas under the same conditions of pressure,

$$v' = \sqrt{\frac{2p_1}{\rho_1'}\log\frac{p_1}{p_0}}. \qquad (5')$$

Hence

$$\frac{v}{v'} = \sqrt{\frac{\rho_1'}{\rho_1}}.$$

This relation is used to determine the density of a gas in the instrument known
as Bunsen’s Effusometer.


Ex. 62, p. 213. For the case of water the difference between adiabatic and
isothermal compressibility may be ignored. We thus have

$$\rho = \rho_0(1 + \kappa p) = \rho_0(1 + \sigma), \qquad (1)$$

$$\operatorname{grad} p = \frac{1}{\kappa}\operatorname{grad}\sigma, \qquad (2)$$

whence, as in the text, we obtain the wave equation

$$\frac{\partial^2\sigma}{\partial t^2} = \frac{1}{\kappa\rho_0}\,\Delta\sigma,$$

so that

$$v = \frac{1}{\sqrt{\kappa\rho_0}}.$$

To obtain v in cm./sec. we must convert $\kappa$ to (dynes/cm.²)⁻¹. We have $\kappa =
50/(1.013 \times 10^{12})$, hence

v = 1423 m./sec.

The two compressibilities differ considerably for some liquids. For example,
ethyl bromide has $c_p/c_v = 1.87$.
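The unit conversion in Ex. 62 is easy to get wrong, so a short numerical check may help. The sketch below assumes a compressibility of $50 \times 10^{-6}$ per atmosphere and a density of 1 g/cm.³ for water; the first figure is an assumption chosen because it reproduces the quoted 1423 m./sec., and the function name is hypothetical.

```python
import math

DYNES_PER_CM2_PER_ATM = 1.013e6   # one atmosphere in dynes/cm^2


def sound_speed_cm_per_s(kappa_per_atm: float, density_g_per_cm3: float) -> float:
    """v = 1 / sqrt(kappa * rho), with kappa converted to (dynes/cm^2)^-1."""
    kappa_cgs = kappa_per_atm / DYNES_PER_CM2_PER_ATM
    return 1.0 / math.sqrt(kappa_cgs * density_g_per_cm3)


if __name__ == "__main__":
    v = sound_speed_cm_per_s(50e-6, 1.0)
    print(v / 100.0, "m/sec")
```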


* Ex. 63, p. 227. We start from equation (115) (p. 226). For the case of 
rotational symmetry, this equation becomes, after putting <I> = gz, 



(the negative sign is to be taken for a concave surface). If 2 is measured from the 
surface of the liquid in a large vessel, X must be zero, since dzjdr and d^z/dr 2 
vanish. Remembering that the height of rise will be large compared with the 
dimensions of the meniscus, we may, approximately, take 

2 = Zo -4- C ( 2 ) 


and neglect £ in comparison with z 0 . Then 


a d 
r dr 



90 z o 


Since d^jdr = 0 for t =0, integration yields 


dK 

dr 

w? 


PW 
2a ' 



(4) 
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The expression on the left is the value of cos 0 for r 
(cos 0 = I), 2a 


= r 0 ; in the present case, then. 


z o = 


PfPo 


(5) 


where $z_0$ is the mean height of the column. The same result is obtained more
simply by noting that, since the liquid wets the tube completely, the surface
film supports the weight of the ascended liquid. The total upward force at the
film boundary is $2\pi r_0\alpha$, while the gravitational force on the liquid in the tube
is $\pi\rho r_0^2 z_0 g$. Setting these quantities equal, we obtain (5).


Ex. 64, p. 227. The pressure is the same on both sides of a film, hence we may
set p = 0 in equation (110) (p. 224). But the resulting equation,

$$\frac{1}{R_1} + \frac{1}{R_2} = 0,$$

defines a family of minimal surfaces, i.e. surfaces having the smallest area for
a given boundary curve. The rotational surface of this family, which is sought
here, is the catenoid, a surface obtained by rotating the catenary $y = \cosh x$
about the x-axis.


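Ex. 65, p. 235. According to the results of p. 106, a pendulum swinging with
small amplitude behaves like an oscillator with a "quasi-elastic binding
constant" $k = mg/l$. By equation (18') (p. 234) the equations of motion are

$$m\ddot{x}' + kx' = 2m\omega\sin\varphi\,\dot{y}' \tag{1}$$

and

$$m\ddot{y}' + ky' = -2m\omega\sin\varphi\,\dot{x}', \tag{2}$$

which may be combined into one equation for the complex variable $(x' + iy')$:

$$\frac{d^2}{dt^2}(x' + iy') + \frac{g}{l}(x' + iy') = -2i\omega\sin\varphi\,\frac{d}{dt}(x' + iy'). \tag{3}$$

We now introduce a system of co-ordinates $\xi'$, $\eta'$ which turns with respect to the $x'$, $y'$ system
with the angular velocity $\omega\sin\varphi$ in a sense opposite to the rotation of the earth.
Then we have

$$x' + iy' = (\xi' + i\eta')\,e^{-i\omega\sin\varphi\,t},$$

$$\frac{d}{dt}(x' + iy') = e^{-i\omega\sin\varphi\,t}\left[\frac{d}{dt}(\xi' + i\eta') - i\omega\sin\varphi\,(\xi' + i\eta')\right],$$

$$\frac{d^2}{dt^2}(x' + iy') = e^{-i\omega\sin\varphi\,t}\left[\frac{d^2}{dt^2}(\xi' + i\eta') - 2i\omega\sin\varphi\,\frac{d}{dt}(\xi' + i\eta') - \omega^2\sin^2\varphi\,(\xi' + i\eta')\right]. \tag{4}$$

Neglecting $\omega^2$, substitution in (3) gives

$$\frac{d^2}{dt^2}(\xi' + i\eta') + \frac{g}{l}(\xi' + i\eta') = 0. \tag{5}$$

Thus in the rotating system the pendulum executes a simple plane vibration; its plane of vibration, viewed from the earth, appears to turn from east to west with the angular velocity $\omega\sin\varphi$.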
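Ex. 66, p. 235. If the fundamental equation (7), p. 110, is resolved into
components along the system of principal axes of inertia fixed in the body, it
must be remembered that

$$\frac{d\mathbf{P}}{dt} = \frac{d_r\mathbf{P}}{dt} + [\boldsymbol{\omega}\mathbf{P}] \quad\text{and}\quad \frac{d\boldsymbol{\omega}}{dt} = \frac{d_r\boldsymbol{\omega}}{dt}.$$

However, in this system $P_{x'} = I_{\rm I}\,\omega_{x'}$, $P_{y'} = I_{\rm II}\,\omega_{y'}$, $P_{z'} = I_{\rm III}\,\omega_{z'}$, so that

$$M_{x'} = I_{\rm I}\frac{d\omega_{x'}}{dt} - (I_{\rm II} - I_{\rm III})\,\omega_{y'}\omega_{z'},$$

$$M_{y'} = I_{\rm II}\frac{d\omega_{y'}}{dt} - (I_{\rm III} - I_{\rm I})\,\omega_{z'}\omega_{x'},$$

$$M_{z'} = I_{\rm III}\frac{d\omega_{z'}}{dt} - (I_{\rm I} - I_{\rm II})\,\omega_{x'}\omega_{y'}.$$

These are known as the Euler equations. If M = 0 they are satisfied if

$\omega_{x'}$ = const., $\omega_{y'} = \omega_{z'} = 0$,
or $\omega_{y'}$ = const., $\omega_{x'} = \omega_{z'} = 0$,

or $\omega_{z'}$ = const., $\omega_{x'} = \omega_{y'} = 0$;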

that is, in the absence of any torque a rigid body can continue to rotate about 
one of the principal axes. Stability considerations show that rotation about 
either the axis of greatest or of least inertia will be stable, while that about the 
middle one will be unstable. 

Ex. 67, p. 258. The energy radiated in one year is

$4\pi \times (150)^2 \times 10^{22} \times 2 \times 4.19 \times 10^{7} \times 60 \times 24 \times 365$ ergs;

dividing by $c^2 = 9 \times 10^{20}$ cm.$^2$/sec.$^2$, the decrease in mass is determined to be
$138 \times 10^{12}$ metric tons per year.
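The product above is easily evaluated by machine. The sketch below uses only the factors that appear in the expression (a distance of 150 × 10¹¹ cm., a solar constant of 2 cal./cm.² min., 4.19 × 10⁷ ergs per calorie); the function name is illustrative.

```python
import math

def solar_mass_loss_tons_per_year(distance_cm=150e11, solar_constant_cal=2.0,
                                  erg_per_cal=4.19e7, c=3.0e10):
    """Mass equivalent (metric tons) of the energy radiated by the sun in one year."""
    minutes_per_year = 60 * 24 * 365
    area = 4.0 * math.pi * distance_cm**2            # sphere through the earth, cm^2
    energy = area * solar_constant_cal * erg_per_cal * minutes_per_year  # ergs
    mass_grams = energy / c**2
    return mass_grams / 1.0e6                        # 1 metric ton = 10^6 g

tons = solar_mass_loss_tons_per_year()
print(f"{tons / 1e12:.0f} x 10^12 metric tons per year")
```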

Ex. 68, p. 269. No, for a field of this kind is not irrotational; cf. fig. 19, p. 20.

Ex. 69, p. 269.

$\sigma = -2.65 \times 10^{-4}$ e.s.u. $= -8.83 \times 10^{-14}$ coul./cm.$^2$
$\rho = 1.32 \times 10^{-9}$ e.s.u. $= 4.40 \times 10^{-19}$ coul./cm.$^3$

Ex. 70, p. 269. Graphical differentiation of the curve yields $-E$; a second
differentiation furnishes the value of $-4\pi\rho$. The differentiation is most simply
performed with the aid of a mirror rule, which may be set very accurately per- 
pendicular to the curve simply by observing when the reflection is a smooth 
continuation of the curve itself. The angle which the rule makes with the y-axis 
is equal to the slope of the tangent. 

Ex. 71, p. 270. If the two plates are a distance d apart and if they have a 
difference of potential V, the uniform field between them will be of strength 
V/d. Apart from the slight non-uniformity at the edges, the external field is 
everywhere zero. This may be looked upon as the mutual cancellation of the 
oppositely directed lines of force from the two plates. Between the plates, on 
the other hand, the number of lines is doubled. According to the footnote on 
p. 2G(), the force exerted by one charged plate on the other is thus given by 
$F = Ve/2d$, and since $C = e/V = S/4\pi d$, we have $F = 2\pi e^2/S$ or $F = V^2S/8\pi d^2$.
Ex. 72, p. 276. Since the situation is independent of the polar angle and of
the z-co-ordinate, we have, in cylindrical co-ordinates,

$$\frac{1}{r}\frac{d}{dr}\left(r\frac{dV}{dr}\right) = 0. \tag{1}$$

The solution is

$$V = A\log r + B, \tag{2}$$

whence

$$V_a = A\log a + B, \qquad V_b = A\log b + B, \tag{3}$$

from which it follows for $V_b = 0$ that

$$V = V_a\,\frac{\log (r/b)}{\log (a/b)}. \tag{4}$$

The field is then given by

$$E = -\frac{dV}{dr} = -\frac{V_a}{r\log (a/b)} = \frac{V_a}{r\log (b/a)} \tag{5}$$

and the charge e per unit length by

$$\oint E\,dS = 4\pi e = \frac{2\pi V_a}{\log (b/a)}. \tag{6}$$

Thus the capacity is

$$C = \frac{1}{2\log (b/a)}.$$

The solution for the parallel plate condenser is

$$V = Ax + B, \qquad E = \frac{V}{d}, \qquad\text{hence}\qquad C = \frac{S}{4\pi d}. \tag{7}$$

The distortion of the field at the edges is neglected.

Ex. 73, p. 276. Call the radius of the sphere $a$ and let the image point whose
charge is $-e'$ be a distance $R'$ from the centre of the sphere. Then

$$V = \frac{e}{r} - \frac{e'}{r'}. \tag{1}$$

The value of V must be zero on the sphere, i.e.

$$\frac{e}{r} = \frac{e'}{r'}. \tag{2}$$

Introducing the angle $\theta$ between a radius of the sphere and the line joining the
external charge and the centre, we have

$$\frac{e'^2}{e^2} = \frac{r'^2}{r^2} = \frac{R'^2 + a^2 - 2aR'\cos\theta}{R^2 + a^2 - 2aR\cos\theta} = \frac{R'}{R}\cdot\frac{R' + \dfrac{a^2}{R'} - 2a\cos\theta}{R + \dfrac{a^2}{R} - 2a\cos\theta}. \tag{3}$$

This relation must hold for every angle $\theta$; this is true if

$$a^2 = R'R, \qquad \frac{e'^2}{e^2} = \frac{R'}{R}, \tag{4}$$

which determines the position and charge of the image.

* Ex. 74, p. 276. Taking the origin of co-ordinates at the point under
consideration, the power series development of V at this point is

$$V = V_0 + a_1x + a_2y + a_3z + a_{11}x^2 + a_{22}y^2 + a_{33}z^2 + 2a_{12}xy + 2a_{23}yz + 2a_{31}zx + \ldots \tag{1}$$

The product terms may always be made to vanish by rotating the axes; in this
system we have

$$V = V_0 + b_1x + b_2y + b_3z + b_{11}x^2 + b_{22}y^2 + b_{33}z^2 + \ldots \tag{2}$$

The components of field strength are given by

$$E_x = -\frac{\partial V}{\partial x} = -(b_1 + 2b_{11}x + \ldots), \quad E_y = -(b_2 + 2b_{22}y + \ldots), \quad E_z = -(b_3 + 2b_{33}z + \ldots).$$

These all vanish at the singular point. In place of the equation of the tangent
plane at an ordinary point, we have here the equation of the tangent cone

$$b_{11}\xi^2 + b_{22}\eta^2 + b_{33}\zeta^2 = 0,$$

where $\xi$, $\eta$, $\zeta$ are the running co-ordinates. The nature of the singularity depends
upon whether the cone is real or imaginary. Now the potential equation

$$\Delta V = 2(b_{11} + b_{22} + b_{33}) = 0, \quad\text{i.e.}\quad b_{33} = -(b_{11} + b_{22}),$$

always obtains. This guarantees the reality of the cone and the only possible
singularity is of the nature of a conical point (corresponding to a real double
point in plane geometry). For the case of rotational symmetry the angle of this
cone is determined. If $b_{22} = b_{11}$, we have $b_{33} = -2b_{11}$, and the half-angle of
the cone is given by

$$\tan\alpha = \sqrt{2}.$$


This condition must be satisfied for the tangents to the potential curves in the 
meridional section at the “ abaric point ”, e.g. between the earth and the moon 
or between two point charges. 


Ex. 75, p. 284. The case of a uniformly polarized sphere was treated in the
text. The result was an additional field $-4\pi P/3$ due to the free charges on the
surface of the sphere. In the present instance the homogeneous polarization is
present outside the sphere, hence div P has the opposite sign, and so the free
surface charges are equal and opposite to those arising in the case of the uniformly
polarized sphere. Thus their field is given by

$$E = +\frac{4\pi P}{3}.$$


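Ex. 76, p. 284. At any point, $D_r = e/r^2$; moreover, for $a < r < c$, $E_r = e/K_1r^2$, and for $c < r < b$, $E_r = e/K_2r^2$,
i.e.

$$V_a - V_b = \frac{e}{K_1}\left(\frac{1}{a} - \frac{1}{c}\right) + \frac{e}{K_2}\left(\frac{1}{c} - \frac{1}{b}\right),$$

and since

$$C = \frac{e}{V_a - V_b},$$

we have

$$\frac{1}{C} = \frac{1}{K_1a} - \frac{1}{K_2b} + \frac{1}{c}\left(\frac{1}{K_2} - \frac{1}{K_1}\right).$$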


Ex. 77, p. 284. We give the image a charge $-e'$. For the external space we
may set

$$V_1 = \frac{e}{r} - \frac{e'}{r'}$$

without contradicting the relation $\oint E\,dS = 4\pi e$, since $e'$ is not in this region,
and a closed surface in this space does not enclose the charge which is really
non-existent. But within the dielectric we must put

$$V_2 = \frac{e''}{Kr}.$$

The quantities $e'$ and $e''$ are at our disposal for satisfying the boundary
conditions.

The prescribed discontinuity of the normal component yields $e + e' - e'' = 0$.
The continuity of the tangential component is certainly satisfied if $V_1 = V_2$
at all points on the surface of separation, for if we form the derivative $dV/dl$
along this surface, it will have the same value on both sides if $V_1 = V_2$. This
relation furnishes the second equation connecting $e'$ and $e''$:

$$e - e' = \frac{e''}{K},$$

whence

$$e' = e\,\frac{K - 1}{K + 1} \quad\text{and}\quad e'' = \frac{2e}{1 + 1/K}.$$

Ex. 78, p. 286. The force acting on the charge $-e$ is $-eE$, that on the charge
$+e$ is $e(E + ds\ \text{grad}\cdot E)$; hence the force acting on the dipole is

$F = m\ \text{grad}\cdot E$.


Ex. 79, p. 290. The density of surface charge is

$$\sigma = \frac{e}{S} = \frac{C(V_1 - V_2)}{S}$$

and the energy is

$$U = \tfrac{1}{2}\int (V_1 - V_2)\,\sigma\,dS = \tfrac{1}{2}C(V_1 - V_2)^2$$

or, denoting the potential difference by V,

$$U = \tfrac{1}{2}CV^2.$$

If the charge is held constant, $V' = V/K$, $C' = KC$ and $U' = U/K$, so that
^ ^ ~ 1 (ji ~ l) CF* = ~2K~ ^ or a P ar ®Uel-pIate condenser. 


A 1 ~ V d r, I- Sd = T, in conformity with 

(40), is proved. If the potential difference is held constant, U' = KU. 
Ex. 80, p. 290. The value of the potential inside the sphere as given by
equ. (25), p. 272, must be inserted in (31'), p. 287, and integrated from 0 to $a$.
One then obtains

$$U = \frac{16\pi^2\rho^2a^5}{15} = \frac{3}{5}\frac{e^2}{a},$$

where e is the total charge.


Ex. 81, p. 292. Call the distance between the plates z, and measure z from the 
level of the lower plate, which is fixed in position. Then, if z' is the height of the 
counterweight, 


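$$U = U_{\rm el} + U_{\rm mech} = \frac{KSV^2}{8\pi z} + mgz'. \tag{1}$$

For equilibrium and for a constant charge (in this case only is the total energy
given by (1)), i.e. for

$$e = \frac{KSV}{4\pi z} = \text{const., or}\quad -\frac{KSV}{4\pi z^2}\,\delta z + \frac{KS}{4\pi z}\,\delta V = 0, \tag{2}$$

we have

$$\delta U = 0 = \frac{KSV^2}{8\pi z^2}\,\delta z + mg\,\delta z'. \tag{3}$$

Since $\delta z' = -\delta z$, we have, for $z = a$:

$$m = \frac{KSV^2}{8\pi g a^2}. \tag{4}$$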


It appears strange that K occurs in the numerator, while the force of attraction 
between given charges diminished to 1 /K times its former value when a medium 
of dielectric constant replaces one whose value is unity. In this case, however, 
it is not the charge but the potential difference which is held constant. Cf. also 
Exercises 71 and 79. 


Ex. 82, p. 295. If a quantity of electricity de is removed from the condenser 
in a time dt its potential difference changes by 



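$$dV = -\frac{de}{C}. \tag{1}$$

By Ohm's Law,

$$I = \frac{de}{dt} = \frac{V}{R}. \tag{2}$$

Combining,

$$\frac{dV}{dt} = -\frac{V}{CR}. \tag{3}$$

Integrated, this gives

$$\log V = -\frac{t}{CR} + \log V_0 \quad\text{or}\quad V = V_0\,e^{-t/CR}. \tag{4}$$

Hence

$$I = \frac{V_0}{R}\,e^{-t/CR}. \tag{5}$$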


Ex. 83, p. 295. As in the preceding problem, the process of charging the
condenser is found to proceed according to the equation

$$V = V_0(1 - e^{-t/CR}). \tag{1}$$

For $t = T$ we must have $V = V_s$; hence the time of charging (which, for
instantaneous discharge, must be equal to the period of the pulsating current) is
given by

$$T = RC\log\frac{V_0}{V_0 - V_s}.$$
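The charging law and the resulting period may be illustrated numerically. The sketch below assumes the condenser charges from zero according to V = V₀(1 − e^(−t/CR)) and discharges instantaneously at the voltage V_s; the component values are hypothetical and chosen only for illustration.

```python
import math

def charging_voltage(t, R, C, V0):
    """Condenser voltage after charging for a time t from zero: V0*(1 - exp(-t/CR))."""
    return V0 * (1.0 - math.exp(-t / (C * R)))

def charging_period(R, C, V0, Vs):
    """Time T at which the charging voltage reaches Vs: T = RC*log(V0/(V0 - Vs))."""
    if not 0.0 < Vs < V0:
        raise ValueError("Vs must lie between 0 and V0")
    return R * C * math.log(V0 / (V0 - Vs))

# Hypothetical values: R = 1 megohm, C = 1 microfarad, V0 = 200 volts, Vs = 120 volts.
T = charging_period(1.0e6, 1.0e-6, 200.0, 120.0)
print(f"T = {T:.3f} sec")
print(f"V(T) = {charging_voltage(T, 1.0e6, 1.0e-6, 200.0):.1f} volts")
```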


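Ex. 84, p. 300. A uniformly magnetized sphere may be replaced by a magnetic
doublet at its centre. Its potential is

$$V = \frac{\mathbf{m}\cdot\mathbf{r}}{r^3}, \tag{1}$$

whence

$$\mathbf{H} = -\operatorname{grad}V = -\frac{\mathbf{m}}{r^3} + \frac{3(\mathbf{m}\cdot\mathbf{r})\,\mathbf{r}}{r^5}. \tag{2}$$

From this, the radial and tangential components at the latitude $\varphi$ are

$$H_r = -\frac{m\sin\varphi}{r^3} + \frac{3m\sin\varphi}{r^3} = \frac{2m\sin\varphi}{r^3}, \tag{3}$$

$$H_\varphi = -\frac{m\cos\varphi}{r^3}, \tag{4}$$

$$\tan\delta = -2\tan\varphi. \tag{5}$$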


Ex. 85, p. 300. It follows from

$$\oint H\,ds = \frac{4\pi I}{c} = 0.4\pi I \quad (I \text{ in amperes})$$

that a current whose density is $3.9 \times 10^{-12}$ amp./cm.$^2$ flows into the surface of
the earth from above. This current is many times the normal atmospheric
current, and no adequate explanation of its presence has yet been given. Moreover
the reality of the vortical part of the earth's field is doubted.

Ex. 86, p. 305. From considerations of symmetry it follows that the magnetic
lines of force are circles whose centres lie on the axis of the cylinder and whose
planes are normal to this axis. It follows from

$$\oint H\,ds = \frac{4\pi I}{c}$$

that, for a constant current density $i$, inside,

$$2\pi rH = \frac{4\pi\cdot\pi r^2 i}{c}, \quad\text{i.e.}\quad H = \frac{2\pi r i}{c};$$

outside,

$$2\pi rH = \frac{4\pi\cdot\pi R^2 i}{c}, \quad\text{i.e.}\quad H = \frac{2\pi R^2 i}{rc} = \frac{2I}{rc},$$

where $R$ is the radius of the cylinder and $r$ is the distance of the field point from the axis of symmetry.


Ex. 87, p. 310. If the coil has N turns, the torque
acting on it in its normal position is, by
equation (29') (p. 309),

$$M = \frac{NSHI}{c} = qI \qquad \left(q = \frac{NSH}{c}\right). \tag{1}$$

The equation of motion of the coil whose moment of inertia is $J$ and whose
damping constant is $r$ is

$$J\ddot{\varphi} + r\dot{\varphi} + \tau\varphi = qI, \tag{2}$$

where $\tau$ is the moment of torsion of the suspension. Under the assumption that
$\varphi$ does not depart appreciably from zero while the current impulse is applied,
equation (2) may be integrated, giving

$$J\dot{\varphi} = q\int I\,dt, \quad\text{or}\quad \dot{\varphi} = \frac{qe}{J}. \tag{2'}$$

Thus the angular velocity is proportional to the total quantity of electricity $e$
which flows. We then have this value of the initial velocity in the equation

$$J\ddot{\varphi} + r\dot{\varphi} + \tau\varphi = 0 \tag{3}$$

of the free, damped vibration of the coil. The solution, which yields $\varphi = 0$ for
$t = 0$, is

$$\varphi = A\,e^{-rt/2J}\sin\frac{\sqrt{4\tau J - r^2}}{2J}\,t. \tag{4}$$

For $t = 0$ we must have

$$\dot{\varphi} = \frac{qe}{J} = A\,\frac{\sqrt{4\tau J - r^2}}{2J}, \quad\text{i.e.}\quad A = \frac{2qe}{\sqrt{4\tau J - r^2}}. \tag{5}$$

The first maximum deflection will be reached at a time $t_m$ determined by

$$0 = -\frac{r}{2J}\sin\frac{\sqrt{4\tau J - r^2}}{2J}\,t_m + \frac{\sqrt{4\tau J - r^2}}{2J}\cos\frac{\sqrt{4\tau J - r^2}}{2J}\,t_m, \tag{6}$$

or

$$\tan\frac{\sqrt{4\tau J - r^2}}{2J}\,t_m = \frac{\sqrt{4\tau J - r^2}}{r}. \tag{6'}$$

For the usual case of weak damping, $r^2 \ll 4\tau J$, and the right member may be
set approximately equal to $\tan \pi/2$, and we have

$$t_m = \frac{\pi}{2}\sqrt{\frac{J}{\tau}}, \tag{7}$$

whence

$$\varphi_m = \frac{qe}{\sqrt{\tau J}}\,e^{-r\pi/4\sqrt{\tau J}}. \tag{8}$$


It should be noticed that the damping constant $r$ is determined largely by the
induced current resulting from the motion of the coil (cf. Ex. 88). The
"galvanometer constant" C, the number by which the observed deflection must be
multiplied to give the current or quantity of charge, is usually stated. Neglecting
the damping, the ballistic constant $C_b$ is, by the last equation, $\sqrt{\tau J}/q$, while
the constant for steady current is $C_0 = \tau/q$, by equation (1). The two constants
are related approximately by $C_b = C_0\sqrt{J/\tau} = C_0T/2\pi$, where T is the period of
swing.

Ex. 88, p. 313. In turning away from the rest position, in which the normal
to the plane of the coil is perpendicular to H, the change in flux in a medium of
permeability 1 is given by $\partial\Phi/\partial t = NSH\dot{\varphi}$. Hence the induced voltage is
$V_i = -NSH\dot{\varphi}/c = -q\dot{\varphi}$ and the induced current is $I_i = -q\dot{\varphi}/R$. The torque
acting on the current-carrying coil amounts to $M = qI_i = -q^2\dot{\varphi}/R$. The damping
constant $r$ in Exercise 87 thus has the value $q^2/R$.


Ex. 89, p. 316. Cf. p. 664 et seq. 

* Ex. 90, p. 316. 

(a) We have

$$L_{12} = \oint\oint\frac{d\mathbf{s}_1\cdot d\mathbf{s}_2}{r_{12}} = 2\pi a_1a_2\int_0^{2\pi}\frac{\cos\varphi\,d\varphi}{\sqrt{a_1^2 + a_2^2 - 2a_1a_2\cos\varphi + z^2}};$$

in our case the angle between $d\mathbf{s}_1$ and $d\mathbf{s}_2$ is equal to the polar angle $\varphi$.

Maxwell succeeded in referring this integral back to tabulated elliptic integrals.
The formulae for this conversion may be found in Graetz's Handbuch der
Elektrizität und des Magnetismus, Bd. V, p. 65. We confine our considerations to two
circles of almost equal size whose distance apart $z$ is small. Introducing the
largest and smallest distances

$$p^2 = z^2 + (a_1 - a_2)^2 \quad\text{and}\quad q^2 = z^2 + (a_1 + a_2)^2,$$

we have

$$q^2 - p^2 = 4a_1a_2, \qquad r_{12}^2 = p^2 + (q^2 - p^2)\sin^2\frac{\varphi}{2},$$

so that the integral becomes

$$L_{12} = 4\pi a_1a_2\int_0^{\pi}\frac{\cos\varphi\,d\varphi}{\sqrt{p^2 + (q^2 - p^2)\sin^2\dfrac{\varphi}{2}}}.$$


Since, according to our assumptions, $q \gg p$, we can neglect $p$ as compared
with $q$ for larger values of the angle $\varphi$. This is not allowable when $\varphi$ is small,
since $q^2$ is multiplied by $\sin^2(\varphi/2)$. We thus divide the range of the integral into
two parts, one for small values of $\varphi$ and one for the remaining values. Carrying out
the integrations we obtain, since $z \ll a_1$ and $2a_1/q \approx 1$,

$$L_{12} = 4\pi a_1\left(\log\frac{8a_1}{\sqrt{z^2 + (a_1 - a_2)^2}} - 2\right).$$

This formula was given by Maxwell.
(b) On account of the expression

$$\frac{1}{8\pi}\int H^2\,d\tau$$

the field energy is a positive definite quantity. If we use the
quadratic form of equation (13) (p. 316), its discriminant must always be positive,
i.e. $L_{11}L_{22} - L_{12}^2 \geq 0$.

The result may be seen physically by considering the flux which a given coil
causes to pass through itself and through the other coil. The most favourable
case, e.g. when both coils are wound upon the same iron core, gives $L_{11}L_{22} = L_{12}^2$.


Ex. 91, p. 319. The familiar relation for resistances connected in parallel, viz.

$$\frac{1}{R} = \frac{1}{R_1} + \frac{1}{R_2},$$

holds also for complex resistances, i.e. reactances in the complex plane; hence

$$\frac{1}{R} = \frac{1}{i\omega L} + i\omega C, \qquad R = \frac{i\omega L}{1 - \omega^2LC}.$$

Thus for $\omega = 1/\sqrt{LC}$ (resonance) we have $R = \infty$.


Ex. 92, p. 319. Since with increasing field the permeability of iron $\mu = |B|/|H|$
decreases, the self-induction of an iron-cored coil will also decrease with increasing 
current. Hence the natural frequency of the circuit increases as the current 
grows. The details are given by Schunk and Zenneck, Jahrbuch der drahtlosen 
Telegraphie , 19, p. 170 (1922). 


Ex. 93, p. 332. From $S = c[EH]/4\pi$ it follows that the mean energy flux in
a vacuum ($|E| = |H|$) is

$$\bar{S} = \frac{c}{8\pi}E_0^2 = \frac{2 \times 4.19 \times 10^7}{60}\ \text{ergs/cm.}^2\ \text{sec.},$$

whence

$$E_0 \approx 3.4 \times 10^{-2}\ \text{e.s.u.} \approx 10\ \text{volts/cm.}$$
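The field amplitude can be recomputed directly from the mean energy flux. The sketch assumes a solar constant of 2 cal./cm.² min. and the conversion 1 e.s.u. of field strength = 300 volts/cm.; the function name is illustrative.

```python
import math

def field_amplitude_esu(solar_constant_cal_per_min=2.0, erg_per_cal=4.19e7, c=3.0e10):
    """Amplitude E0 from the mean flux S = c*E0^2/(8*pi), in e.s.u."""
    flux = solar_constant_cal_per_min * erg_per_cal / 60.0   # ergs/cm^2 sec
    return math.sqrt(8.0 * math.pi * flux / c)

E0 = field_amplitude_esu()
print(f"E0 = {E0:.3f} e.s.u. = {300.0 * E0:.1f} volts/cm")
```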


* Ex. 94, p. 342. Both terms of equation (43) (p. 337) must be taken into account
in the transitional region, so that 

U = [*Vo] g,Uj( ‘- r/ ‘ +i) 


where 



— tan -1 
co 



Hence the phase angle 


is 

4>(«. r) = co (f - ^ 


1 tan-i^). 
g> c or/ 



The wave velocity is that of an observer who moves in such a way that $\Phi$ = const.
Thus, by differentiation, we obtain




or 



♦ 
Ex. 95, p. 354. We have $\theta = 45°$, $\theta'' = 28°\ 7.5'$, $I_s = 0.046\,I_0$, $I_p = 0.0042\,I_0$,
whence P = 83 per cent.
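These figures can be reproduced from Fresnel's formulae. The sketch below assumes unpolarized incident light of intensity I₀ and an index of refraction n = 1.5, the value implied by the stated angle of refraction; both assumptions are inferred rather than quoted from the exercise.

```python
import math

def reflected_polarization(theta_deg, n):
    """Return (theta2_deg, I_s/I0, I_p/I0, P) for unpolarized light reflected at theta."""
    theta = math.radians(theta_deg)
    theta2 = math.asin(math.sin(theta) / n)               # law of refraction
    r_s = (math.sin(theta - theta2) / math.sin(theta + theta2)) ** 2
    r_p = (math.tan(theta - theta2) / math.tan(theta + theta2)) ** 2
    i_s, i_p = 0.5 * r_s, 0.5 * r_p                       # each component carries I0/2
    degree = (i_s - i_p) / (i_s + i_p)
    return math.degrees(theta2), i_s, i_p, degree

theta2, i_s, i_p, P = reflected_polarization(45.0, 1.5)
print(f"theta'' = {theta2:.3f} deg, I_s = {i_s:.4f} I0, I_p = {i_p:.4f} I0, P = {100 * P:.0f} per cent")
```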


Ex. 96, p. 354. We perform the calculation for normal incidence; the
procedure is entirely similar for oblique incidence. In a vacuum, $|H| = |E|$, in
the dielectric $|H| = n|E|$. Also

$$|E'| = \frac{n - 1}{n + 1}\,|E|, \qquad |E''| = \frac{2}{n + 1}\,|E|,$$

so that we have (to a common factor)

$$|S| = E^2, \qquad |S'| = \left(\frac{n - 1}{n + 1}\right)^2E^2, \qquad |S''| = n\left(\frac{2}{n + 1}\right)^2E^2,$$

from which we see that $S = S' + S''$. It is to be noticed that this balance is not
obtained by simply setting $|S|$ proportional to $A^2$ in each medium, since the
factor of proportionality contains $n$.


Ex. 97, p. 356. The quantity

$$\tan\frac{\delta}{2} = \frac{\cos\theta\,\sqrt{\sin^2\theta - n^2}}{\sin^2\theta}$$

must equal unity. In this formula
$n$ signifies the index of refraction for the ray emerging from the medium into air,
i.e. $n < 1$; if we put $n = 1/n'$, $n'$ is the usual index of refraction of the material.
We seek the smallest value of $n'$ for which $\tan(\delta/2)$ can still be equal to unity.
This extremal problem for $n = f(\sin\theta)$ then yields the equation

$$n^2 = \sin^2\theta - \frac{\sin^4\theta}{\cos^2\theta},$$

which is an extremum for $n' = 2.41$.

For two reflections

$$\tan\frac{\delta}{2} = \tan\frac{\pi}{8} = \sqrt{2} - 1 = \frac{\cos\theta\,\sqrt{\sin^2\theta - n^2}}{\sin^2\theta}.$$

For n = 2/3, this gives the two angles 

O^SO^O' and 0 2 = 53° 10'. 
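Both results can be checked by solving the condition for tan(δ/2) as a quadratic in s = sin²θ, namely (1 + t²)s² − (1 + n²)s + n² = 0 with t = tan(δ/2). The sketch below is an independent check under that rearrangement, not the method of the text; the function names are illustrative.

```python
import math

def reflection_angles_deg(n, delta):
    """Angles of incidence giving phase difference delta at total reflection (n < 1)."""
    t2 = math.tan(delta / 2.0) ** 2
    a, b, c = 1.0 + t2, -(1.0 + n * n), n * n      # quadratic in s = sin^2(theta)
    disc = b * b - 4.0 * a * c
    if disc < 0.0:
        return []                                  # delta cannot be reached for this n
    roots = [(-b - math.sqrt(disc)) / (2.0 * a), (-b + math.sqrt(disc)) / (2.0 * a)]
    return [math.degrees(math.asin(math.sqrt(s))) for s in roots]

def smallest_index_for_single_reflection():
    """Smallest n' = 1/n for which tan(delta/2) = 1 can still be attained."""
    s = 1.0 - 1.0 / math.sqrt(2.0)                 # stationary value of sin^2(theta)
    n_squared = s - s * s / (1.0 - s)
    return 1.0 / math.sqrt(n_squared)

print([round(x, 2) for x in reflection_angles_deg(2.0 / 3.0, math.pi / 4.0)])
print(round(smallest_index_for_single_reflection(), 2))
```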

* Ex. 98, p. 376. It follows from considerations of symmetry that the curves 
of equal difference of phase are circles. The position of the planes of vibration 
within the crystal may be found as follows. The wave surface corresponding to 
the ordinary ray is a sphere, and the same is true of the normal surface; hence 
in equation (26) of the text (p. 365) we put = tta for a uniaxial crystal. If we 
take the a;-axis along the optic axis, we have D x — 0 for v = i.e. D is normal 
to the axis. But since we always have $v = v_n$ for the ordinary ray, the plane of
D is always normal to the axis. In addition, D is in a plane which is normal to the wave normal,
i.e. it is determined by the line of intersection of a plane perpendicular to the wave normal and
one perpendicular to the axis. Thus, for all incident rays corresponding to a
given plane of incidence, D is in the same direction, viz. normal to the plane of
incidence. Now, if the plane of incidence is normal to the direction of
transmission of the first Nicol prism, only the ordinary ray will be present in the crystal;
if the incident plane is parallel to the direction of transmission, the ordinary ray
will be absent. Hence the isogyre is a right-angled cross.

Ex. 99, p. 390. If the index diagram be drawn as in fig. 4, p. 383, which
corresponds to this case, the amplitude of the direct image will be $A/2$, where $A$
is the amplitude of the incident light. The amplitude of the first order diffraction
pattern will be $A/\pi$; the orders 2, 4, ... 2n yield zero, since in these instances
the vector diagram is closed for any grating space. The third order, on the other
hand, yields $A/3\pi$; in general, the odd orders have amplitudes $A/(2n + 1)\pi$.
The corresponding intensities are

$$I_0 = \frac{A^2}{4}, \quad I_1 = \frac{A^2}{\pi^2}, \quad I_3 = \frac{A^2}{9\pi^2}, \ \ldots, \quad I_{2n+1} = \frac{A^2}{(2n + 1)^2\pi^2}.$$

The combined intensity of the direct image and the diffraction patterns on both
sides is

$$\frac{A^2}{4} + \frac{2A^2}{\pi^2}\left(1 + \frac{1}{9} + \ldots + \frac{1}{(2n + 1)^2} + \ldots\right) = \frac{A^2}{4} + \frac{2A^2}{\pi^2}\cdot\frac{\pi^2}{8} = \frac{A^2}{2}.$$

Since the intensity of the incident light is given by $A^2$, and since half the energy
is absorbed, the energy balance is maintained. Thus all the diffraction patterns
together have the same intensity as the direct image.


Ex. 100, p. 390. The distance $b$ between the slits is large compared with the
slit width $a$; as a result, the diffraction pattern of the single slit is crossed by
maxima and minima whose separation is given by $(\alpha - \alpha_0)b = h\lambda$. For normal
incidence, $\alpha = h_1\lambda/b$; for almost normal incidence, $\alpha'' = h_2\lambda/b + \varepsilon$. The best
visibility of the fringes is obtained for

$$\frac{h_1\lambda}{b} = \frac{h_2\lambda}{b} + \varepsilon \quad\text{or}\quad b = \frac{(h_1 - h_2)\lambda}{\varepsilon}.$$

If $b$ is changed, two adjoining distances for which the best visibility is obtained
are separated by $\Delta b = \lambda/\varepsilon$. This relation is of use in measuring the distance
between two stars which are close together and in measuring stellar diameters.
In the latter case $\varepsilon$ is approximately the angular distance between the centres
of gravity of the two halves of the stellar disc.

Ex. 101, p. 390. From equation (30) (p. 387) it follows that 

qin ® 171 V H *2 _L 11 *2 _l_ U *2 

2~f06 1 + - + 3 ' 


We limit our considerations to the reflected rays which are normal to the axis 
of rotation; for these H 3 * = 0. Then for 


H_1*   H_2*   n    θ/2

 1      0     1    14° 20'
 1      0     2    29° 30'
 1      0     3    47° 40'
 1      0     4    80° 00'
 1      1     1    20° 20'
 1      1     2    44° 10'
 2      1     1    33° 20'

The ray reflected from the plane $H_1^* = 2$, $H_2^* = 0$ coincides with the second order
reflection from the plane $H_1^* = 1$, $H_2^* = 0$. Similarly, the angles resulting by
interchanging $H_1^*$ and $H_2^*$ coincide with those already contained in the above
list, and so are not set down separately.

Ex. 102, p. 403. It follows from $y' = f\tan\alpha$ that the diameter of the image
is 1.74 cm.

Ex. 103, p. 403. Let the plane in question cut the axis at P and the principal
plane on the object side in the line G. The point P has a corresponding image
point P' related to it by the image equation, while to G there corresponds a parallel
line G' in the principal plane on the image side. G' and P' then determine the
image plane required.

Ex. 104, p. 409. Project the incident and refracted rays on the given plane
and describe a unit sphere about the intersection. The normal to the incident
surface, the rays themselves and their projections on each side of the refracting
plane define a right spherical triangle. Calling the angle of incidence $\alpha$, the angle
of refraction $\alpha''$, the corresponding angles with respect to the given plane $\theta$ and
$\theta''$, and the angle between the given plane and the plane of incidence $\delta$, the
sine law of spherical trigonometry yields $\sin\theta = \sin\alpha\sin\delta$ and $\sin\theta'' = \sin\alpha''\sin\delta$,
whence $\sin\theta/\sin\theta'' = \sin\alpha/\sin\alpha'' = n$.

* Ex. 105, p. 409. To each point of the slit there corresponds a parallel beam,
and each of these beams corresponds to a point in the focal plane of the camera.
Only the rays coming from the centre of the slit pass through the prism in a
direction perpendicular to the edges; in fact, it is usually only the path of these
rays which is reproduced in diagrams. The rays coming from the upper and
lower parts of the slit pass through the prism obliquely, and a computation shows
that their deflection is greater than that of the central rays. The result is that
the spectral lines are curved, the convex side being toward the red. The analysis
is given in H. Kayser's Handbuch der Spektroskopie, Bd. II, p. 260, Leipzig (1900).

Ex. 106, p. 409. Taking account of the signs, the focal lengths of the front 
and back lens surfaces are 



If d is the lens thickness, the interval D = — -f- d -f f z , so that 

nr i r t 


f = J 4' = ~ r-^TTB «d /' = -UK = 

D (n - 1)12 D (n - 1 ) 12 * 


where $R = n(r_2 - r_1) + (n - 1)d$.

Ex. 107, p. 409. Write the sine condition in the form 


sum _ n' y' 
sinu' n y’ 


This relation is, obviously, also valid in the region near the axis, where the sine
and the angle are interchangeable. Consider a ray from an axial point, distant $a$
from the lens, and making an angle $u$ with the axis. Let $h$ be the distance from the
axis at which this ray meets the first surface. Then $\sin u = h/a$. For a very
distant point, $a$ may be replaced by $x$, the distance from the principal plane on the
object side. Furthermore, since $y$ and $y'$ are small, $y'/y$ may be replaced by the
relation $y'/y = -f/x$ which holds for points near the axis. We then have


h 

sinu 



One consequence of this relation is that the largest possible value of the ratio
of the diameter of a beam parallel to the axis to the focal length will be 2, if the
sine condition is to hold. It is more usual to specify the relative aperture of an
objective, which is defined as the reciprocal of the above quantity. Thus the
limiting value of the relative aperture will be "f/0.5".

Ex. 108, p. 410. Let the prism be completely filled with light and in the
position of minimum deviation. According to the principles of geometrical optics, the
optical paths along those parts of the extreme rays included between planes normal
to the beam must be equal. Calling the base of the prism $b$ and the path in air
of the ray through the vertex $l$, we then have

$$nb - 2l = 0.$$

Also, the limitation of the width of the beam is responsible for diffraction such
as would be produced by a slit of the same width. The first minimum of the
diffraction pattern falls in a direction for which the difference of the optical paths
of the two marginal rays is $\lambda$, for then, as in the case of a slit, the two halves of
the beam annul each other. This occurs, then, when

$$nb - 2l = \lambda. \tag{1}$$

If the direct image for a neighbouring wave-length falls in this position, the two
spectral lines can just be separated, and we have

$$nb + b\frac{dn}{d\lambda}\,d\lambda - 2l = 0. \tag{2}$$

Subtraction of equation (1) gives

$$\frac{\lambda}{d\lambda} = -b\,\frac{dn}{d\lambda}.$$
Thus the resolving power of a prism is equal to the product of the base by the 
dispersion of the substance of which it is made. 


Ex. 109, p. 413. By equation (31) (p. 411) the path difference is

$$\frac{2nd}{\cos\alpha'} - 2d\tan\alpha'\sin\alpha = 2d\sqrt{n^2 - \sin^2\alpha}.$$


Ex. 110, p. 424. 

(a) The attraction between two electrons at a distance apart of $10^{-8}$ cm. is

$$f = \frac{(4.80)^2 \times 10^{-20}}{10^{-16}} = 2.30 \times 10^{-3}\ \text{dynes}.$$

(b) The attraction between the ion groups is given by 

/ = <6 02)2 S ^ ‘°"° = 5 3 X 1010 ^ = 54 metH0 tonS - 

(c) The volume density of the charge in a sphere of radius $R$ is

$$\rho = \frac{3F}{4\pi R^3},$$

where F is the Faraday equivalent charge. The work needed to add a shell of
thickness $dr$ and density of charge $\rho$ to the sphere is

$$dW = \frac{4\pi r^3\rho}{3}\cdot\frac{4\pi r^2\rho\,dr}{r} = \frac{16\pi^2}{3}\rho^2r^4\,dr.$$

To build up the entire sphere from widely scattered charges requires an amount
of work

$$W = \frac{16\pi^2\rho^2}{3}\int_0^R r^4\,dr = \frac{16\pi^2\rho^2R^5}{15} = \frac{3}{5}\frac{F^2}{R}.$$

This is the potential energy of the sphere (cf. Ex. 80). If we consider a virtual
expansion $dR$, the energy decreases by

$$\frac{3}{5}\frac{F^2}{R^2}\,dR.$$

This change in energy is equal to the work done by the total pressure in the
virtual expansion:

$$p\,dv = p\cdot 4\pi R^2\,dR.$$

Equating the two quantities,

$$p = \frac{3}{5}\frac{F^2}{4\pi R^4} = \frac{3 \times (9649)^2 \times 9 \times 10^{20}}{5 \times 4\pi \times 10^4} = 4 \times 10^{23}\ \text{dynes/cm.}^2 \approx 4 \times 10^{17}\ \text{atm.}$$
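Parts (a) and (c) reduce to simple arithmetic, which the following sketch repeats. It assumes e = 4.80 × 10⁻¹⁰ e.s.u., F = 9649 e.m.u. (converted to e.s.u. by the factor c = 3 × 10¹⁰), and a sphere of radius 10 cm. as implied by the factor 10⁴ in the final expression; the function names are illustrative.

```python
import math

def coulomb_force(e1, e2, distance):
    """Magnitude of the force between two charges (e.s.u., cm) in dynes."""
    return abs(e1 * e2) / distance**2

def electrostatic_pressure(charge, radius):
    """Outward pressure p = 3*F^2 / (20*pi*R^4) of a uniformly charged sphere."""
    return 3.0 * charge**2 / (20.0 * math.pi * radius**4)

force = coulomb_force(4.80e-10, 4.80e-10, 1.0e-8)
faraday_esu = 9649.0 * 3.0e10                 # Faraday charge in e.s.u.
pressure = electrostatic_pressure(faraday_esu, 10.0)
print(f"f = {force:.2e} dynes")
print(f"p = {pressure:.1e} dynes/cm^2 = {pressure / 1.013e6:.1e} atm")
```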

Ex. 111, p. 431. The electrical deviation after traversing a condenser of length
$a$ is, by equation (8),

$$\Delta_{\rm el} = \frac{Ea^2}{4V}. \tag{1}$$

It follows readily from equation (6) (p. 427) that the slope of
the tangent to the path is

$$\tan\alpha = \frac{dy}{dx} = \frac{Ea}{2V}. \tag{2}$$

If the screen is a distance $l$ cm. from the end of the condenser, the total deflection is

$$\delta = \frac{Ea^2}{4V} + \frac{Eal}{2V} = \frac{Ea}{2V}\left(\frac{a}{2} + l\right) = \tan\alpha\left(\frac{a}{2} + l\right). \tag{3}$$


Ex. 112, p. 431. As long as $\alpha$ is small, $\cos\alpha$ may be set equal to 1. This amounts
to taking the velocity parallel to the field to be the same for all rays. Inasmuch
as the projection of the orbit on a plane normal to the field is a circle, all the
electrons describe helical orbits. It is readily shown that all have the same period
of revolution. From equations (11) and (12) (p. 428), it follows that $m(d^2z/dt^2) = -(e^2H^2/mc^2)z$.
This is the well-known wave equation, for which the period is
given by $T = 2\pi mc/eH$. The speed $v$ does not appear, so that after a given time
all particles have described one revolution and have returned to the axis. The
subject of geometric electron optics, which concerns itself with image formation
by electrons as brought about by electric and magnetic fields, has undergone a
rapid development in recent years.

Ex. 113, p. 455. If there are in all $\nu_1$ molecules of Substance 1 present, the
number of molecules per cubic centimetre is

$$N_1 = \frac{\nu_1\rho_1}{m_1},$$

where $m_1$ is the mass, $\rho_1$ the density of Substance 1. Then we have

$$\frac{4\pi}{3}\nu_1\alpha_1 = \frac{n_1^2 - 1}{n_1^2 + 2}\,\frac{m_1}{\rho_1} = R_1. \tag{1}$$

Similarly, for Substance 2,

$$\frac{4\pi}{3}\nu_2\alpha_2 = \frac{n_2^2 - 1}{n_2^2 + 2}\,\frac{m_2}{\rho_2} = R_2, \tag{2}$$

and for the mixture, with $\nu = \nu_1 + \nu_2$,

$$\frac{4\pi}{3}\nu\bar{\alpha} = \frac{n^2 - 1}{n^2 + 2}\,\frac{m}{\rho} = R. \tag{3}$$

But on account of the linear superposition of the effects of the dipoles, the average
polarizability is calculated according to the simple rule

$$\bar{\alpha} = \frac{\nu_1\alpha_1 + \nu_2\alpha_2}{\nu_1 + \nu_2}.$$

Substituting this in equation (3), it follows that $R = R_1 + R_2$.
Ex. 114, p. 455. The molecular refraction of water is

$$R_m = \frac{n^2 - 1}{n^2 + 2}\cdot 18 = \frac{7}{34}\cdot 18 = 3.7\ \text{cm.}^3,$$

whence

$$r = \sqrt[3]{\frac{3R_m}{4\pi L}} = 1.13 \times 10^{-8}\ \text{cm.}$$

Further, from p. 454,

$$\frac{4\pi}{3}L\alpha = R_m,$$

and, according to p. 284, $\alpha = a^3$, where $a$ is the radius of a metallic sphere. Thus
$a$ is identical with the $r$ calculated above.
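The two numbers in this solution follow from a few lines of arithmetic. The sketch assumes n = 4/3, a molecular weight of 18, unit density and L = 6.02 × 10²³ per mol; the function names are illustrative.

```python
import math

def molecular_refraction(n, molecular_weight, density=1.0):
    """Molecular refraction (n^2 - 1)/(n^2 + 2) * M/rho, in cm^3."""
    return (n * n - 1.0) / (n * n + 2.0) * molecular_weight / density

def molecular_radius(refraction, avogadro=6.02e23):
    """Radius r of the sphere defined by (4*pi/3) * r^3 * L = R, in cm."""
    return (3.0 * refraction / (4.0 * math.pi * avogadro)) ** (1.0 / 3.0)

R_m = molecular_refraction(4.0 / 3.0, 18.0)
print(f"R = {R_m:.1f} cm^3, r = {molecular_radius(R_m):.2e} cm")
```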

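Ex. 115, p. 455. Since $\nu_0 = 0$, we obtain

$$n^2 - 1 = -\frac{Ne^2/m}{\pi\nu^2} = -\frac{C}{\nu^2}, \quad\text{i.e.}\quad u = \frac{c}{n} = \frac{c\nu}{\sqrt{\nu^2 - C}},$$

and

$$\frac{1}{U} = \frac{d}{d\nu}\left(\frac{\nu}{u}\right) = \frac{1}{c}\frac{d\sqrt{\nu^2 - C}}{d\nu} = \frac{\nu}{c\sqrt{\nu^2 - C}}, \qquad U = \frac{c\sqrt{\nu^2 - C}}{\nu}.$$

Thus the phase velocity in the Heaviside Layer is greater than c.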
Ex. 116, p. 457. We have

$$\frac{4\pi}{9}\frac{Np^2}{kT} = \frac{K - 1}{K + 2} - \frac{n^2 - 1}{n^2 + 2} = 0.75.$$

(The value of $n$ extrapolated to $\lambda = \infty$ is about 4/3.) It then follows that
$p = 0.8 \times 10^{-18}$ e.s.u.

This value is too small; the actual value is $1.8 \times 10^{-18}$. The reason for this
discrepancy is association taking place in the liquid state, in which case the
dipoles are no longer independent. The true value is to be obtained by
measurements on gases or, in other cases, on solutions in dipole-free solvents.
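The dipole moment quoted for the liquid can be recovered numerically. The sketch assumes the right-hand side 0.75 given above, room temperature T = 293° K, k = 1.38 × 10⁻¹⁶ erg/deg., and N = L/18 molecules per cm.³ for water of unit density; the temperature and constants are assumptions, not values stated in the exercise.

```python
import math

def dipole_moment(rhs, number_density, temperature, k=1.38e-16):
    """Solve (4*pi/9) * N * p^2 / (k*T) = rhs for p, in e.s.u."""
    return math.sqrt(9.0 * k * temperature * rhs / (4.0 * math.pi * number_density))

N = 6.02e23 / 18.0            # molecules per cm^3, assuming unit density
p = dipole_moment(0.75, N, 293.0)
print(f"p = {p:.1e} e.s.u.")
```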


* Ex. 117, p. 483. For the medium at rest, $n_0 = c/v_0$; further, from equation
(41) (p. 244), the phase velocity in the moving medium, for an observer at rest, is

$$v = \frac{v_0 + u}{1 + \dfrac{v_0u}{c^2}} \approx v_0 + u - \frac{uv_0^2}{c^2},$$

where $u$ is the velocity of the medium ($u \ll c$). Then

$$v = v_0 + u\left(1 - \frac{1}{n_0^2}\right).$$

Ex. 118, p. 494. p is given by equation (6); there results p = 232 atm. 

Ex. 119, p. 500. From the definition of $\chi$, this quantity for an ideal gas is
given by

$$\chi_{\rm id} = \frac{1}{p}.$$

For a van der Waals gas,

$$\chi_{\rm real} = \frac{1}{\dfrac{RTv}{(v - b)^2} - \dfrac{2a}{v^2}} \approx \frac{1}{p}\left(1 - \frac{b}{v} + \frac{a}{pv^2}\right).$$


Ex. 120, p. 510. Setting the coefficient of Av, as given by (34), equal to zero 
and combining with (26), p. 499, we have 

s’j- 

Expressing $\varphi$ in terms of $t$ and substituting in the reduced equation of state,
we obtain the equation of the transition curve:

$$\pi = -27 + 24\sqrt{3}\,\sqrt{t} - 12t.$$

Ex. 121, p. 51[?]. By no means! Electrical energy is equivalent to mechanical
work, and according to the second law one can, by expending work, transfer
heat from a colder body (say a body of water) to a warmer one (a boiler)
(cf. p. 517). If $T_2 = 283°$ and $T_1 = 293°$, then $\eta = 10/293 \approx 1/30$ and $W = Q/29$,
or $Q = 29W$. Thus the thermal effectiveness may be increased, theoretically,
by a factor of almost 30. “Heat pumps” of this kind are coming into use; the
actual efficiency is, of course, less than the figure given.
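The factor quoted in Ex. 121 follows from the Carnot relation alone. The sketch below assumes an ideal reversed Carnot cycle between 283 K and 293 K; the function name is illustrative.

```python
def carnot_heat_pump_factor(t_cold, t_hot):
    """Heat delivered at t_hot per unit of work, for an ideal reversed Carnot cycle."""
    return t_hot / (t_hot - t_cold)

factor = carnot_heat_pump_factor(283.0, 293.0)
print(f"heat delivered per unit work = {factor:.1f}")
```

The ideal figure is 29.3, i.e. "almost 30"; any real machine falls short of it.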

Ex. 122, p. 524. Starting at an arbitrary point, say 0° C., the entropy before
mixing is

$$S_1 = 10\log\frac{373}{273} + 20\log\frac{288}{273}$$

(specific heat of water $c = 1$). After the mixing the temperature is 43.3° C., and

$$S_2 = 30\log\frac{316.3}{273}.$$

In taking the difference, the initial temperature 273 drops out, as it must do,
and we have

$$S_2 - S_1 = 30\log 316.3 - 10\log 373 - 20\log 288 = 0.23\ \text{cal./deg.}$$
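The figures of Ex. 122 can be reproduced as follows. The sketch assumes, as the final formula suggests, 10 g of water at 373 K mixed with 20 g at 288 K, a constant specific heat of 1 cal/(g deg), and natural logarithms; the function name is illustrative.

```python
import math

def mixing_entropy(masses, temperatures, specific_heat=1.0):
    """Final temperature and entropy increase when water samples are mixed."""
    total_mass = sum(masses)
    t_final = sum(m * t for m, t in zip(masses, temperatures)) / total_mass
    delta_s = sum(m * specific_heat * math.log(t_final / t)
                  for m, t in zip(masses, temperatures))
    return t_final, delta_s

t_final, delta_s = mixing_entropy([10.0, 20.0], [373.0, 288.0])
print(f"final temperature = {t_final:.1f} K, entropy increase = {delta_s:.2f} cal/deg")
```

The final temperature of about 316.3 K (43.3° C.) and the entropy increase of about 0.23 cal/deg agree with the answer.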

Ex. 124, p. 524. The areas under the two arcs must be equal, for it must be
impossible to construct a perpetuum mobile of the second kind by traversing this
part of the curve.


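Ex. 125, p. 528. We have

$$T\,dS = dU + \Phi\,de = \left(\frac{\partial U}{\partial T}\right)_e dT + \left(\frac{\partial U}{\partial e}\right)_T de + \Phi\,de. \qquad (1)$$

Comparing coefficients,

$$\left(\frac{\partial S}{\partial e}\right)_T = \frac{1}{T}\left[\left(\frac{\partial U}{\partial e}\right)_T + \Phi\right], \qquad (2)$$

$$\left(\frac{\partial S}{\partial T}\right)_e = \frac{1}{T}\left(\frac{\partial U}{\partial T}\right)_e. \qquad (3)$$

Differentiating equation (2) with respect to $T$, equation (3) with respect to $e$,
we have

$$\frac{1}{T}\left[\frac{\partial^2 U}{\partial e\,\partial T} + \left(\frac{\partial \Phi}{\partial T}\right)_e\right] - \frac{1}{T^2}\left[\left(\frac{\partial U}{\partial e}\right)_T + \Phi\right] = \frac{1}{T}\,\frac{\partial^2 U}{\partial T\,\partial e}, \qquad (4)$$

whence

$$\Phi = -\left(\frac{\partial U}{\partial e}\right)_T + T\left(\frac{\partial \Phi}{\partial T}\right)_e, \quad\text{or}\quad \Phi - T\left(\frac{\partial \Phi}{\partial T}\right)_e = -\left(\frac{\partial U}{\partial e}\right)_T. \qquad (5)$$

If the Faraday equivalent charge $F$ has passed, one gramme equivalent has been
converted and the energy difference is equal to the heat of reaction per mol
divided by the valence $Z$, i.e. equal to $-q_0/Z$. Then

$$\left(\frac{\partial U}{\partial e}\right)_T = -\frac{q_0}{ZF}, \qquad \Phi - T\left(\frac{\partial \Phi}{\partial T}\right)_e = \frac{q_0}{ZF},$$

which is Helmholtz’s equation for the galvanic cell. Observe that $ZF\Phi$ corresponds
to the free energy.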


Ex. 126, p. 528. It follows from the first law, at constant pressure,

$$\delta q = du + p\,dv = d(u + pv) = dh = \left(\frac{\partial h}{\partial T}\right)_p dT,$$

whence

$$c_p = \left(\frac{\partial h}{\partial T}\right)_p.$$

By analogy with equations (34) to (37) on pp. 527-528 it follows that

$$ds = \frac{1}{T}\,(dh - v\,dp).$$

Through comparison and “crosswise differentiation” there results

$$\left(\frac{\partial h}{\partial p}\right)_T = v - T\left(\frac{\partial v}{\partial T}\right)_p.$$

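Ex. 127, p. 530. In equation (38) (p. 528), replace $p$ by $H$ and $v$ by $-I$. The
result is

$$c_H - c_I = -T\left(\frac{\partial H}{\partial T}\right)_I\left(\frac{\partial I}{\partial T}\right)_H,$$

and with the equation of state $I = (C/T)H$, we get

$$c_H - c_I = \frac{IH}{T} = \frac{\chi H^2}{T},$$

where $\chi$ = susceptibility per mol and $c_I$ is the ordinary specific heat.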

* Ex. 128, p. 584. For $T = 0$, i.e. for $x \to \infty$, $x \tanh x - \log\cosh x$ becomes
equal to $\log 2$, and so $S_{\text{mag}} - S_{\text{unmag}} = -R\log 2$. Statistically, $W_{\text{mag}} = 1$, $S_{\text{mag}} = 0$,

so that $S_{\text{unmag}} = kL\log 2 = R\log 2$.

Thus the entropy, in agreement with the above, is an amount R log 2 greater in 
the unmagnetized state than in the magnetized state. 
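The limit used in Ex. 128 is easily confirmed numerically. The sketch only evaluates the function $x \tanh x - \log\cosh x$ for increasing $x$; the function name is illustrative.

```python
import math

def entropy_function(x):
    """x*tanh(x) - log(cosh(x)), which tends to log 2 as x grows."""
    return x * math.tanh(x) - math.log(math.cosh(x))

limit_value = entropy_function(20.0)
for x in (1.0, 5.0, 20.0):
    print(f"x = {x:5.1f}: {entropy_function(x):.6f}")
print(f"log 2     : {math.log(2.0):.6f}")
```

The values rise monotonically towards $\log 2 \approx 0.693$, in agreement with the entropy difference $R \log 2$.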


Ex. 129, p. 608. The energy levels of the rotator are given on p. 681. In
the old quantum mechanics,

$$U_m = \frac{m^2h^2}{8\pi^2 I}. \qquad (1)$$

The number in the $m$th quantum state is

$$N_m = \frac{L\,e^{-m^2h^2/8\pi^2 IkT}}{\sum_m e^{-m^2h^2/8\pi^2 IkT}} = \frac{L\,e^{-m^2x}}{\sum_m e^{-m^2x}}, \qquad (2)$$

where $x = h^2/8\pi^2 IkT$. Remembering that there are two rotational degrees of
freedom, i.e. two axes normal to the axis of figure, the total energy is

$$U_r = 2LkT\,x\,\frac{\sum_m m^2 e^{-m^2x}}{\sum_m e^{-m^2x}}. \qquad (3)$$

A short computation then gives the rotational specific heat:

$$c_r = 2R\,x^2\,\frac{d^2\left(\log\sum_m e^{-m^2x}\right)}{dx^2}. \qquad (4)$$

… mixture of two modifications (ortho- and para-hydrogen), one of which can possess
only even, the other only odd quantum numbers. For details consult Eucken’s
Lehrbuch der chemischen Physik, Edn. 2, II, 1, p. 253 (1943).

* Ex. 130, p. 624. According to p. 616, radiation incident at an angle $\theta$ exerts
a pressure amounting to $p_\theta = (2S/c)\cos^2\theta$ and so in an angular interval $d\theta$,
$dp = 2\pi(2S/c)\cos^2\theta\sin\theta\,d\theta$. Integrating from 0 to $\pi$ yields $p = 8\pi S/3c$. But $S_\nu$
is identical with $K_\nu$ in equation (15), and so $p_\nu = \rho_\nu/3$ and $p = \rho/3$. Now, at
temperature $(T + dT)$, let a piston be pushed forward by radiation pressure of
amount $p + (dp/dT)\,dT$ until the volume increases by $v$; then follow with adiabatic
expansion that lowers the temperature from $(T + dT)$ to $T$. Finally,
perform a compression at temperature $T$. The work done is $v(dp/dT)\,dT$, while
heat energy of amount

$$\left(\rho + \frac{d\rho}{dT}\,dT\right)v + \left(p + \frac{dp}{dT}\,dT\right)v$$

was supplied in the expansion. Neglecting quantities that are small compared
with $\rho$ and $p$, the second law yields

$$\frac{dp}{dT}\,dT = (\rho + p)\,\frac{dT}{T}, \quad\text{or}\quad \frac{d\rho}{dT} = \frac{4\rho}{T}.$$

Integration gives the result

$$\rho = aT^4.$$


* Ex. 131, p. 636. The atomic weight of electrons is 1/1840, hence the chemical
constant is

$$i = -6.48.$$

Written in exothermal form, the reaction is

Ion + Electron = Atom + $q_0$.

The quantity $q_0$ is calculated from the ionization potential $V_i$ according to

$$q_0 = \frac{eV_iL}{4.19 \times 10^7 \times 300} = 23{,}030\,V_i.$$

The chemical constants of the ion and atom are equal; also, the specific heat of
each of the monatomic gases is $5R/2$. If the degree of dissociation (ionization) is
$\alpha$, the partial pressure of the ions is

$$p_i = \frac{\alpha}{1+\alpha}\,p,$$

that of the electrons is

$$p_e = \frac{\alpha}{1+\alpha}\,p,$$

and that of the atoms

$$p_a = \frac{1-\alpha}{1+\alpha}\,p,$$

since, in all, $(1 + \alpha)N$ particles result from $N$ neutral atoms. Then, by equation
(44) (p. 541),

$$\log\frac{\alpha^2}{1-\alpha^2}\,p = -\frac{23{,}030\,V_i}{4.57\,T} + \frac{5}{2}\log T - 6.48.$$

For Ca (ionization potential 6 volts) at 6000° and 1 atm. we obtain $\alpha$ = 8 per
cent; at 6000° and $10^{-2}$ atm., $\alpha$ = 64 per cent. Thus, observation of the relative
intensity of the lines of Ca and Ca + in stellar spectra furnishes us with the possi- 
bility of determining the conditions of temperature and pressure. This was 
first recognized by M. N. Saha, to whom the theory in its simple form is due. 
Consult Handbuch der Astrophysik, Vol. I, Berlin, J. Springer (1929). 
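The degrees of ionization quoted in Ex. 131 can be recomputed with the sketch below. It assumes the relation $\log_{10}[\alpha^2 p/(1-\alpha^2)] = -23{,}030\,V_i/(4.57\,T) + \tfrac{5}{2}\log_{10}T - 6.48$ with $p$ in atmospheres, and takes 6.11 volts for the first ionization potential of calcium (an assumed, more precise value than the rounded "6 volts" of the answer). The function name is illustrative.

```python
import math

def ionization_degree(v_ion, temperature, pressure_atm):
    """Degree of ionization alpha from the ionization equilibrium of Ex. 131."""
    log_k = (-23030.0 * v_ion / (4.57 * temperature)
             + 2.5 * math.log10(temperature) - 6.48)
    ratio = 10.0**log_k / pressure_atm  # equals alpha**2 / (1 - alpha**2)
    return math.sqrt(ratio / (1.0 + ratio))

V_CALCIUM = 6.11  # volts; assumed value, the answer quotes the rounded 6 volts
alpha_1_atm = ionization_degree(V_CALCIUM, 6000.0, 1.0)
alpha_low_pressure = ionization_degree(V_CALCIUM, 6000.0, 1.0e-2)
print(f"1 atm:    alpha = {alpha_1_atm:.1%}")
print(f"0.01 atm: alpha = {alpha_low_pressure:.1%}")
```

With 6.11 volts the results are about 8 and 64 per cent; with the rounded value of 6 volts they come out slightly higher, which shows how sensitive the equilibrium is to the ionization potential.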

* Ex. 132, p. 674. 

LS coupling: $s_1 + s_2$ yields (a) $S = 0$ (singlet system), (b) $S = 1$ (triplet
system); $l_1 + l_2$ yields $L = 1$. (a) gives rise to the term
$^1P_1$, and (b) to the three neighbouring terms $^3P_0$, $^3P_1$, $^3P_2$.

jj coupling: $s_1 + l_1$ yields (a) $j_1 = \tfrac{1}{2}$, (b) $j_1 = \tfrac{3}{2}$; $s_2 + l_2$ yields $j_2 = \tfrac{1}{2}$.
(a) gives rise to two neighbouring terms having $J = 0$ and
$J = 1$, and (b) to two terms close together having $J = 1$
and $J = 2$.

Thus LS coupling produces a grouping of 1 and 3 terms, jj coupling produces 
two pairs of terms. 
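The term counting in Ex. 132 can be reproduced by the vector-addition rule alone. The sketch assumes two electrons with $l_1 = 1$ and $l_2 = 0$ (inferred from $L = 1$ and $j_2 = \tfrac{1}{2}$); the helper name `couple` is illustrative.

```python
from fractions import Fraction

def couple(j1, j2):
    """Resultant quantum numbers |j1 - j2|, ..., j1 + j2 in unit steps."""
    values = []
    j = abs(j1 - j2)
    while j <= j1 + j2:
        values.append(j)
        j += 1
    return values

HALF = Fraction(1, 2)
L1, L2 = Fraction(1), Fraction(0)  # assumed orbital quantum numbers

# LS coupling: first S from s1 + s2 and L from l1 + l2, then J from L + S.
ls_terms = {s: couple(couple(L1, L2)[0], s) for s in couple(HALF, HALF)}

# jj coupling: first j1 from s1 + l1 and j2 from s2 + l2, then J from j1 + j2.
j2_value = couple(HALF, L2)[0]
jj_terms = {j1: couple(j1, j2_value) for j1 in couple(HALF, L1)}

print("LS:", {str(s): [str(j) for j in js] for s, js in ls_terms.items()})
print("jj:", {str(j1): [str(j) for j in js] for j1, js in jj_terms.items()})
```

The LS scheme gives one singlet level and three triplet levels, the jj scheme two pairs, and both schemes contain the same set of $J$ values (0, 1, 1, 2).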

* Ex. 133, p. 674. In the longitudinal Zeeman effect the absorption frequency 
for right-handed circularly polarized light is somewhat different from that for 
left-handed circularly polarized light. The component which rotates in the same 
sense as the current which would cause the magnetic field and which, by p. 663 , 
is displaced toward higher frequencies is designated the right-handed component. 
In the normal effect, 

$$\Delta\nu = \frac{eH}{4\pi mc}, \quad\text{i.e.}\quad \nu_{0,r} = \nu_0 + \Delta\nu, \quad \nu_{0,l} = \nu_0 - \Delta\nu. \qquad (1)$$

Now this displacement of the characteristic frequencies causes a change in the 
refractive index (p. 451); and a difference in refractive index — i.e. in phase 
velocity — for the right- and left-handed rays means, according to p. 65, a rotation 
of the plane of polarization in the direction of turning of the faster-moving wave, 
i.e. the wave with the smaller index of refraction. The angle of rotation is equal 
to half the phase difference, so that for a layer of thickness D we have 

$$\chi = \frac{\delta}{2} = \frac{2\pi\nu D}{2c}\,(n_l - n_r). \qquad (2)$$

According to the dispersion formula in its simplest form

$$n_l - 1 = \frac{Nf_le^2/m}{2\pi[(\nu_0 - \Delta\nu)^2 - \nu^2]}, \qquad n_r - 1 = \frac{Nf_re^2/m}{2\pi[(\nu_0 + \Delta\nu)^2 - \nu^2]}. \qquad (3)$$

At higher temperatures (a condition always satisfied when investigating gases)
both components are of the same intensity, so that $f_l = f_r = f/2$, where $f$ is the
strength of the oscillators for the uninfluenced line. At lower temperatures, at
which the line appears […], one component
may, on account of the Boltzmann distribution of orientations, be considerably
stronger than the other, so that $f_r$ and $f_l$ will be different, and this can cause the
sign of the effect to reverse, giving the so-called “paramagnetic Faraday Effect”.
Considering only the normal case where $f_l = f_r = f/2$, we have

$$n_l - n_r = \frac{Nfe^2}{4\pi m}\left\{\frac{1}{[\nu_0 - \Delta\nu]^2 - \nu^2} - \frac{1}{[\nu_0 + \Delta\nu]^2 - \nu^2}\right\} = \frac{Nfe^2}{4\pi m}\,\frac{4\Delta\nu\cdot\nu_0}{(\nu_0^2 - \nu^2)^2},$$

and so

$$\chi = \frac{Nfe^3\nu\nu_0}{4\pi m^2c^2(\nu_0^2 - \nu^2)^2}\,HD.$$



Thus the rotation is proportional to H. Similarly, we find that in transverse 
observation, on account of the equality of the intensities of the parallel and 
perpendicular components, there is no linear effect but rather a small quadratic 
one. In the same way, the Kerr effect is found to be given as a consequence of the 
Stark effect. 


Ex. 134, p. 743. According to equations (5) and (6) of the text we have, for
a given velocity, $dE/dx$ proportional to $z^2$ but independent of $M$, and $dE$ proportional
to $M$, so that the range will be proportional to $Mz^{-2}$. Thus $R_\alpha = k(4/4)$
and $R_H = k(1.0081/1)$, from which $R_H = 1.008\,R_\alpha$; i.e. for the same velocity the
ranges are nearly equal. But we wish to know, from the ranges, the energies of
the particles. Because the energy, for a given velocity, is proportional to the mass
we have that

$$\frac{E_H}{E_\alpha} = \frac{1.008}{4}, \quad\text{or}\quad E_H = 3.02\ \text{Mev}.$$

This is the proton energy corresponding to a range of 14.53 cm. For heavy particles
of these energies, relativistic effects may be neglected.

Ex. 135, p. 763. (a) $\beta^-$ decay (neutron → proton + $\beta^-$, $Z \to Z + 1$): Nucleus
$Z$ stable if $M_{Z+1} + m_{el} > M_Z$ or, after addition of $Zm_{el}$, $A_{Z+1} > A_Z$.

(b) $\beta^+$ decay (proton → neutron + $\beta^+$, $Z \to Z - 1$): Stability of nucleus $Z$
if $M_{Z-1} + m_{el} > M_Z$ or, after adding $Zm_{el}$, $A_{Z-1} + 2m_{el} > A_Z$.

(c) K-capture (proton → neutron, $Z \to Z - 1$): Stability of nucleus $Z$
if $M_{Z-1} > M_Z + m_{el}$ or, after addition of $(Z - 1)m_{el}$, $A_{Z-1} > A_Z$.

Stability with respect to (a) and (c), hence also for (b), is assured if $A_Z < A_{Z-1}$
and $A_Z < A_{Z+1}$.

Ex. 136, p. 819. Putting $p = \tfrac{1}{2}$ in (31), p. 813, and using (43), p. 818,

$$J_{3/2} = \sqrt{\frac{2}{\pi x}}\left(\frac{\sin x}{x} - \cos x\right).$$
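The closed form for $J_{3/2}$ can be compared with the defining power series of the Bessel function. The sketch assumes the standard series $J_p(x) = \sum_k (-1)^k (x/2)^{2k+p} / [k!\,\Gamma(k+p+1)]$; the function names and the number of terms are illustrative choices.

```python
import math

def bessel_series(order, x, terms=40):
    """Power series for the Bessel function of the first kind J_order(x)."""
    return sum((-1)**k * (x / 2.0)**(2 * k + order)
               / (math.factorial(k) * math.gamma(k + order + 1))
               for k in range(terms))

def bessel_three_halves(x):
    """Closed form sqrt(2/(pi*x)) * (sin(x)/x - cos(x))."""
    return math.sqrt(2.0 / (math.pi * x)) * (math.sin(x) / x - math.cos(x))

sample_points = [0.5, 2.0, 5.0]
comparison = [(bessel_series(1.5, x), bessel_three_halves(x)) for x in sample_points]
for x, (series_value, closed_value) in zip(sample_points, comparison):
    print(f"x = {x}: series {series_value:.10f}, closed form {closed_value:.10f}")
```

For moderate arguments the two expressions agree to within rounding error.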
Abaric point, 846, Ex. 74.

Abbe’s sine law, 403. 

Aberration, 246. 

Absolute electrometer, 292. 

— space, 228. 

— velocity, 230. 

— zero, unattainable, 659. 
Absorbing media, 357. 

Absorption frequency, 453. 
Absorptive power, 620. 
Accelerated reference frame, 231. 
Acceleration, 82. 

Action, 123. 

— at a distance, 286. 

— variable, 129. 

Adiabatic change, 506.

— law, 212.

— lines, 607.

— principle, 662.

Aeromechanics, 191.

Affinor, 33 n. 

Alpha-particle, 016. 

— scattering of, 641. 

Alpha-ray, 746. 

Alternating current, 316.

transient phenomena, 319.

Alvarez, 772.

Ampere’s theorem, 305.
Amplitude, 60.

Anfahrwirbel, 211 n.

Angular frequency, 60.

— momentum, 89.

total, 110. 

Anisotropic substances, 278. 
Ankylosis, 698. 

Antecedent, 36. 

“ Anti-Stokes ” line, 721.
Apparent force, 231. 

Arc spectra, 674 n. 

Argand diagram, 69. 

Associated harmonics, 700, 821. 
Astigmatic difference, 409.
Aston’s isotope rule, 762.

Atled operator, 30 n.


Atom, model of, 647. 

— structure of, 641. 

Atomic mass unit, 750. 

— number, 431, 646. 

— observations, inexactness of, 690. 
Avogadro’s law, 495. 

— number, 420, 572, 598. 

Axes, optical, 369. 

Axis of figure, 163. 

Babinet’s theorem, 380. 

Balmer series of spectral lines, 649. 

Band group, 685. 

— spectra, 680. 

— system, 685. 

Band, 731. 

Bars, longitudinal waves in, 180. 

Beats, 67, 323. 

Bernoulli’s equation, 197, 203. 

Berthelot’s principle, 513. 

Bessel functions, 810. 

asymptotic forms of, 817. 

of second kind, 816. 

of third kind, 817. 

orthogonality of, 815. 

Beta-ray, 745. 

Binormal, 19. 

Biogenetics, fundamental law of, 675. 
Biot-Savart law, 110, 206, 207, 300, 303, 
337, 338. 

Black body, 613, 620, 624.

Bloch and Alvarez, 772. 

Bohr magneton, 665, 744. 

— model of atom, 647. 

— theory, 707. 

Boiling-point, elevation of, 646. 
Boltzmann constant, 452, 693.

Bond, non-polar, 712. 

Born, M., 706. 

Bose-Einstein statistics, 624. 

Boundary layer, 210. 

detachment of, 210, 211 n. 

waves in, 346. 

Boyle and Mariotte, law of, 494, 565. 
Boyle’s law, 212.

Brackett series of spectral lines, 650.
Bragg law, 388.

— method, 389. 

Breakdown, electrical, 436. 
Bremsstrahlung, 743, 779. 

Brewster's law, 353. 

Brownian movement, rotational, 602 n. 
Bulk modulus, 173. 

Bunsen’s effusometer, 842, Ex. 61. 

Calculus of variations, 75. 

Calorie, gramme, 487. 

— large, 487. 

— small, 487. 

Canal rays, 440. 

Doppler effect in, 441.

Canonical transformation, generator of, 
127. 

— transformations, 124. 

Cantilever beam, bending of, 176. 
Capacity, 274. 

Carnot cycle, 514. 

Cathode drop, 437. 

— rays, 426, 427. 

origin of, 440.

Cations, 419, 441. 

Cauchy integral theorem, 72. 
Cauchy-Riemann conditions, 71.

equations, 201. 

Caustic, 406. 

Cavity, 613. 

Central axis, 144. 

Centre of gravity, 109. 

— of inertia, 109. 

— of mass, motion of, 108. 

Characteristic curve of thermionic tube, 

799. 

— functions, 186. 

— radiation, infra-red, 607. 

— values, 185. 

of wave equation, 698.

Charge density, 703. 

— surface density of, 266. 

Chemical affinity, 553. 

— bond, nature of, 712. 

— constant, 541, 636.

of gas, 538, 557. 

— equilibrium, 543. 

— valence, 716. 

Circuits, inductively coupled, 322. 
Circularly polarized waves, 64. 
Circulation, 202, 203, 210, 211. 

Classical quantum statistics, 603. 


Classical statistics, 579, 603.
Clausius-Clapeyron equation, 533. 

Cloud chamber, 740. 

Wilson's, 642. 

Coefficient of emission, 619.

Collinear system, 403.

Combination lines, 660. 

Compensator, 357. 

Complex potential, 201, 209, 210. 

— velocity, 203, 207, 208, 210.

“ Complexion ”, 582.

Compressibility, isothermal, 493.
Compton effect, 687, 690, 691, 743.
Conditionally periodic systems, 132. 
Conducting media, propagation in, 333. 
Conduction, electrolytic, 417. 

— metallic, 444. 

— of electricity in gases, 425.

— of heat, 570, 571. 

in metals, 447. 

Conductivity, 294. 

— electrical, 631. 

— equivalent, 421. 

— surface, 489. 

— thermal, 488. 

— thermometric, 489. 

Conformal mapping, 69, 70. 

Conjugate complex velocity, 201. 

— tensors, 35. 

Consequent, 35. 

Conservation of energy, principle of, 88. 
Conservative forces, 87. 

Constrained particle, 103. 

Contravariant components, 12. 
Convection coefficient, Fresnel's, 477. 
Convergence ratio, 401. 

Co-ordinates, curvilinear orthogonal, 40. 
Coriolis acceleration, 233. 
Correspondence principle, 655. 
Corresponding states, theorem of, 500. 
Cosmic-ray showers, 781. 

Cosmic rays, 780. 

Coulomb, 267. 

Coulomb's law, 266. 

Couple, 144. 

Coupled circuits, 322. 

Coupling, Russell-Saunders, 669. 
Covariant components, 12. 

Crystal, biaxial, 373. 

— cohesive forces in, 574. 

— compressibility of, 577. 

— lattice, 360. 

— uniaxial, 370. 

Crystals, properties of, 573. 
Cubical dilation, 166.

Curie and Joliot, 755, 756. 

— constant, 458. 

— law, 529. 

— point, 460, 461. 

Curl of a vector, 25. 

Current density, 293, 726. 

— strength, 293. 

Curves of equal thickness, 411. 

Cut, 73 n. 

Cyclic variables, 126. 

Cylinder functions. See Bessel functions. 

d’Alembert’s principle, 113, 116. 

Dalton’s law, 496. 

Damping, 320, 348. 

Debye-Hückel theory of electrolytic conductivity, 421, 423.

Decomposition of lithium, 750. 
Deformable solids, mechanics of, 161. 
Degenerate system, 136. 

Degree of degeneracy, 136. 

Del operator, 30 n. 

Delaunay elements, 136. 

Density of charge, 266. 

— of ideal gas, 580. 

Deuteron, 653, 743, 761. 

Dia-electric susceptibility, 280. 
Diamagnetic susceptibility, 463. 
Diamagnetism, 461. 

Diatomic molecule, 685, 605. 

— molecules, rotation spectra of, 702. 
Dielectric constant, 267 n., 277. 

electron theory of, 450. 

— constants, principal, of crystal, 362. 

— displacement, 277. 

— media, 277. 

— polarization, 461, 473. 

Dielectrics, polarization of, 279. 
Diffraction at circular aperture, 390. 

— at slit, 382, 390. 

— optical, 696. 

— reciprocal theorems of, 379. 

— theory of, 376. 

Diffuse series, 660. 

Diffusivity, 489. 

Dilatation, Einstein time, 243. 

Diopter, 403. 

Dipole, 272. 

Discord, 99. 

Dispersion, 66, 461, 716. 

— anomalous, 463. 

— region of, 413. 

Displacement current, 327, 328, 476.


Divergence of a vector, 23. 

— theorem, 24. 

Domains, magnetic, 462.

Doppler effect, 235, 245, 691.

Double layer, potential of, 273.

Doublet, 273. 

— magnetic, 298. 

Droplets, 263. 

Dulong-Petit law of atomic heats, 598. 
Dyad, 35. 

Dyadic products, 36. 

Dynamics of rigid bodies, 143. 

Earth’s horizontal magnetic field, in- 
tensity of, 297. 

— pole, wandering of, 155. 

Eccentric anomaly, 137. 

Efficiency of heat engine, 517. 
Effusometer, Bunsen’s, 842. 
Eigenfunktion, 186 n. 

Eigenwert, 186 n.

Einstein-de Haas effect, 111.

anomaly in, 669. 

— time dilatation, 243. 

Einstein’s velocity addition theorem, 244, 
245. 

Elastic potential, 175, 176. 

Elasticity, 161. 

Elastodynamics, 161. 

Elastomers, 803. 

Elastostatics, 161. 

Electret, 473. 

Electrical charge as source of flux, 264. 

— oscillator, 338. 

Electrocaloric phenomena, 628. 
Electrodynamics of moving bodies, 471. 
Electrolytic theory of Arrhenius, 422. 
Electromagnetic equations, invariant form 

of, 480. 

— waves, 360. 

in conducting media, 333. 

in two media, 343. 

propagation of, 328. 

Electrometer, absolute, 292.

Electron, 426, 429. 

— mass, variability of, 476. 

— optics, 443, 787. 

— revolving, 463. 

— spinning, 669. 

Electrons, secondary, 433. 

— thermal emission of, 433. 
Electrophoretic force, 423.

Electrostatic potential, 267.

Elementary cell, 674. 
Ellipsoid, Fresnel, 363.

— index, 363, 366. 

— of stress, 169. 

Ellipsoid, tensor, 363. 

Elliptic motion, left-handed, 59. 

right-handed, 59. 

Elliptically polarized waves, 64. 
Emissivity, 489, 619. 

Enclosure, 613. 

Endothermal reaction, 540. 

Energy, 85. 

— bands, 725. 

— density, 175, 287, 616. 

— distribution, 588. 

— function of a gas, 503. 

— inertia of, 252. 

— internal, 627. 

Enthalpy, 509. 

— free, 525. 

Entrance pupil, 785. 

Entropy, 522, 524, 540, 556, 579. 

— constant, 636. 

Equation of state, 493, 527. 

of ideal gas, 494. 

of real gas, 497. 

Equations of motion, canonical form, 122. 
Equilibrium, chemical, 539. 

— of disintegration products, 746. 

— of sedimentation, 589. 

— of thermodynamic systems, 531. 

— thermodynamic, 619. 

Equipotentials, 201, 202. 

Ether, 259. 

Euler formula, 60. 

Euler-Lagrange differential equation, 75,
77.

Euler-Mascheroni constant, 809.

Eutectic points, 634. 

Exchange energies, 712. 

— forces, 712. 

Excitation potential, 649. 

Exit pupil, 785. 

Expansion, isobaric volume coefficient of, 
493. 

External forces, 108. 

“ Falling characteristic ” of arc, 440. 
Faraday effect, 674, Ex. 133. 

paramagnetic, 863, Ex. 133. 

— equivalent charge, 554. 

Fermat’s principle, 395, 407. 

Fermi-Dirac statistics, 625, 631. 

Fermi gas, 769. 

Ferromagnetic materials, 305, 306. 


Ferromagnetic media, 343. 

— metals, 299, 460. 

Field equations, Hertz’s solution of, 335.

— strength, electrical, 263.

Fields, quasi-stationary, 311.

— rapidly alternating, 327. 

— slowly varying, 311. 

“ Fine structure ”, 009. 

First law of thermodynamics, 495. 
Fission of heavier nuclei, 767. 

Fizeau effect, 245. 

Fizeau’s experiment, 477. 

Fleming’s left-hand rule, 309. 

Flow, irrotational, 196, 197. 

— laminar, 214. 

— rotational, 196. 

Flux, 264. 

Focal length, 399. 

of electron lens, 703. 

— plane, 396. 

Focus, principal, 420. 

Force, 84. 

— moment of, 110. 

Forced vibrations, 96. 

Fortrat, 684. 

Foucault pendulum, 235, Ex. 65. 
Four-dimensional world, 247. 

Four-force, 256. 

Four-velocity, 256. 

Fourier integral, 54, 56. 

— series, 54, 101. 

multiple, 132. 

Fraunhofer diffraction phenomena, 382. 
Free charge, 277. 

— energy, 525. 

— oscillations, 320. 

— path, 438. 

Freezing-point, lowering of, 546. 
Frequency, 50. 

Fresnel diffraction phenomena, 382. 
Fresnel's formulae, 350, 352. 

Frictional resistance, 94. 

Funicular polygon, 145. 

Galilean transformation, 229, 230. 
Galvanometer, moving-coil, 309. 

Gamma functions, 808. 

— -ray, 745. 

Gas constant, 693. 

Gauss plane, 69. 

Gauss’s theorem, 23, 270. 

Gay-Lussac’s experiment, 504. 

— law, 494. 

Geiger-Müller counter, 741.
Geiger-Nuttall formula, 747.

Generalized co-ordinates, 118.

— momentum co-ordinates, 12[?].
Geodesic, 76. 

Geometric velocity, 245. 

Giorgi system of units, 266. 

Glancing angle, 389. 

Goudsmit and Uhlenbeck, 669. 

Gradient, 21. 

Gravitational mass, 259. 

Green’s theorem, 270. 

Grid, 434. 

Ground wave, 346. 

Group velocity, 65, 245, 694. 
Gyrocompass, 159. 

Gyroscope, 152. 

Hahn and Strassmann, 754. 

Half-value period, 746. 

Hall effect, 734. 

Halley’s formula, 192. 

— law, 689. 

Hamilton-Jacobi differential equation,
126, 127, 130.

Hamilton’s characteristic function, 128.

— equations, 121.

— Principle, 123, 124, 693.

Hankel functions, 817. 

Harmonic analyser, 65 n. 

— vibrations, 93. 

Heat conduction, differential equation of,
488.

— engine efficiency, 517. 

— function, 609. 

— of fusion, 539. 

— of sublimation, 539. 

— of vaporization, 536. 

— specific, 487. 

— theory of, 484. 

Heaviside layer, 466, 868, Ex. 116. 
Heisenberg, 706. 

Heitler and London, 712. 
Helmholtz-Lagrange equation, 406.
Helmholtz’s vortex theorems, 205.
Herpolhode, 141, 142.

Herschel effect, 738.

Hertzian oscillator, 336. 

— vibrator, 338. 

Hess’s law, 512. 

Hittorf’s relation, 420.

Hodograph, 81.

Holonomous-rheonomous system, 121.
Holonomous systems, 114.


Hooke’s law, 171. 

Huygens’ formula, 376. 

— principle, 368. 

Hydrodynamical equations, 196. 
Hydromechanics, 191. 

Hydrostatics, 191. 

Hyperbolic orbits, 702. 

Hyperfine structure, 743. 

Hysteresis loop, 461. 

Image distance, 399. 

— space, 397. 

Images, electrical, 275. 

Impedance, 318. 

Impulse, 86. 

Incidence, principal angle of, 357.
Index (of complex number), 60. 
Indicator diagram, 516. 

Induction, 264, 473. 

— law, 311. 

— magnetic, 298. 

Inertia, 83. 

— of energy, principle of, 258. 

— product of, 149. 

Inertial force, 231. 

— frames, 229. 

— mass, 84, 264, 259. 

— resistance, 167. 

Insulating media, 349. 

Intensity of magnetization, 298. 

— of wave, 724. 

Interference fringes, 396, 410. 

— instruments, 410. 

Interferometer, Fabry-Pérot, 411, 412.
Internal forces, 108. 

Ion, 418. 

“ Ion cloud ”, 550. 

Ionization bursts, 765. 

— photo-, 432. 

— potential, 432, 649. 

— thermal, 432. 

Irrotational flow, 196, 198. 

— region, 197. 

Isobares, 759.

Isogyre, 375, 853, Ex. 98. 

Isometric pressure coefficient, 493, 557. 
Isothermal compressibility, 493. 

— lines, 498. 

— volume elasticity, 493. 

Isotope of hydrogen, 743. 

Isotopes, 746. 

Isotopic mass, 765 n. 

Isotropic substance, 171, 278.





Jacobian function, 107. 

Joule heat loss, 296. 

Kater's pendulum, theory of, 151.

Kelvin temperature scale, 518. 

Kepler ellipse, 656, 659. 

Kepler’s first law of planetary motion, 91. 

— second law, 89. 

— third law, 92. 

Kerr effect, 864, Ex. 133. 

Kinematics, 81. 

Kinetic energy, 86, 112, 113. 

— theory of matter, 563. 

Kirchhoff’s formula, 376, 378. 

— laws, 294, 620. 

Kutta-Joukowski lift formula, 208, 210. 

Lagrange’s equations of the first kind, 
113, 117. 

of the second kind, 118, 120. 

Lambert’s law, 617. 

Laminar flow, 214. 

Langevin function, 463. 

Laplace’s equation, 71, 269. 

Larmor frequency, 772. 

— precession, 664. 

Lattice constant, 574. 

— defects in dielectric crystals, 735. 

— energy, 576. 

Laue-London theory, 466. 

Law of areas, 89. 

— of equipartition of energy, 595. 

— of gravitation, 89, 90. 

— of mass action, 539. 

Le Chatelier’s law, 542 n. 

Left-handed elliptic motion, 59. 

Legendre functions of second kind, 824. 

— polynomials, 699, 820. 

orthogonality of, 822. 

Level surfaces, 21. 

Light quantum, 432. 

— principal velocities of, in crystal, 
362. 

Line integral, 22, 72. 

— of nodes, 136, 137. 

— spectra, 680. 

Linearly polarized waves, 64. 

Lines of force, 264. 

— of motion, 194. 

Liouville’s theorem, 585. 

Liquid films, 227. 

Lissajous figure, 68, 60, 132. 

Local time, 265. 

Logarithmic decrement, 96, 100. 


Longitudinal mass, 476. 

— waves, 64. 

in bars, 180. 

Lorentz contraction, 243, 244. 

— transformation, 240, 249, 250. 

consequences of, 242. 

— triplet, 667. 

Loschmidt’s number. See Avogadro's
number.

μ-mesons, 779.

Macro-state, 627. 

Magnetic domains, 462. 

— lens, 431, Ex. 112. 

— permeability, electron theory of, 450.

— reluctance, 308. 

Magnetization by rotation, 464. 
Magnetized shell, 303. 

Magnetocaloric phenomena, 528. 
Magnetomechanical parallelism, 464. 
Magnetomotive force, 308. 

Magneton, Bohr, 459, 665, 744. 

— Weiss, 459. 

Magnetostatic field, 297. 

Magnification, angular, 401. 

— in depth, 400. 

— transverse, 399. 

Magnus effect, 211. 

Malus, law of, 397. 

Manometer method, 290. 

Mass action, law of, 422, 539, 542. 

— correction, 765 n. 

— defect, 765 n. 

— number, 765 n. 

— of photon, 688. 

— spectrograph, 441, 442. 

— variability of, 252. 
Matrix calculus, 46.

Mattauch’s laws, 761. 

Matter waves, 693. 

Maxwell-Boltzmann law, 588. 

Maxwell’s equations, 311, 312, 467. 

— relation, 331. 

Mayer’s equation, 505. 

Mean anomaly, 137. 

— free path, 567. 

— relative velocity, 566. 

Media, anisotropic, propagation in, 360. 
Meissner and Ochsenfeld, 466. 

Meitner, 778. 

Melting-point curve, 534. 

Membranes, 183. 

Mesons, 778. 

Metals, optics of, 357. 
Metals, properties of, 725.

— reflecting power of, 359. 

Michelson-Gale experiment, 478. 

Michelson-Morley experiment, 237, 477.

— stellar interferometer, 375, 853, Ex. 98. 
Micro-state, 627. 

Millikan oil-drop method, 425, 426. 
Mobility of an ion, 419. 

Modulated vibrations, 57.

Modulus of compression, 173. 

— of elasticity, adiabatic, 508. 

isothermal, 508. 

Molecular diameter, 572. 

Moment of deviation, 155. 

— of inertia, 146-9.

principal, 150. 

— of torsion, 147, 297. 

Momentum, 84. 

Monogenic function, 69. 

Moseley’s law, 655. 

Moving bodies, electrodynamics of, 471.
Moving coil, induction in, 472. 

Multiple products, 15. 

Multiplets, theory of, 669. 

Multiply periodic systems, 128. 

Muscular action model, 805. 

Mutual inductance, 313. 

— induction, 313. 

coefficient of, 313. 

Nabla operator, 30. 

Nernst heat theorem, 553. 

Neumann function, 830, Ex. 26. 

Neutrino, 778. 

Neutron, 762, 767. 

— mass of, 763. 

Newton (unit), 266. 

Newton’s laws of motion, 84. 

— third law, 108. 

Nicol prism, 374, 375. 

Non-conservative forces, 88. 

Non-divergent field, 302. 

Non-irrotational motion, 198 n.

Normal acceleration, 82. 

— surface, 368. 

Nuclear attraction, 764, 765.

— electrons, objections to, 767. 

— isomerism, 763. 

— magneton, 744. 

— moment, 743, 744, 772. 

— physics, 739. 

Nucleus, 431, 645, 646. 

— artificial transformation of, 747. 

— excitation of, 747. 


Nucleus, liquid-drop model of, 763. 

— motion of, 651. 

Nutation, 154. 

— cone of, 158. 

— of force-free top, 158.

Object distance, 399. 

— space, 397. 

Oersted (unit), 298. 

Ohm’s law, 293. 

for electrolytes, 417. 

for metals, 446, 447. 

Onnes, H. K., 466. 

Onsager theory of electrolytic conductivity, 421, 423.

Operator ∇, 30.

Optical path, 395. 

— systems, resolving power of, 409. 
Optics, electron, 443. 

— geometrical, 395. 

— interference, 395. 

Orbits, electronic, perturbation of, 662. 

— hyperbolic, 702. 

Orthogonality, 189. 

Ortho-hydrogen, 862, Ex. 129. 

Orthotomic system, 264. 

Oscillations of plasma, 801. 

Oscillator, stabilization of, 794. 

Osculating plane, 19. 

Osmotic pressure, 547, 594. 

π-mesons, 779.

Packing fraction, 765.

Parabolic velocity profile, 216.
Para-electric susceptibility, 280, 450.
Para-hydrogen, 862, Ex. 129. 
Paramagnetic, 450. 

Parelectric susceptibility, 450. 

Partial pressures, Dalton’s law of, 496. 
Paschen-Back effect, 667, 743. 

Paschen series of spectral lines, 650. 
Paschen’s law, 436. 

Pauli and Sommerfeld, electron theory of,
449. 

— principle, 632, 679, 715. 

Pedal-surface, 370. 

Perihelion, 136, 137, 260. 

Periodic system, structure of, 674.

— systems, 128.

Permeability, 298.

— magnetic, electron theory of, 450.
Permittivity, 267. 

Perpetuum mobile, of first kind, 602. 
Perpetuum mobile, of second kind, 617,
619, 621, 859, Ex. 123. 

Perturbation, 731. 

— of electron orbits, 662. 

Perturbations, theory of, 707. 

Phase, 493. 

— change of, in reflection, 355. 

— difference, 51. 

— integral, 130. 

— orbit, 604. 

— space, 684. 

— velocity, 694. 

Phosphorescence, 736. 

Photo-electrons, 433. 

Photon, 688. 

Pickering series of spectral lines, 653. 
Piezoelectric constants, 792. 

— moduli, 793. 

Piezoelectricity, 792.

Planck’s law of radiation, 621, 622. 

— quantum of action, 432. 

Plane circulatory motion, 207. 

— mathematical pendulum, 104. 

— wave, 62. 

Planetary motion, 90. 

Plasma, oscillations of, 801. 

Point charge, 276. 

Poiseuille’s law, 214, 215. 

Poisson’s equations, 268. 

— ratio, 171. 

Polarization, 280, 284, 350, 528.

— dielectric, 451, 473. 

— electrical, 450. 

— elliptical, in reflection, 355. 

— linear, in reflection, 355. 

— magnetic, 460. 

— vector, 460. 

Polhode, 141, 142. 

Polygon of forces, 146. 

Ponderomotive forces, 285. 

Porous-plug experiment, 509. 

Positive rays, 440. 

Positron, 752, 754, 757. 

Potential, 87, 88. 

— barriers, 721. 

— elastic, 175, 176. 

— energy, 113, 114.

— excitation, 432, 649.

— flow, 198.

— Gibbs, 625, 627, 532 n., 541.

— retarded, 340.

— thermodynamic, 524, 526.

Power factor, 319.

— of a lens, 403.


Poynting vector, 331, 332, 356. 
Precession, 157. 

— cone of, 158. 

— Larmor, 664. 

— of axis of earth, 158. 

Pressure, radiation, 613. 

Primary cosmic rays, 780. 

Principal extensions, 167. 

— normal, 19. 

— plane, 399. 

— point, 399. 

— rays, 786. 

— series, 660. 

— stress, 169. 

Principle of correspondence, 655. 

— of uncertainty, 628 n. 

Prism, Nicol, 374, 375. 

— resolving power of, 856, Ex. 108. 

— spectrograph, 409, Ex. 105, 855. 
Product of inertia, 149. 

Projectiles, deviation of, 839, Ex. 53. 
Propagation in anisotropic media, 360. 
Proper functions, 186. 

— time, 255. 

— values, 185. 

of wave equation, 698. 

Proton, 646, 757. 

Pupil, entrance, 785. 

— exit, 785. 

Quantum, light, 432. 

— of action, Planck’s, 432. 

— statistics, classical, 603. 

“ Quantum-mechanical square ”, 683.
Quartz vibrator, 797. 

Quasi-elastic forces, 93. 

Rabi, 772. 

Radiation pressure, 613. 

Radioactivity, 745. 

— artificial, 756. 

— induced, 756. 

Radius of gyration, 148 n. 

Raman effect, 689. 

in wave mechanics, 716. 

Raoult’s law, 546. 

Ray axes, 371. 

— surface, 369, 370. 

Rayleigh waves, 182. 

Reactance, 317. 

Rectifier, 434.

Reference frame, uniformly rotating, 232. 
Reflecting plane, 387.

— power, 354. 

Reflection, optical law of, 349. 

Refraction and anisotropic medium, 371. 

— electron theory of index of, 450. 

— external conical, 373. 

— index of, 331. 

— internal conical, 373. 

— molecular, 455. 

— optical index of, 451. 

— optical law of, 349. 

Refractivity, 456. 

Relative aperture, 855, Ex. 107.
Relativistic mechanics, 228. 

Relativity, generalized theory of, 258. 
Reluctance, magnetic, 308. 

Residue, 74. 

Resistance, 293. 

Resistivity, 294. 

Resolution, limit of, 410. 

Resolving power, 384, 409. 

— — of prism, 856. 

Resonance, 96, 99, 319. 

— curve, 99, 102. 

— energies, 754, 762. 

Retarded potential, 340. 

Reversible process, 503. 

Revolving electrons, 463. 

Reynolds’s number, 215. 
Richardson-Einstein-de Haas effect, 464. 
Right-handed elliptic motion, 69. 

“ Ritz term ”, 659. 

Röntgen and Eichenwald, experiments of, 475. 

Rotating earth, free fall on, 231. 

Rotation of a vector, 27 n. 

Rotation-vibration bands, 682. 

Rowland’s experiment, 474. 

Rutherford, 747. 

Rydberg number, 651. 

“ Rydberg term ”, 659. 

Sagnac’s experiment, 478. 

Sargent’s formula, 747. 

Saturation, 461. 

— current, 433. 

Scalar, 7. 

— product, 10. 

Scattering power, 719. 

Schmidt limits, 774. 

Schmidt-Schiller curves, 774. 

Schrödinger, 706. 

— equation, 696, 722, 726, 727, 774. 
Scintillation method, 740. 


Scleronomous systems, 114. 

Screw motion, 142. 

Second law of thermodynamics, 514. 
Secondary cosmic rays, 780. 
Self-inductance, 314. 

Self-induction, 313, 314. 

— coefficient of, 314. 

Shadow, geometric, 382. 

Sharp series, 660. 

Shear modulus, 173. 

Shell integral, 14. 

Signal velocity, 66. 

Simple harmonic vibration, 60. 

— pendulum, 103. 

Simultaneity, 242. 

Sine condition, 785. 

Single-ray direction, 373. 

Singular point, 73. 

Skin effect, 324. 

Smoke rings, 206. 

Snell’s law, 350. 

Soap bubble, 227. 

Solenoidal field, 289. 

Solid state, 607. 

Sound waves, 212. 

Space curves, 17. 

— relativistic conception of, 240. 
Spark discharge, 433, 436. 

— spectra, 674 n. 

Specific heat, 503, 529, 595, 606. 

— — of solids, Debye’s theory, 608. 

— — rotational, 861, Ex. 129. 

— inductive capacity, 267 n., 276. 

— resistance, 294. 

Spectra, arc, 674. 

— rotation, 702. 

Spectral lines, 719. 

— — intensity relations of, 703. 

Spectrograph, prism, 409, 855. 
Spectrum of alkalis, 657. 

— of hydrogen, 652. 

— of ionized helium, 652. 

— order of, 384. 

— X-ray, 663. 

“ Sphere of influence ”, 566. 
Spherical condenser, 274. 

— harmonics, 699, 819. 

— — relations between, 820. 

— wave, 63. 

Spin of electron, 669. 

Spontaneous magnetization, 462. 
Standing waves, 67. 

Stark effect, 475, 662. 

Starting vortex, 211. 
Statistics, Fermi-Dirac, 625, 631. 

— quantum, classical, 603. 
Stefan-Boltzmann law, 624. 

Steiner’s theorem, 148. 

Stern and Gerlach, experiments of, 665. 
“ Stokes ” line, 721. 

Stokes resistance, 418, 419, 425. 

Stokes’s law, 218. 

— — of fluorescence, 721. 

— theorem, 25, 72, 203, 204. 

Strain, 162. 

— ellipsoid, 171. 

— tensor, 171. 

Stream function, 200. 

— lines, 194, 201. 

Stress, 167. 

— tensors, 171. 

Strings, transverse vibration of, 183. 
Subordinate series, first, 660. 

— — second, 660. 

Superconductivity, 466-70, 635. 
Superposition of plane waves, 68. 

Surface brightness, 614. 

— curl, 345. 

— invariant, 404. 

— tension, 222, 223. 

— waves, 182. 

— zonal harmonics, 699, 820. 

Surfaces of discontinuity, 43. 
Susceptibility, 280, 450, 453. 

— diamagnetic, 463. 

— ferromagnetic, 458. 

— paramagnetic, 458, 669. 

— parelectric, 280, 455. 

Swinne formula, 747. 

Symbols in nuclear physics, 748 n. 
Systems of particles, 108. 

Tangent vector, 18. 

Tangential acceleration, 82. 

Taylor’s series, 76. 

Temperature, critical, 499. 

— scale, thermodynamic, 518. 

Tensor analysis, fundamental principles of, 32. 

— antisymmetric, 36. 

— ellipsoid, 37. 

— invariants of, 172. 

— symmetric, 36. 

Thermal radiation, 613. 

Thermionic tube, characteristic curve, 799. 

Thermochemistry, 511. 

Thermodynamic equilibrium, 547. 


Thermodynamic potential, 524, 526. 
Thermodynamics, first law of, 501. 

— second law of, 514. 

— third law of, 553. 

Thomson’s theorem, 204, 205, 211. 
Timbre, 187. 

Time, relativistic conception of, 240. 
Top, spherical, 152. 

— symmetrical, 153. 

— theory of the, 151. 

Torricelli’s theorem, 841, Ex. 60. 
Torsion, 20. 

Total heat, 509. 

— reflection, 354. 

Transport numbers, 420. 

Transverse mass, 476. 

— waves, 64. 

Trouton and Noble, experiment of, 475. 
“ Tunnel effect ”, 747. 

Turbulence, 215, 216. 

Turbulent motion, 198 n. 

Unbounded elastic media, waves in, 180. 
Uncertainty principle, 628 n. 

Undulatory propagation, 212. 

Unital stress, 167. 

Unit plane, 399. 

— vector, 9. 

Units, C.G.S., 266, 330. 

— M.K.S., 266, 330. 

Uranium 1, 721. 

Valve, electrical, 434. 

Van der Waals’ equation, 498. 

Van’t Hoff’s law, 547. 

Vapour pressure curve, 534. 

Variation, 78. 

Variations, calculus of, 75. 

Vector, components of, 8. 

— concept of, 7. 

— diagram, 52. 

— differentiation of, 17. 

— gradient, 36. 

— multiplication by scalar, 8. 

— polarization, 450. 

— potential, 302. 

— product, 12. 

Vectors, addition and subtraction of, 8. 

— axial, 16. 

— equality of, 8. 

— polar, 15. 

Velocity, ray, 364. 

— resonance, 99 n. 

— vector, 81. 
Vibration, plane of, 351. 

Virtual displacement, 114. 

— displacements, principle of, 113. 
Viscosity, 608, 671. 

— coefficient of, 214. 

Viscous fluids, 214. 

Vortex filament, 205, 206. 

— lines, 205. 

— motion, 203. 

— ring, 206. 

— strength, 205, 206. 

— tube, 205, 206. 

Wattless current, 319. 

Wave equation, 328. 

— — proper values, 698. 

— normal, 371. 

— surface, 368, 369, 370. 

— velocity, 61 n., 245. 

Wave-length, 62. 

“ Wave-length, De Broglie ”, 695. 
Waves, 61. 


Waves, elliptically polarized, 64. 

— linearly polarized, 64. 

Weiss law, 529. 

Wiedemann and Franz, law of, 447, 449. 
Wien’s displacement law, 622. 

— experiment, 474. 

Wilson’s cloud chamber, 642. 

Wireless telegraphy, 341. 

Work, 85. 

Work of escape, 431. 

World-tensor, antisymmetric, 251. 

World vectors, 247, 250. 

Young’s modulus, 171. 

Yukawa’s theory, 778. 

Zeeman effect, 663, 671, 742. 

— — anomalous, 667. 

— — longitudinal, 863, Ex. 133. 

Zone lens, 394. 

— plate, 390. 